Scalability AOSA Presentation

Scalable Web Architecture and
Distributed Systems

Gloria Chow, Jacqui Killow, Peter Moon, Jessica Wong

Introduction to Large-Scale Web Systems
 The design of large-scale web systems is
influenced by six key principles:
 Availability
 Performance
 Reliability
 Scalability
 Manageability
 Cost

Introduction to Large-Scale Web Systems
 Four core factors are central to almost all large
web applications:
 Services
 Redundancy
 Partitions
 Failure Handling

Core Factors of Large Web
Systems
 Services
 Increase scalability by separating services so that
each service has its own function
 Interactions outside of the service’s context are
handled by an abstract interface
 This is called “Service-Oriented Architecture (SOA)”

 Redundancy
 The server needs to be able to create redundant
copies of data in case of data loss

Core Factors of Large Web
Systems
 Partitions
 When performance degrades, more resources
should be added
 Systems can:
 scale vertically: add more resources to the server
(e.g. increasing capacity)
 scale horizontally: add more servers

 Failure Handling
 System should be able to recover from errors and
faults

Achieving Fast and Scalable Data Access
 By providing fast access to data, services also
become faster and more scalable.

 There are many options available for increasing
the speed of data access:
 Caches
 Proxies
 Indexes
 Load Balancers
 Queues

Caches
 Similar to short-term memory
 A limited amount of storage space
 Only contains the most recently-accessed items
 Faster than accessing the original data source
 Used in all levels of architecture
 Most commonly found near the front end

 Two types of cache: Global vs. Distributed

Global Cache
 Involves adding a server or a file store that is
faster than the original storage and accessible
by all request nodes
 Request nodes query the cache in the same way
they would query a local storage
 There are two ways a global cache can be set up

Two types of Global Cache:
1. Cache is responsible for retrieval of
data.

Request For ea ch
I f t he da t a wa s
Node r equest , t he
not in t he ca che,
r equest node
t he ca che wil l
wil l check t he
pul l t he da t a
ca che f ir st .
f r om t he or igin
Request a nd t hen keep it
Node f or f ut ur e
r equest s.
Gl oba l Cache Dat a
Request
Node

Request
Node

Two types of Global Cache:
2. Nodes are responsible for retrieval of data.
1. For ea ch
r equest , t he
r equest node wil l 3. When da t a is
check t he gl oba l r et r ieved f r om
ca che f ir st . Gl oba l Ca che t he or igin, it ca n
Request be a dded t o t he
Node ca che.

Request
Node
Da t a
Request
Node

2. I f t he da t a wa s not in t he ca che,
t hen t he Request Node wil l
Request r et r ieve it f r om t he or igin.
Node

Distributed Cache
 Each node owns Request Node
The node checks t he
part of the cached Cache A cache based on an
it em key obt ained by
data using a hash
al gor it hm and t hen
Request Node t he dat a or igin.
 Uses a hashing
function so nodes Cache B

can quickly find Dat a
Request Node
the data they
need Cache C

 Storage space
Request Node
can easily be
increased just by Cache D

adding more
nodes

Proxies
 Coordinates
requests from Request A Request
multiple clients and Node
relays them to the
backend Request A
Request A Request Data
 Can speed up data
Node
access by
grouping similar
requests and
Request A Request Proxy
treating it as one Node Server
(“collapsed-
forwarding”)

Indexes
 A table that stores Index Memor y
the location of data Locat ion 0
Dat a Locat ion A
 Stored in memory
or somewhere local A 0 Locat ion 1 B- par t 1
to incoming B- par t 2
B 1 B- par t 3
requests Locat ion 2
 Increases speed of C 2 C
data reads Locat ion 3
D 3
 Data writes are D
slower because the
index must be
updated for each
read

Load Balancers
 Distributes Request
Node
load across a Request A Request B
set of nodes
responsible
for handling Request B
Load Bal ancer
(LB)
Request C
Request
requests Node

 Algorithms
Request C
determine Request A

how to to
distribute Request
Node
requests

Queues
Queue
 Incoming
requests are Task 1

added to a Running Task
queue and are Task 2
Task 1
taken off when
Task 3
the server has
Task 2
processed it
Task 4
 Allows clients
to work Task 5
asynchronousl
y

Summary
 The six principles that govern the design of large
web systems are: availability, performance,
reliability, scalability, manageability, and cost.
 Services, redundancy, partitions, and failure
handling are core factors common in almost all
large web systems.
 Caches, Proxies, Indexes, Load Balancers and
Queues are all used to increase the speed of
data access. Multiple methods can be used
together in one web system.

Scalability AOSA Presentation

Recommended

Recommended

More Related Content

Similar to Scalability AOSA Presentation

Similar to Scalability AOSA Presentation (20)

Recently uploaded

Recently uploaded (20)

Scalability AOSA Presentation

Editor's Notes