S4: Distributed Stream Computing Platform

Yahoo! Labs, 2010
FARZAD NOZARIAN, MAZAHER BAZARI
Cloud Computing
CEIT@AUT 12/22/2014

/* Who we are! */
Farzad Nozarian
fnozarian@aut.ac.ir
Big Data Processing And Mining
Mazaher Bazari
mbazari@aut.ac.ir
Mobile Cloud Computing

What is S4?
• Simple Scalable Streaming System
• Inspired by the MapReduce model
S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform for processing continuous, unbounded streams of data.

Motivation
• Real-time search
• High-frequency trading
• Social networks

“Cost-per-click” billing model
• Render the most relevant ads in an optimal position on the page
• Process thousands of queries per second
• Include user preferences from context:
  • Recent user activity
  • Geographic location
  • Prior queries
  • Prior clicks

Reinvent the Wheel!
• Extending the open-source Hadoop platform to support computation of unbounded streams
But Hadoop isn’t suitable!
• The Hadoop platform is highly optimized for batch processing
• MapReduce systems typically operate on static data by scheduling batch jobs

Real world systems!
[Figure: a data stream cut into fixed-size segments, Partition #1 to Partition #N]
Latency is proportional to:
• The length of the segment
• The overhead of segmenting the stream and initiating the processing jobs

Design goals
• Simple API
• Scale using commodity hardware
• Minimize latency by using local memory in each processing node

Design goals (cont.)
• Decentralized and symmetric architecture
• Pluggable architecture
• Science friendly

S4 Model
• Avoids the use of shared memory across the cluster
• Distributed operation on commodity hardware
• Actors model (G. Agha, Actors: A Model of Concurrent Computation in Distributed Systems)

S4 Model (cont.)
• Computation is performed by Processing Elements (PEs)
• Messages are transmitted between them in the form of data events
• The state of each PE is inaccessible to other PEs
• Event emission and consumption is the only mode of interaction between PEs
• The framework provides the capability to route events to the appropriate PEs and to create new instances of PEs

Design Assumptions
• Lossy failover is acceptable!
• Nodes will not be added to or removed from a running cluster!

Design: Example
• What is the task?
Continuously produce a sorted list of the top K most frequent words across all documents, with minimal latency.

Design: Example (cont.)
[Figure: event flow of the top-K word count example]
• A keyless event (EV) arrives at PE1 with a quote:
  EV Quote, KEY null, VALUE quote="I meant what I said and I said what I meant." (Dr. Seuss)
• QuoteSplitterPE (PE1) counts unique words in the quote and emits an event for each word:
  EV WordEvent, KEY word="said", VALUE count=2
  EV WordEvent, KEY word="i", VALUE count=4
• WordCountPE (PE2-4) keeps total counts for each word across all quotes and emits an event any time a count is updated:
  EV UpdatedCountEv, KEY sortID=2, VALUE word="said" count=9
  EV UpdatedCountEv, KEY sortID=9, VALUE word="i" count=35
• Partial top-K lists travel as events:
  EV PartialTopKEv, KEY topk=1234, VALUE words={w:cnt}
• MergePE (PE8) combines the partial top-K lists and outputs the final top-K list
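The first step of the example, splitting a quote into per-word counts, can be sketched in plain Java. This is only an illustration: `QuoteSplitterSketch` and `countWords` are hypothetical names, not the S4 API, and the tokenization rule (lowercase, split on anything that is not a letter or apostrophe) is an assumption.

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for QuoteSplitterPE; not the S4 API.
class QuoteSplitterSketch {
    // Counts each distinct word in a quote, ignoring case and punctuation.
    static Map<String, Integer> countWords(String quote) {
        Map<String, Integer> counts = new TreeMap<>();
        String normalized = quote.toLowerCase(Locale.ROOT);
        for (String token : normalized.split("[^a-z']+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        Map<String, Integer> counts =
            countWords("I meant what I said and I said what I meant.");
        // One WordEvent per distinct word: KEY word=<word>, VALUE count=<n>.
        counts.forEach((word, n) ->
            System.out.println("EV WordEvent KEY word=" + word + " VALUE count=" + n));
    }
}
```

For the Dr. Seuss quote this yields count=4 for "i" and count=2 for "said", matching the WordEvent values in the example.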

Design: Processing Elements
Each PE instance is identified by:
• Functionality
• Types of events it consumes
• Keyed attribute of those events
• Value of the keyed attribute
Example: EV WordEvent, KEY word="said", VALUE count=2

Design: Processing Elements (cont.)
• Keyless PEs
  • No keyed attribute or value
  • Consume all events of the type with which they are associated
  • Typically used at the input layer of an S4 cluster, where events are assigned a key
  • Example input: EV Quote, KEY null, VALUE quote="I …"
• Standard PEs
  • Count
  • Aggregate
  • Join
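A keyed PE exists once per distinct value of its keyed attribute, and the framework creates instances on demand. The sketch below illustrates that idea only; `KeyedPeContainer`, `WordCounter`, and `KeyedPeDemo` are hypothetical names, and a real S4 container does considerably more.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

// Hypothetical container: one PE instance per distinct key value.
class KeyedPeContainer<P> {
    private final Supplier<P> prototype;
    private final Map<String, P> instances = new HashMap<>();

    KeyedPeContainer(Supplier<P> prototype) {
        this.prototype = prototype;
    }

    // Returns the instance bound to keyValue, creating it on first use.
    P instanceFor(String keyValue) {
        return instances.computeIfAbsent(keyValue, k -> prototype.get());
    }

    int instanceCount() {
        return instances.size();
    }
}

// Hypothetical keyed PE: keeps a running total for a single word.
class WordCounter {
    private int total;

    int add(int count) {
        total += count;
        return total;
    }
}

class KeyedPeDemo {
    public static void main(String[] args) {
        KeyedPeContainer<WordCounter> container =
            new KeyedPeContainer<>(WordCounter::new);
        container.instanceFor("said").add(2);
        container.instanceFor("i").add(4);
        int saidTotal = container.instanceFor("said").add(2);
        System.out.println("said=" + saidTotal
            + ", instances=" + container.instanceCount());
    }
}
```

Because each instance holds state for exactly one key value, no state is shared between instances, which is consistent with the rule that PEs interact only through events.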

Design: Processing Node
• Processing Nodes (PNs) are the logical hosts to PEs
• They are responsible for:
  • Listening to events
  • Executing operations on the incoming events
  • Dispatching events with the assistance of the communication layer
  • Emitting output events

Design: Processing Node (cont.)
[Figure: layers of a processing node]
• Processing Node: Event Listener, Processing Element Container (PE1, PE2, …, PEn), Dispatcher, Emitter
• Communication Layer: Routing, Load Balancing, Failover Management, Transport Protocols
• ZooKeeper
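The slide lists routing among the communication layer's duties without saying how a destination is chosen. One common approach, assumed here purely for illustration, is to hash the value of the keyed attribute so that every event with the same key reaches the same node. `HashRouterSketch` and `nodeFor` are hypothetical names.

```java
// Hypothetical router; the hashing scheme is an assumption, not taken from the slides.
class HashRouterSketch {
    // Maps a keyed attribute value to a node index in [0, nodeCount).
    static int nodeFor(String keyValue, int nodeCount) {
        if (nodeCount <= 0) {
            throw new IllegalArgumentException("nodeCount must be positive");
        }
        // floorMod keeps the result non-negative even for negative hash codes.
        return Math.floorMod(keyValue.hashCode(), nodeCount);
    }

    public static void main(String[] args) {
        int nodes = 4;
        for (String word : new String[] {"said", "i", "meant"}) {
            System.out.println(word + " -> node " + nodeFor(word, nodes));
        }
    }
}
```

A scheme like this depends on a fixed node count, which fits the design assumption that nodes are not added to or removed from a running cluster.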

Programming Model
• High-level programming paradigm
• Generic
• Reusable
• Configurable
• Java programming language

Programming Model (cont.)
• Create a PE
  • Inherit from AbstractPE
  • Implement processEvent()
  • Implement output() (optional)
• Define multiple processEvent() overloads, one per event type (polymorphism)
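The shape of a PE described above can be sketched as follows. This is not the real S4 base class: `SketchPE`, `WordEvent`, `ResetEvent`, and `WordCountSketchPE` are hypothetical, and calling `output()` by hand stands in for whatever the framework does to trigger it.

```java
// Hypothetical event types; real S4 events are application-defined classes.
class WordEvent {
    final String word;
    final int count;

    WordEvent(String word, int count) {
        this.word = word;
        this.count = count;
    }
}

class ResetEvent { }

// Hypothetical stand-in for the PE base class named on the slide.
abstract class SketchPE {
    // Optional hook; subclasses override it only if they publish results.
    void output() { }
}

class WordCountSketchPE extends SketchPE {
    private int total;
    private final StringBuilder sink = new StringBuilder();

    // One overload per consumed event type.
    void processEvent(WordEvent event) {
        total += event.count;
    }

    void processEvent(ResetEvent event) {
        total = 0;
    }

    @Override
    void output() {
        sink.append("total=").append(total).append('\n');
    }

    // Exposes what output() has written so far.
    String emitted() {
        return sink.toString();
    }
}
```

Keeping event handling in `processEvent` and publication in `output` separates how state is updated from when results are emitted.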

Configuration
[Figure: PE1 to PE5 assembled into an application graph through configuration]

Streaming Click-Through Rate Computation
• CTR = (number of clicks) / (number of impressions)
• Two types of events:
  • Serve event: a search result page is returned to the user
  • Click event
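The CTR ratio can be maintained incrementally as serve and click events arrive. A minimal sketch, assuming each serve counts as one impression; `CtrCounterSketch` is a hypothetical name, and returning 0 before the first impression is a choice made here, not something the slides specify.

```java
// Hypothetical running CTR counter for a single key (for example, one ad).
class CtrCounterSketch {
    private long impressions;
    private long clicks;

    void onServe() {
        impressions++;
    }

    void onClick() {
        clicks++;
    }

    // CTR = clicks / impressions; reported as 0 until the first impression.
    double ctr() {
        return impressions == 0 ? 0.0 : (double) clicks / impressions;
    }

    public static void main(String[] args) {
        CtrCounterSketch counter = new CtrCounterSketch();
        for (int i = 0; i < 4; i++) {
            counter.onServe();
        }
        counter.onClick();
        System.out.println("CTR = " + counter.ctr());
    }
}
```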

Streaming Click-Through Rate Computation (cont.)
• A serve event contains:
  • serveID
  • query
  • user
  • ads
  • …
• A click event contains:
  • click information
  • serveID
• A set of heuristic rules eliminates suspicious serves and clicks

Event Flow of CTR Computation
[Figure: events flowing through PE1 to PE4]
• Raw input (keyless):
  EV RawServe, KEY null, VALUE serve data
  EV RawClick, KEY null, VALUE click data
• Keyed by serve:
  EV Serve, KEY serve=123, VALUE serve data
  EV Click, KEY serve=123, VALUE click data
• Joined, keyed by user:
  EV JoinedServe, KEY usr=Peter, VALUE joined data
  EV JoinedClick, KEY usr=Peter, VALUE joined data
• Filtered:
  EV FilteredServe, KEY g-ad=Ipod-78, VALUE joined data
  EV FilteredClick, KEY g-ad=Ipod-78, VALUE joined data
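A click carries only click information and a serveID, so it has to be matched with its serve before it can be re-keyed by user. The join step might look roughly like this; `ServeClickJoinSketch` is a hypothetical name, and keeping only the user field in memory is a simplification.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical join of click events with serve events on serveID.
class ServeClickJoinSketch {
    // serveID -> user; only the field needed for re-keying is kept.
    private final Map<String, String> userByServeId = new HashMap<>();

    void onServe(String serveId, String user) {
        userByServeId.put(serveId, user);
    }

    // Returns the user of the matching serve, or empty if no serve was seen.
    Optional<String> onClick(String serveId) {
        return Optional.ofNullable(userByServeId.get(serveId));
    }

    public static void main(String[] args) {
        ServeClickJoinSketch join = new ServeClickJoinSketch();
        join.onServe("123", "Peter");
        join.onClick("123").ifPresent(user ->
            System.out.println("EV JoinedClick KEY usr=" + user));
    }
}
```

A click with no matching serve returns an empty result here; what to do with such clicks is left open in this sketch.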

Apache S4
[Figure: ZooKeeper, Cluster 1 (Node1, Node2), and an application repository (Repo)]
[Figure: an S4 app in the repository and Cluster 1, with four numbered steps]
[Figure: Cluster 2 (Node1) alongside Cluster 1, with three numbered steps]

Apache S4: Commands
• s4 <command> <options>

Command      Purpose
newApp       Create a new application
zkServer     Start a ZooKeeper server
newCluster   Define an S4 cluster
s4r          Package an application
deploy       Deploy/configure an application
node         Start an S4 node
status       Get information about S4 infrastructure

Apache S4: Failover
• High availability
• State recovery
Both are provided while preserving low processing latency.

Apache S4: Failover (cont.)
[Figure: ZooKeeper with active nodes (Node 1 to Node 4) and standby nodes; panels: High Availability, State Recovery]

Apache S4: Failover (cont.)
• A PE can recover a previous state
• Checkpoints are taken periodically (uncoordinated and asynchronous)
• State is recovered lazily
[Figure: inside an S4 node, a keyed message reaches PE 1 and PE 2; checkpoint framework hooks write to a storage backend]
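Periodic checkpointing combined with lazy recovery can be sketched as below. All names are hypothetical, the storage backend is an in-memory map, and checkpoints are written synchronously every N events per key for brevity, whereas the slides describe asynchronous, uncoordinated checkpoints.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical in-memory stand-in for the storage backend.
class CheckpointStoreSketch {
    private final Map<String, Integer> saved = new HashMap<>();

    void save(String key, int state) {
        saved.put(key, state);
    }

    Integer load(String key) {
        return saved.get(key);
    }
}

// Hypothetical node holding one counter per key.
class RecoverableCountersSketch {
    private final CheckpointStoreSketch store;
    private final int checkpointEvery;
    private final Map<String, Integer> live = new HashMap<>();
    private final Map<String, Integer> sinceCheckpoint = new HashMap<>();

    RecoverableCountersSketch(CheckpointStoreSketch store, int checkpointEvery) {
        this.store = store;
        this.checkpointEvery = checkpointEvery;
    }

    int onEvent(String key, int count) {
        // Lazy recovery: state is loaded only when a key's first event arrives.
        int current = live.computeIfAbsent(key, k -> {
            Integer restored = store.load(k);
            return restored == null ? 0 : restored;
        });
        int updated = current + count;
        live.put(key, updated);
        // Periodic checkpoint: here, every checkpointEvery events per key.
        if (sinceCheckpoint.merge(key, 1, Integer::sum) >= checkpointEvery) {
            store.save(key, updated);
            sinceCheckpoint.put(key, 0);
        }
        return updated;
    }
}
```

Updates made after the last checkpoint are lost when a replacement node takes over, which is the lossy failover the design assumptions accept.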

Summary
• S4: Simple Scalable Streaming System
• Design
  • Processing Elements
  • Processing Nodes
  • Communication Layer
• Programming Model
• Apache S4
  • Deployment
  • Failover
