This document summarizes windowing in Apache Flink stream processing. It covers the main window types, such as count-based and time-based windows, key concepts like event time versus processing time, and the use of watermarks to handle out-of-order data under event-time semantics. It also compares Flink with other stream processing systems and discusses the tradeoffs between event time and processing time.
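As a concrete illustration of the windowing and watermark concepts summarized above, here is a minimal, framework-free Python sketch (this is not Flink code; the function name and the `(timestamp, value)` event format are illustrative assumptions) showing how tumbling event-time windows can buffer out-of-order events and fire once a watermark passes a window's end:

```python
from collections import defaultdict

def tumbling_event_time_windows(events, window_size, max_out_of_orderness):
    """Assign events to tumbling event-time windows and emit a window's
    count once the watermark (max seen timestamp minus the allowed
    out-of-orderness) passes the window's end.
    `events` is an iterable of (timestamp, value) pairs in arrival order."""
    windows = defaultdict(list)   # window start -> buffered values
    emitted = {}                  # window start -> final count
    max_ts = float("-inf")
    for ts, value in events:
        start = (ts // window_size) * window_size
        if start in emitted:
            continue              # too late: this window already fired
        windows[start].append(value)
        max_ts = max(max_ts, ts)
        watermark = max_ts - max_out_of_orderness
        # Fire every window whose end the watermark has passed.
        for s in [s for s in windows if s + window_size <= watermark]:
            emitted[s] = len(windows.pop(s))
    # End of stream: fire all remaining windows.
    for s, vals in windows.items():
        emitted[s] = len(vals)
    return emitted
```

Here the watermark is simply the maximum event time seen so far minus an allowed out-of-orderness, so a slightly out-of-order event (say timestamp 7 arriving after 11) still lands in its correct window, while an event arriving after its window has fired is dropped — mirroring the default late-data behavior of event-time windowing.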
2018-04 Kafka Summit London: Stephan Ewen - "Apache Flink and Apache Kafka fo..." (Ververica)
Learn how the combination of Apache Kafka and Apache Flink is making stateful stream processing even more expressive and flexible to support applications in streaming that were previously not considered streamable.
The new world of applications and fast data architectures has broken up the database: Raw data persistence comes in the form of event logs, and the state of the world is computed by a stream processor. Apache Kafka provides a strong solution for the event log, while Apache Flink forms a powerful foundation for the computation over the event streams.
In this talk we discuss how Flink’s abstraction and management of application state have evolved over time and how Flink’s snapshot persistence model and Kafka’s log work together to form a base to build ‘versioned applications’. We will also show how end-to-end exactly-once processing works through a smart integration of Kafka’s transactions and Flink’s checkpointing mechanism.
Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'... (Ververica)
Stream Processing is emerging as a popular paradigm for data processing architectures, because it handles the continuous nature of most data and computation and gets rid of artificial boundaries and delays.
The fact that stream processing is gaining rapid adoption is also due to more powerful and maturing technology (much of it open source at the ASF) that has solved many of the hard technical challenges.
We discuss Apache Flink's approach to high performance stream processing with state, strong consistency, low latency, and sophisticated handling of time. With such building blocks, Apache Flink can handle classes of problems previously considered out of reach for stream processing. We also take a sneak preview at the next steps for Flink.
Single-Pass Graph Stream Analytics with Apache Flink (Paris Carbone)
A presentation motivating graph stream processing as a paradigm for large-scale complex analytics and gelly-streaming, our new framework based on Apache Flink.
Data Stream Analytics - Why they are important (Paris Carbone)
Streaming is cool and it can help us do quick analytics and make profit but what about tsunamis? This is a motivation talk presented at the SeRC Big Data Workshop in Sweden during spring 2016. It motivates the streaming paradigm and provides examples on Apache Flink.
Flink Forward Berlin 2017: Kostas Kloudas - Complex Event Processing with Fli... (Flink Forward)
Pattern matching over event streams is increasingly being employed in many areas, including financial services and clickstream analysis. Flink, as a true stream processing engine, emerges as a natural candidate for these use cases. In this talk, we will present FlinkCEP, a library for Complex Event Processing (CEP) based on Flink. At the conceptual level, we will see the different patterns the library can support, present the main building blocks we implemented to support them, and discuss possible future additions that will further enhance the coverage of the library. At the practical level, we will show how the integration of FlinkCEP with Flink allows the former to take advantage of Flink's rich ecosystem (e.g. connectors) and its stream processing capabilities, such as support for event-time processing, exactly-once state semantics, fault tolerance, savepoints, and high throughput.
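The "followed-by" pattern at the heart of CEP libraries like FlinkCEP can be sketched in a few lines of framework-free Python (this is not the FlinkCEP API; the function name and predicates are hypothetical): match a first event followed by a second one within a time window.

```python
def match_followed_by(events, first_pred, next_pred, within):
    """Find pairs (a, b) where first_pred(a) holds, next_pred(b) holds,
    and b occurs after a within `within` time units (a 'followed-by'
    pattern). Events are (timestamp, payload) tuples in arrival order."""
    pending = []   # candidate 'first' events still inside their window
    matches = []
    for ts, payload in events:
        # Drop candidates whose matching window has expired.
        pending = [(t, p) for t, p in pending if ts - t <= within]
        if next_pred(payload):
            matches.extend(((t, p), (ts, payload)) for t, p in pending)
        if first_pred(payload):
            pending.append((ts, payload))
    return matches
```

For instance, with events `[(0, 'login_fail'), (2, 'login_fail'), (3, 'alert')]`, a first predicate matching `'login_fail'`, a next predicate matching `'alert'`, and a window of 5, both failures pair with the alert; an alert at time 20 would match nothing, since both candidates have expired by then.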
As more and more organizations and individual users turn to Apache Flink for their streaming workloads, there is growing demand for additional functionality out of the box. On one hand, there is demand for more low-level APIs that allow for more control; on the other, users ask for more high-level additions that make the common cases easier to express. This talk will present the new concepts added to the DataStream API in Flink 1.2 and the upcoming Flink 1.3 release that try to reconcile these goals. We will talk, among other things, about the ProcessFunction, a new low-level stream processing primitive that gives the user full control over how each event is processed and can register and react to timers; changes in the windowing logic that allow for more flexible windowing strategies; side outputs; and new features concerning the Flink connectors.
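To give a feel for the per-event-plus-timers programming model that a ProcessFunction enables, here is a simplified, framework-free Python sketch (an illustration of the idea, not Flink's actual API; the function name, event format, and inactivity-detection use case are assumptions): each event refreshes a per-key timer, and a key whose timer fires without a newer event is reported as inactive.

```python
def process_with_timers(events, timeout):
    """Per-key processing with event-time timers: every (timestamp, key)
    event (re)registers an inactivity timer for its key; when event time
    passes a key's timer without a newer event, the key is reported as
    inactive. Returns a list of (fire_time, key) reports."""
    timers = {}    # key -> registered timer timestamp
    reports = []

    def advance(now):
        # Fire all timers due at or before `now`, in timestamp order.
        for key, t in sorted(timers.items(), key=lambda kv: kv[1]):
            if t <= now:
                reports.append((t, key))
                del timers[key]

    for ts, key in events:
        advance(ts)                  # fire timers due before this event
        timers[key] = ts + timeout   # (re)register the key's timer
    advance(float("inf"))            # end of stream: fire remaining timers
    return reports
```

For example, with events `[(0, 'a'), (1, 'b'), (3, 'a'), (10, 'c')]` and a timeout of 5, key `'a'` is refreshed at time 3 so its timer fires at 8, while `'b'` fires at 6 and `'c'` at 15 once the stream ends.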
Cloud Dataflow - A Unified Model for Batch and Streaming Data Processing (DoiT International)
Dataflow is a unified programming model and a managed service for developing and executing a wide range of data processing patterns including ETL, batch computation, and continuous computation. Cloud Dataflow frees you from operational tasks like resource management and performance optimization.
Dataflow - A Unified Model for Batch and Streaming Data Processing (DoiT International)
Batch and Streaming Data Processing and Visualize 300TB in 5 Seconds meetup on April 18th, 2016 (http://www.meetup.com/Big-things-are-happening-here/events/229532500)
Stream processing with Apache Flink - Maximilian Michels, Data Artisans (Evention)
Apache Flink is an open source platform for distributed stream and batch data processing. At its core, Flink is a streaming dataflow engine which provides data distribution, communication, and fault tolerance for distributed computations over data streams. On top of this core, APIs make it easy to develop distributed data analysis programs. Libraries for graph processing or machine learning provide convenient abstractions for solving large-scale problems. Apache Flink integrates with a multitude of other open source systems like Hadoop, databases, or message queues. Its streaming capabilities make it a perfect fit for traditional batch processing as well as state of the art stream processing.
The AMIDST toolbox is software for analyzing large-scale data sets using probabilistic machine learning models. AMIDST runs algorithms in a distributed fashion for learning and inference in a wide spectrum of latent variable models such as Gaussian mixtures, (probabilistic) principal component analysis, Hidden Markov Models, Kalman Filters, Latent Dirichlet Allocation, etc. The toolbox is able to perform Bayesian parameter learning on any user-defined probabilistic (graphical) model with billions of nodes using novel distributed message passing algorithms.
We give an overview of the AMIDST toolbox (Java open source), some details about the API and the integration with Flink, and an analysis of the scalability of our learning algorithms. All this in the context of a real use case scenario in the financial domain (BCC group), where the profile of millions of customers is analyzed using Flink and the Amazon Web Services.
Ana M Martinez - AMIDST Toolbox: Scalable probabilistic machine learning with... (Flink Forward)
http://flink-forward.org/kb_sessions/amidst-toolbox-scalable-probabilistic-machine-learning-with-flink/
In this session we would like to present our AMIDST toolbox for analysis of large-scale data sets using probabilistic machine learning models. AMIDST runs algorithms in a distributed fashion for learning and inference in a wide spectrum of latent variable models such as Gaussian mixtures, (probabilistic) principal component analysis, Hidden Markov Models, Kalman Filters, Latent Dirichlet Allocation, etc. This toolbox is able to perform Bayesian parameter learning on any user-defined probabilistic (graphical) model with billions of nodes using novel distributed message passing algorithms.
We plan to give an overview of the AMIDST toolbox (Java open source), some details about the API and the integration with Flink, and an analysis of the scalability of our learning algorithms. All this in the context of a real use case scenario in the financial domain (BCC group), where the profile of millions of customers is analyzed using Flink and the Amazon Web Services.
Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das (Databricks)
“In Spark 2.0, we have extended DataFrames and Datasets to handle real-time streaming data. This not only provides a single programming abstraction for batch and streaming data, it also brings support for event-time based processing, out-of-order/delayed data, sessionization and tight integration with non-streaming data sources and sinks. In this talk, I will take a deep dive into the concepts and the API and show how this simplifies building complex “Continuous Applications”.” - T.D.
Databricks Blog: "Structured Streaming In Apache Spark 2.0: A new high-level API for streaming"
https://databricks.com/blog/2016/07/28/structured-streaming-in-apache-spark.html
// About the Presenter //
Tathagata Das is an Apache Spark Committer and a member of the PMC. He’s the lead developer behind Spark Streaming, and is currently employed at Databricks. Before Databricks, you could find him at the AMPLab of UC Berkeley, researching datacenter frameworks and networks with professors Scott Shenker and Ion Stoica.
Follow T.D. on -
Twitter: https://twitter.com/tathadas
LinkedIn: https://www.linkedin.com/in/tathadas
Flink Forward SF 2017: Kenneth Knowles (Flink Forward)
Apache Beam lets you write data pipelines over unbounded, out-of-order, global-scale data that are portable across diverse backends including Apache Flink, Apache Apex, Apache Spark, and Google Cloud Dataflow. But not all use cases are pipelines of simple "map" and "combine" operations. Beam's new State API adds scalability and consistency to fine-grained stateful processing, all with Beam's usual portability. Examples of new use cases unlocked include: * Microservice-like streaming applications * Aggregations that aren't natural/efficient as an associative combiner * Fine control over retrieval and storage of intermediate values during aggregation * Output based on customized conditions, such as limiting to only "significant" changes in a learned model (resulting in potentially large cost savings in subsequent processing) This talk will introduce the new state and timer features in Beam and show how to use them to express common real-world use cases in a backend-agnostic manner.
On September 21st, we had the pleasure of hosting at our offices a Meetup given by our colleague Paco Guerrero on the Apache Flink platform.
"Apache Flink is an open source real-time processing platform that is on the rise because it offers features that competing technologies lack, without an impact on performance. In this session we introduce the philosophy and processing engine that make Flink so special and powerful. We also walk through the basic pillars that establish Flink as the most promising streaming platform today."
Describes some differences and similarities between Apache Flink and Apache Storm, and gives an introduction to Flink's compatibility layer, which allows running Storm topologies in Flink and embedding spouts and bolts in Flink streaming programs.
Time Series Analysis… using an Event Streaming Platform (Confluent)
Time Series Analysis… using an Event Streaming Platform, Mirko Kämpf, Solutions Architect, Confluent
Meetup Link: https://www.meetup.com/Apache-Kafka-Germany-Munich/events/272827528/
Time Series Analysis Using an Event Streaming Platform (Dr. Mirko Kämpf)
Advanced time series analysis (TSA) requires specialized data preparation procedures to convert raw data into useful, compatible formats.
In this presentation you will see some typical processing patterns for time series based research, from simple statistics to reconstruction of correlation networks.
The first case is relevant for anomaly detection and for protecting safety.
Reconstruction of graphs from time series data is a very useful technique to better understand complex systems like supply chains, material flows in factories, information flows within organizations, and especially in medical research.
With this motivation we will look at typical data aggregation patterns. We investigate how to apply analysis algorithms in the cloud. Finally we discuss a simple reference architecture for TSA on top of the Confluent Platform or Confluent cloud.
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart... (Globus)
The Earth System Grid Federation (ESGF) is a global network of data servers that archives and distributes the planet’s largest collection of Earth system model output for thousands of climate and environmental scientists worldwide. Many of these petabyte-scale data archives are located in proximity to large high-performance computing (HPC) or cloud computing resources, but the primary workflow for data users consists of transferring data and applying computations on a different system. As part of the ESGF 2.0 US project (funded by the United States Department of Energy Office of Science), we developed pre-defined, on-demand data workflows capable of applying many data reduction and data analysis operations to the large ESGF data archives, transferring only the resultant analysis (e.g., visualizations, smaller data files). In this talk, we will showcase a few of these workflows, highlighting how Globus Flows can be used for petabyte-scale climate analysis.
More Related Content
Similar to Feeding a Squirrel in Time---Windows in Flink
Large Language Models and the End of Programming (Matt Welsh)
Talk by Matt Welsh at Craft Conference 2024 on the impact that Large Language Models will have on the future of software development. In this talk, I discuss the ways in which LLMs will impact the software industry, from replacing human software developers with AI, to replacing conventional software with models that perform reasoning, computation, and problem-solving.
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code (Aftab Hussain)
Understanding variable roles in code has been found to be helpful by students in learning programming -- could variable roles help deep neural models in performing coding tasks? We do an exploratory study.
- These are slides of the talk given at InteNSE'23: The 1st International Workshop on Interpretability and Robustness in Neural Software Engineering, co-located with the 45th International Conference on Software Engineering, ICSE 2023, Melbourne Australia
Need for Speed: Removing speed bumps from your Symfony projects ⚡️ (Łukasz Chruściel)
No one wants their application to drag like a car stuck in the slow lane! Yet it’s all too common to encounter bumpy, pothole-filled solutions that slow the speed of any application. Symfony apps are not an exception.
In this talk, I will take you for a spin around the performance racetrack. We’ll explore common pitfalls - those hidden potholes on your application that can cause unexpected slowdowns. Learn how to spot these performance bumps early, and more importantly, how to navigate around them to keep your application running at top speed.
We will focus in particular on tuning your engine at the application level, making the right adjustments to ensure that your system responds like a well-oiled, high-performance race car.
In the ever-evolving landscape of technology, enterprise software development is undergoing a significant transformation. Traditional coding methods are being challenged by innovative no-code solutions, which promise to streamline and democratize the software development process.
This shift is particularly impactful for enterprises, which require robust, scalable, and efficient software to manage their operations. In this article, we will explore the various facets of enterprise software development with no-code solutions, examining their benefits, challenges, and the future potential they hold.
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ... (Juraj Vysvader)
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc. I didn't get rich from it, but my work reached 63K downloads (and possibly powered tens of thousands of websites).
Check out the webinar slides to learn more about how XfilesPro transforms Salesforce document management by leveraging its world-class applications. For more details, please connect with sales@xfilespro.com
If you want to watch the on-demand webinar, please click here: https://www.xfilespro.com/webinars/salesforce-document-management-2-0-smarter-faster-better/
Workshop - Innovating with Generative AI and Knowledge Graphs (Neo4j)
Go beyond the hype around AI and discover practical techniques for using AI responsibly across your organization's data. Explore how knowledge graphs can be used to increase accuracy, transparency, and explainability in generative AI systems. You will leave with hands-on experience combining data relationships and LLMs to bring domain-specific context and improve reasoning.
Bring your laptop and we will guide you through setting up your own generative AI stack, providing practical, coded examples to get you started within minutes.
Mobile App Development Company In Noida | Drona InfotechDrona Infotech
Looking for a reliable mobile app development company in Noida? Look no further than Drona Infotech. We specialize in creating customized apps for your business needs.
Visit Us For : https://www.dronainfotech.com/mobile-application-development/
Graspan: A Big Data System for Big Code AnalysisAftab Hussain
We built a disk-based parallel graph system, Graspan, that uses a novel edge-pair centric computation model to compute dynamic transitive closures on very large program graphs.
We implement context-sensitive pointer/alias and dataflow analyses on Graspan. An evaluation of these analyses on large codebases such as Linux shows that their Graspan implementations scale to millions of lines of code and are much simpler than their original implementations.
These analyses were used to augment the existing checkers; these augmented checkers found 132 new NULL pointer bugs and 1308 unnecessary NULL tests in Linux 4.4.0-rc5, PostgreSQL 8.3.9, and Apache httpd 2.2.18.
- Accepted in ASPLOS ‘17, Xi’an, China.
- Featured in the tutorial, Systemized Program Analyses: A Big Data Perspective on Static Analysis Scalability, ASPLOS ‘17.
- Invited for presentation at SoCal PLS ‘16.
- Invited for poster presentation at PLDI SRC ‘16.
May Marketo Masterclass, London MUG May 22 2024.pdfAdele Miller
Can't make Adobe Summit in Vegas? No sweat because the EMEA Marketo Engage Champions are coming to London to share their Summit sessions, insights and more!
This is a MUG with a twist you don't want to miss.
Zoom is a comprehensive platform designed to connect individuals and teams efficiently. With its user-friendly interface and powerful features, Zoom has become a go-to solution for virtual communication and collaboration. It offers a range of tools, including virtual meetings, team chat, VoIP phone systems, online whiteboards, and AI companions, to streamline workflows and enhance productivity.
GraphSummit Paris - The art of the possible with Graph TechnologyNeo4j
Sudhir Hasbe, Chief Product Officer, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Enterprise Resource Planning System includes various modules that reduce any business's workload. Additionally, it organizes the workflows, which drives towards enhancing productivity. Here are a detailed explanation of the ERP modules. Going through the points will help you understand how the software is changing the work dynamics.
To know more details here: https://blogs.nyggs.com/nyggs/enterprise-resource-planning-erp-system-modules/
1. dbis INSTITUT FÜR INFORMATIK
HUMBOLDT-UNIVERSITÄT ZU BERLIN
Feeding a Squirrel in Time—Windows in Flink
Apache Flink Meetup Munich
Matthias J. Sax
mjsax@{informatik.hu-berlin.de|apache.org}
@MatthiasJSax
Humboldt-Universität zu Berlin
Department of Computer Science
November 11, 2015
2. – Matthias J. Sax – Windows in Apache Flink
About Me
Ph.D. student in CS, DBIS Group, HU Berlin
involved in the Stratosphere research project
working on data stream processing and optimization
Aeolus: built on top of Apache Storm
(https://github.com/mjsax/aeolus)
Committer at Apache Flink
3. Stream Processing
Processing data in motion:
external sources create data constantly
data is pushed to the system
need to keep up with incoming data rate
use of ingestion buffers (e.g., Apache Kafka)
handle data peaks
back pressure, dynamic scaling (or even load-shedding)
low processing latency (milliseconds)
no micro-batching
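The role of a bounded ingestion buffer in back pressure can be sketched in plain Java (this is an illustration of the concept, not Flink or Kafka API; the class and capacity are made up for the example). When the buffer is full, the producer is rejected or blocks instead of data being dropped:

```java
import java.util.concurrent.ArrayBlockingQueue;

public class IngestionBuffer {
    // Bounded queue: holds at most 'capacity' pending events.
    private final ArrayBlockingQueue<String> queue;

    public IngestionBuffer(int capacity) {
        queue = new ArrayBlockingQueue<>(capacity);
    }

    // Non-blocking offer: returns false when the buffer is full,
    // signaling back pressure to the producer.
    public boolean tryIngest(String event) {
        return queue.offer(event);
    }

    // Blocking variant: the producer waits until space frees up.
    public void ingest(String event) throws InterruptedException {
        queue.put(event);
    }

    public String poll() {
        return queue.poll();
    }

    public int pending() {
        return queue.size();
    }
}
```

A full buffer thus propagates the consumer's slowness upstream, which is the essence of back pressure.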
4. Other Systems
Apache Storm
widely used in industry
different processing guarantees
no guarantee
at-least-once
exactly-once (not for external writes)
no ordering guarantees
no type system
dynamic scaling (to some extent)
some high-level abstractions via Trident
windows, state, exactly-once processing
52. Streaming Tradeoffs
Processing Time
no late data / no skew
windows are simple to build
low latency
inherently non-deterministic
Event Time (external)
late data / skew
out-of-order data (windowing more difficult)
simpler to reason about semantics (deterministic)
increased latency
Event Time (ingestion)
no late data / no skew
no out-of-order data
simplified watermarking
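The determinism of event time can be seen in a plain-Java sketch (illustrative only, not Flink API): because each element's window is derived from the timestamp it carries, the per-window result is the same regardless of arrival order, which is exactly what processing time cannot guarantee.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class EventTimeWindows {
    // Assign each event timestamp (ms) to the start of its tumbling
    // window and count events per window. The result depends only on
    // the timestamps, not on the order in which elements arrive.
    public static Map<Long, Integer> countPerWindow(List<Long> timestamps,
                                                    long windowSizeMs) {
        Map<Long, Integer> counts = new TreeMap<>();
        for (long ts : timestamps) {
            long windowStart = ts - (ts % windowSizeMs);
            counts.merge(windowStart, 1, Integer::sum);
        }
        return counts;
    }
}
```

Feeding the same events in two different orders yields identical window counts, which is why event-time semantics are simpler to reason about.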
56. Time Based Windows
Timestamp Example
StreamExecutionEnvironment env = ...
// alternatives: ProcessingTime / IngestionTime
env.setStreamTimeCharacteristic(
    TimeCharacteristic.EventTime);

DataStream<Tuple> input = ...
input.assignTimestamps(new TimestampExtractor<Tuple>() {
    public long extractTimestamp(Tuple element,
            long currentTimestamp) {
        return /* extract from element */;
    }
    public long extractWatermark(Tuple element,
            long currentTimestamp) {
        return /* extract from element */;
    }
    public long getCurrentWatermark() {
        return Long.MIN_VALUE;
    }
});
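The getCurrentWatermark() stub above returns Long.MIN_VALUE, i.e., it never advances event time on its own. A common strategy is to track the largest timestamp seen and emit a watermark that lags it by a fixed bound on out-of-orderness. Sketched here in plain Java (not the Flink API; class and method names are illustrative):

```java
public class BoundedLatenessWatermark {
    private final long maxOutOfOrdernessMs;
    private long maxTimestampSeen = Long.MIN_VALUE;

    public BoundedLatenessWatermark(long maxOutOfOrdernessMs) {
        this.maxOutOfOrdernessMs = maxOutOfOrdernessMs;
    }

    // Called for every element: remember the highest event timestamp.
    public void onEvent(long eventTimestamp) {
        if (eventTimestamp > maxTimestampSeen) {
            maxTimestampSeen = eventTimestamp;
        }
    }

    // Watermark = "no element with a smaller timestamp is expected".
    public long currentWatermark() {
        return maxTimestampSeen == Long.MIN_VALUE
                ? Long.MIN_VALUE
                : maxTimestampSeen - maxOutOfOrdernessMs;
    }

    // An element is late if its timestamp is behind the watermark.
    public boolean isLate(long eventTimestamp) {
        return eventTimestamp < currentWatermark();
    }
}
```

Note that an out-of-order element never moves the watermark backwards; it can only be classified as late once the watermark has passed its timestamp.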
61. Time Based Windows (cont.)
Sliding Time Window Example
DataStream<...> input = ...
input.keyBy(...)
    // size = 5s; slide = 1s
    .timeWindow(Time.of(5, TimeUnit.SECONDS),
                Time.of(1, TimeUnit.SECONDS))
    .reduce(...);

General Window Example
DataStream<...> input = ...
input.keyBy(...)
    .window(...)
    .apply(new WindowFunction<...>() {
        // ...
    });
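With size 5 s and slide 1 s, each element falls into size/slide = 5 overlapping windows. The assignment arithmetic can be sketched independently of the Flink API (illustrative plain Java, all values in milliseconds):

```java
import java.util.ArrayList;
import java.util.List;

public class SlidingWindowAssigner {
    // Return the start timestamps of all sliding windows that contain
    // an element with the given event timestamp.
    public static List<Long> assignWindows(long timestamp,
                                           long sizeMs, long slideMs) {
        List<Long> starts = new ArrayList<>();
        // Latest window start at or before the timestamp.
        long lastStart = timestamp - (timestamp % slideMs);
        // Walk backwards in slide-sized steps while the window
        // [start, start + size) still covers the timestamp.
        for (long start = lastStart;
             start > timestamp - sizeMs;
             start -= slideMs) {
            starts.add(start);
        }
        return starts;
    }
}
```

For example, an element at t = 7500 ms with size 5000 ms and slide 1000 ms belongs to the five windows starting at 7000, 6000, 5000, 4000, and 3000 ms.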
65. Advanced Windowing Concepts
global windows (non-parallelized)
Triggers:
close a window (i.e., fire)
processing time
watermark
count
delta
... (with different discarding strategies)
Evictors:
remove tuples from the window before the function is applied
time, count, delta
mix different windows/triggers/evictors
67. Stateful Stream Processing
Flink can handle arbitrary user state:
state is stored reliably
distributed snapshots algorithm
Example
public class CounterSum
        extends RichReduceFunction<Long> {
    private OperatorState<Long> counter;

    public void open(Configuration config) {
        counter = getRuntimeContext()
            .getOperatorState("myCnt", Long.class, 0L);
    }

    public Long reduce(Long v1, Long v2) throws Exception {
        counter.update(counter.value() + 1);
        return v1 + v2;
    }
}
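The reliability of the counter above rests on Flink's distributed snapshots: operator state is periodically copied to durable storage, and after a failure the operator is reset to the last consistent snapshot. The snapshot/restore idea can be sketched in plain Java (a conceptual illustration, not Flink's actual checkpointing mechanism):

```java
public class SnapshottingCounter {
    private long counter = 0;

    public long add(long value) {
        counter += value;
        return counter;
    }

    // Snapshot: a point-in-time copy of the operator state.
    public long snapshot() {
        return counter;
    }

    // Restore: reset the state to a previous snapshot; the inputs
    // processed since that snapshot are then replayed (e.g., from
    // an ingestion buffer such as Kafka).
    public void restore(long snapshot) {
        counter = snapshot;
    }
}
```

Restoring a snapshot and replaying the subsequent inputs reproduces the exact pre-failure state, which is the basis of the exactly-once guarantee.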
71. Summary (cont.)
Flink provides a rich API (Java/Scala) to express different semantics
state handling for arbitrary UDF code
fault-tolerance with exactly-once guarantees
exactly-once sinks available
What else?
Python API is coming (right now DataSet only)
Google Dataflow on Flink
Storm on Flink
Apache SAMOA on Flink