FlinkCEP Library - Dawid Wysakowicz, GetInData (WHUG)

© Copyright. All rights reserved. Not to be reproduced without prior written consent.
Looking for Patterns -
FlinkCEP Library
_D. Wysakowicz_

About me
■ Data Engineer at GetInData
■ Apache Flink Committer
■ Involved in Flink CEP library development
Dawid Wysakowicz
@OneMoreCoder

That moment
when party
goes wrong.

Were there
any signs?
Any Patterns?
That moment
when party
goes wrong.

Image source: “Continuous Analytics: Stream Query Processing in Practice”, Michael J Franklin, Professor, UC Berkley, Dec 2009 and
http://www.slideshare.net/JoshBaer/shortening-the-feedback-loop-big-data-spain-external

Buying Home Insurance Online
Property
Info
Offer
Accept
General Terms
of Contract
PaymentScope
Change
Info
Change

Buying Home Insurance Online
Property
Info
Offer
Accept
Exit
Scope
Change
Info
Change
General Terms
of Contract
Payment

Simple Example - Matching Payments
Notify whenever a payment for accepted offer arrives

Create Pattern
/* create pattern sequence*/
Pattern<Event, ?> pattern = Pattern<Event>
.begin("offer accepted")
.subtype(OfferAccepted.class)
.followedBy("payment received")
.subtype(PaymentReceived.class)
.within(Time.hours(48));

Apply Pattern
/* convert match sequence into alerts */
DataStream<Alert> alerts = CEP.pattern(stream, pattern)

Select Matches
.select(new PatternSelectFunction<Event, Alert>() {
@Override
public Alert select(Map<String, List<Event>> match) throws Exception {
return createAlert(match);
}
});

Handle Results
.select(new PatternSelectFunction<Event, Alert>() {
@Override
public Alert select(Map<String, List<Event>> match) throws Exception {
return createAlert(match);
}
});
/* write out results */
alerts.addSink(...);

Seamless Integration - Standard Pipeline
DataStream
Source e.g.
DataStream
Sink e.g.
DataStream
Transformations

Seamless Integration - CEP Library
DataStream
Source e.g.
DataStream
Sink e.g.
DataStream
Transformations
CEP
DataStream Pattern
DataStream
Stream of matches

CEP Resulting Matches
CEP
DataStream
Pattern
A -> B -> C
C2C1B2B1A
DataStream
Stream of matches
A, B1, C1 A, B2, C1 A, B1, C2 A, B2, C2

CEP Timeouted Matches
CEP
DataStream
Pattern
A -> B -> C within 5 seconds
C2(t=7)B2(t=4)B1(t=2)A(t = 1)
DataStream
Stream of timeouted matches
W=6
?

CEP Timeouted Matches
CEP
DataStream
Pattern
A -> B -> C within 5 seconds
C2(t=7)B2(t=4)B1(t=2)A(t = 1)
DataStream
Stream of timeouted matches
A A, B1 A, B2
W=6

Pattern Sequence
Pattern
.begin("start")
.subtype(Event.class)
.where(...)
.or(...)
.followedBy("next")
.where(...)
.where(...)

Patterns
Pattern
.begin("start")
.where(...)
.or(...)
.followedBy("next")
.where(...)
.where(...)
Basic building blocks

Conditions
Pattern
.begin("start")
.where(...)
.or(...)
.followedBy("next")
.where(...)
.where(...)
Condition specification

Time Restrictions
Pattern
.begin("start")
.where(...)
.or(...)
.followedBy("next")
.where(...)
.where(...)
.within(Time.seconds(3))
Time restrictions

Time Restrictions
Pattern
.begin("start")
.where(...)
.or(...)
.followedBy("next")
.where(...)
.where(...)
Time restrictions
NOTE: The time restriction is a
global property. This is a period in
which either whole match is
generated or partial matches are
timeouted.

■ Skip till next
Consuming Strategy

■ Skip till next
■ Strict continuity
Consuming Strategy

■ Skip till next
■ Skip till any
Consuming Strategy

■ Skip till next
■ Skip till any
■ Not follow
Consuming Strategy

■ Skip till next
■ Skip till any
Consuming Strategy
■ Not follow

■ Skip till next
■ Skip till any
■ Not follow
■ Not next
Consuming Strategy

■ Skip till next
■ Skip till any
Consuming Strategy
■ Not follow
■ Not next

Consuming Strategy
■ Skip till next
■ Skip till any
■ Not follow
■ Not next

Looping Example
How many offer changes before accepting?
...
?

Pattern Specification
Pattern
.begin("start")
.where(...)
.or(...)
.times(3)
.followedBy("next")
.where(...)
.where(...)
.oneOrMore().optional()
Quantifier application
Time restrictions

Quantifiers
■ Singleton pattern
■ Complex pattern
● Times (count)
● Looping pattern

Quantifiers
■ Singleton pattern
■ Complex pattern
● Times (count)
● Looping pattern
■ Optional pattern
● singleton -> optional
● times -> no or exactly n times
● oneOrMore -> zeroOrMore

Complex Patterns - Consuming Strategies
?
Pattern
.begin("offer").subtype(Offer.class)

?
Pattern
.followedByAny("changes").subtype(OfferChanged.class)

?
Pattern
.oneOrMore()
.consecutive()

?
Pattern
.oneOrMore()
.consecutive()
.followedBy(“accepted”).subtype(OfferAccepted.class)

■ Consuming strategy - before first element
■ Inner consuming strategy - between events matched in the
Pattern

Iterative Condition
Trigger alert when the same property changed.
PL Flood PL

Iterative Condition
public abstract class IterativeCondition<T> implements Function, Serializable {
public abstract boolean filter(T value, Context<T> ctx) throws Exception;
/**
* @return An {@link Iterable} over the already accepted elements
* for a given pattern. Elements are iterated in the order they were
* inserted in the pattern.
*
* @param name The name of the pattern.
*/
public interface Context<T> extends Serializable {
Iterable<T> getEventsForPattern(String name);
}
}

Iterative Condition
Trigger alert when the same property changed.
Pattern
.begin("first change").subtype(OfferChanged.class)
.followedBy("second change").subtype(OfferChanged.class)
.where(new IterativeCondition<OfferChanged>() {
@Override
public boolean filter(OfferChanged value, Context<OfferChanged> ctx) {
return ctx.getEventsForPattern("first change").iterator().next()
.property().equals(value.property());
}
});
PL Flood PL

Complex Example
Alert when a user constantly decreases value of the
property
-12.000€
-14.000€
-24.000€
-100.000€

Complex Example - Implementation
Alert when a user changed value by more than 200% of
previous average change
Pattern
.begin("change").subtype(ValueChanged.class)
.oneOrMore()
.followedBy("alerting change").subtype(ValueChanged.class)
.where(new IterativeCondition<OfferChanged>() {
@Override
public boolean filter(OfferChanged value, Context<OfferChanged> ctx) {
double averageChange = countAverage(ctx.getEventsForPattern("change"));
return value.change > 2 * averageChange;
}
});

FlinkCEP & Asian Telco
■ +15 subscriber-oriented triggers to implement
● e.g. when a subscriber registers in a roaming and then
spends X USD in a roaming

Summary
■ Shortest possible feedback loop
■ Seamless integration with Flink ecosystem
■ Complex patterns
● Dynamic length
● Conditions based on calculations

How does it works underneath?

EventStream
Order handling
1 5 4 2
PatternOperator
ElementsQueue
1 2 4 5
NFA

EventStream
Order handling
1 5 4 82
W=5
PatternOperator
ElementsQueue
1 2 4 5
W=5
NFA

EventStream
Order handling
1 5 4 82
W=5
PatternOperator
ElementsQueue
1 2 4 5
W=5
NFA
MatchesStream
A B

EventStream
Order handling - late elements
1 5 4 8 32
W=5
PatternOperator
ElementsQueue
1 2 4 5
W=5
NFA
MatchesStream
A B

Nondeterministic Finite Automaton
■ Graph where:
● Vertices = States
● Edges = Transitions
■ IGNORE - event not consumed
■ TAKE - event consumed
■ PROCEED - event analyzed in the target state

Example of NFA
Pattern
.begin(“A”)
.followedByAny(“B”).optional()
.next(“C”)
.followedBy(“D”)

Example of NFA
A B? C D
TAKE TAKE
TAKE
PROCEED
IGNORE IGNORE IGNORE
TAKE
Pattern
.begin(“A”)
.followedByAny(“B”).optional()
.next(“C”)
.followedBy(“D”)

EventsStream
Example of NFA
A B? C D
TAKE TAKE
TAKE
PROCEED
TAKE

EventsStream
Example of NFA
A B? C D
TAKE TAKE
TAKE
PROCEED
TAKE
A

EventsStream
Example of NFA
A
A
B?
A

EventsStream
A B
Example of NFA
A B? C D
TAKE TAKE
TAKE
PROCEED
TAKE

EventsStream
Example of NFA
A B? C D
TAKE TAKE
TAKE
PROCEED
TAKE
A B

EventsStream
Example of NFA
A
A
B?
A
B
A
C
B?

EventsStream
Example of NFA
A B? C D
TAKE TAKE
TAKE
PROCEED
TAKE
A B C

EventsStream
Example of NFA
A
A
B?
A
B
A
C
B?
D
B?
D
A
C

EventsStream
Example of NFA
A
A
B?
A
B
A
C
B?
D
B?
D
A
C D
Match: A, C, D
B?
Match: A, B, C, D
A

Shared Buffer - how it works
MiddleStart End
S1
S2
M1
M2
M3
E1
E2
1
2
1.0
1.0.0
1.0.0.0
2.0
1.0.0.0.0
1.0.0.0.1
2.0.0
Events order
S1 M1 M2 S2 M3 E1 E2

FlinkCEP Library - Dawid Wysakowicz, GetInData (WHUG)

Recommended

Recommended

More Related Content

Similar to FlinkCEP Library - Dawid Wysakowicz, GetInData (WHUG)

Similar to FlinkCEP Library - Dawid Wysakowicz, GetInData (WHUG) (20)

More from GetInData

More from GetInData (20)

Recently uploaded

Recently uploaded (20)

FlinkCEP Library - Dawid Wysakowicz, GetInData (WHUG)