Flink Forward Berlin 2018: Dongwon Kim - "Real-time driving score service using Flink"

Real-time driving score service
using Flink
Dongwon Kim
SK telecom

My talks @FlinkForward
Flink Forward 2015
A Comparative Performance
Evaluation of Flink
Flink Forward 2017
Predictive Maintenance
with
Deep Learning and Flink .
Flink Forward 2018
Real-time driving score service
using Flink

T map, a mobile navigation app by SK telecom
≈
Choose from
frequent locations
Enter an address
or
a place nameWaze
Google Maps

T map, a mobile navigation app by SK telecom
multiple route options in driving mode arriving at destination

Driving score service by T map
I scored 83
out of 100! yay!
Driving score
KB Insurance DB Insurance
10% discount 10% discount
Car insurance discount for safe drivers
If you drive safely with ,
automobile insurance premiums
go down.

Driving score is based on three factors
My driving score
Rank : 970k Speeding
Rapid
accel.
Rapid
decel.
greatgood good
Monthly chart
Apr May Jun Jul Aug

The three factors are calculated for each session
6/29 (Fri.)
min
min
SKT Network Operation Center
Yanghyeon Village
●speeding 0 ●rapid acc. 0 ●rapid decel. 0
6/28 (Thu.)
min
min
Yanghyeon Village
● ● ●

The three factors are calculated for each session
● ● ●
Speeding
0.2km
My speed : 90km/h
(Speed limit : 70km/h)
Rapid accel.
(within 3 sec)
Rapid decel.
(within 3 sec)

Current client-server architecture
A GPS trajectory is generated
for each driving session
…
GPS coord.
• latitude
• longitude
• altitude
T1
GPS coord.
• latitude
• longitude
• altitude
T2
GPS coord.
• latitude
• longitude
• altitude
TN
T map
GPS trajectory
driving score
(+1day)
Batch ETL jobs are executed
twice a day
to calculate three factors ●●●
from trajectories
The main drawback
Users cannot see today’s driving scores until tomorrow
T map service server
...
11min

Migration from batch ETL to streaming processing
... ...
Service
DB
Millions
of users
...
Batch processing
Real-time streaming
processing
Goal
Let users know driving scores ASAP

Why we choose to use Flink?
https://flink.apache.org/introduction.html#features-why-flink
Exactly-once semantics
for stateful computations
stream processing and windowing
with event time semantics
flexible windowing
light-weight fault-tolerance high throughput and low latency

Contents
• Dataflow design and trigger customization
• Instrumentation with Prometheus
Source
JSON
parser SinkKafka Kafka Service DBUser
key-based
Bounded
OutOfOrderness
TimestampExtractor
(BOOTE)
messages
USER1 to ...
USER2 to ...
USER3 to ...
......
user ID +
destination
Session window
with a custom trigger
Define metrics Collect metrics Plot metrics

A 12-minute driving with 720 GPS coordinates
T map T map service server
...
...
...
...
T map generates
a GPS coordinate
every second

T map sends 4 messages to the service server
1st periodic message
(300 coordinates for the first 5 mins)
2nd periodic message
(300 coordinates for the next 5 mins)
End message
(120 coordinates for the last 2 mins)
...
...
...
T map T map service server
...
Init
message

Return scores right after receiving end messages
T map
driving score
7:20
T map service server
...
Init
a
7:08
Periodic
b
7:13
c
7:18
End
d
7:20
Messages
11min

Real-time driving score dataflow using
Source
JSON
key-based
Logical
dataflow
messages
USER1 to ...
USER2 to ...
USER3 to ...
......
user ID +
destination
Session window
Bounded
OutOfOrderness
TimestampExtractor
(BOOTE)
at-least-once Kafka producer
session gap : 1 hour

Real-time driving score dataflow using
Source
JSON
key-based
Logical
dataflow
Bounded
OutOfOrderness
TimestampExtractor
(BOOTE)
...
Source
Session window
p0
p1
p2
p19
20
partitions
20
tasks
256 tasks
...
...
...
p0
p1
p2
p19
20
partitions
Sink
...
several million
users
20
tasks
256
tasks
Service DB
...
User
Physical
dataflow
...
20
tasks
JSON
parser BOOTE
messages
USER1 to ...
USER2 to ...
USER3 to ...
......
user ID +
destination
messages
…
…
......
user ID +
destination
messages
…
…
......
user ID +
destination
messages
…
…
......
user ID +
destination
Session window

Session window (gap : 1 hour) with different triggers
The default
EventTimeTrigger
EarlyResultEventTimeTrigger
Time
Time

8:13 8:18 8:208:08
abcd
7:13
b
Periodic
7:08
a
Init
7:18
c
Periodic
The default
EventTimeTrigger
● 1 ● 1 ● 1
7:20
d
End
fire
Time
Time
Watermark

8:13 8:18 8:208:087:13
b
Periodic
7:08
a
Init
7:18
c
Periodic
7:20
d
End
8:13 8:18 8:208:087:13
b
Periodic
7:08
a
Init
7:18
c
Periodic
abcd
abcd ● 1 ● 1 ● 1
● 1 ● 1 ● 1
The default
EventTimeTrigger
7:20
d
End
early fire DO NOT fire
fire
(necessary in case of out-of-order messages)
Time
Time
Early timer

Slow for some reason
Out-or-order messages
...
Source
... ......
JSON
parser
p0
p1
p2
p19
...
p0
p1
p2
p19
Service
DB
... ...
a
Init
b
Periodic
cd
End
a
b
c
d
messages
……
……
......
user ID +
destination
messages
……
……
......
user ID +
destination
Session window w/
(session gap : 1 hour)
Sink
messages
user ID +
destination
……
……
Dongwon
to
SKT NOC
ab
Dongwon’s
iPhone
BOOTE
(maxOoO : 1 sec)
c d

How EarlyResultEventTimeTrigger deals with out-or-order messages
[Case 1] C arrives
before the early timer expires
c
[Case 2] C arrives
after the early timer expires
c
Time
Time

b
Periodic
a
Init
c
Periodic
abdc ● 1 ● 1 ● 1
d
End
early fire (perfect result) DO NOT fire
(no messages added after the last fire)
[Case 1] C arrives
c
[Case 2] C arrives
c
Time
Time

b
Periodic
a
Init
c
Periodic
● 1 ● 1 ● 1
d
End
early fire (perfect result) DO NOT fire
(no messages added after the last fire)
b
Periodic
a
Init
c
Periodic
abd ● 0 ● 1 ● 1
d
End
early fire (incomplete result)
[Case 1] C arrives
c
[Case 2] C arrives
c
2nd fire (perfect result)
abc d ● 1 ● 1 ● 1
Time
Time
abdc

[Constructor]
Get an evaluator to determine early firing
https://github.com/eastcirclek/flink-examples/blob/master/src/main/scala/com/github/eastcirclek/flink/trigger/EarlyResultEventTimeTrigger.scala
[onElement]
register an early timer
if the evaluator returns true
(e.g. when the end message comes in)
[onEventTime]
Fire if the early timer expires

Individual message statistics
N:1
Message stats.
extractor
Message stats.
sink
Source SinkKafka Kafka
JSON
parser BOOTE
key-based
messages
……
……
......
user ID +
destination
Source
20 tasks
... ...
20 tasks
JSON
parser
...
Message stats.
extractor
Message stats.
sink
20 tasks
1 task
Logical
dataflow
Physical
dataflow
Session window
Service DBUser

Individual message statistics
1K messages per second
100M messages per day
10s of MB per second
2 TB per day
N:1
Message stats.
extractor
Message stats.
sink
JSON
parser BOOTE
key-based
messages
……
……
......
user ID +
destination
Logical
dataflow
Session window
Service DBUser
meter histogram histogrammeter

Jitter (ingestion time – event time)
JSON
parser
Bounded
OutOfOrderness
TimestampExtractor
key-based
messages
……
……
......
user ID +
destination
Logical
dataflow
Session window
Service DB
event
time
ingestion
time
User
1 sec
Based on this observation,
we use 1 sec for maxOutOfOrderness

Session output statistics
N:1 N:1
Message stats.
extractor
Message stats.
sink
Session output stats.
extractor
sink
JSON
parser BOOTE
key-based
messages
……
……
......
user ID +
destination
Source
20 tasks
... ...
20 tasks
JSON
parser
...
Message stats.
extractor
Message stats.
sink
20 tasks
1 task
messages
……
……
......
user ID +
destination
256 tasks
extractor
sink
256 tasks
1 task...
Session window
...
messages
……
……
......
user ID +
destination
messages
……
……
......
user ID +
destination
● ● ● ● ● ●
● ● ● ● ● ●
● ● ● ● ● ●
Logical
dataflow
Physical
dataflow
Session window
Service DBUser

Session output statistics
N:1
extractor
sink
JSON
parser BOOTE
key-based
messages
……
……
......
user ID +
destination
Logical
dataflow
Session window
Service DBUser
N:1
Message stats.
extractor
Message stats.
sink
meter histogram histogrammeter

Our own definition of latency
ingestion time
of end messages
extractor
sink
Source
Sink
Kafka
Kafka
JSON
parser BOOTE
Session window
messages
user ID +
destination
……
……
Dongwon
to
SKT NOC
abcd
End
d
End
d
End
d
End
d
processing time of
the resultant session output
@extractor
● 1 ● 1 ● 1
● 1 ● 1 ● 1
Considering maxOutOfOrderness is 1 second,
Flink takes at most 250 milliseconds

N:1 N:1
Message stats.
extractor
Message stats.
sink
extractor
sink
JSON
parser BOOTE
key-based
messages
……
……
......
user ID +
destination
Service DBUser
How to expose metrics to Prometheus?
Session window

TaskManager #1 TaskManager #2JobManager
Push-model and pull-model
Ganglia
reporter
Ganglia
reporter
Ganglia
reporter
push push push

Ganglia
reporter
Graphite
reporter
Ganglia
reporter
Graphite
reporter
Ganglia
reporter
Graphite
reporter
push push push
pushed

Prometheus
reporter
(HTTP endpoint)
Ganglia
reporter
Graphite
reporter
Prometheus
reporter
(HTTP endpoint)
Ganglia
reporter
Graphite
reporter
Prometheus
reporter
(HTTP endpoint)
Ganglia
reporter
Graphite
reporter
pull
pushed pushed

Reporter configuration
Prometheus
reporter
Ganglia
reporter
Graphite
reporter
https://ci.apache.org/projects/flink/flink-docs-stable/monitoring/metrics.html#reporter
client
socket
client
socket
server
socket

Node
Manager
w1
Node
Manager
w2
Node
Manager
w3
Node
Manager
w4
Resource
Manager
Endpoint addresses cannot be determined in advance
#!/bin/bash
# launch a Flink per-job cluster on YARN
flink run
--jobmanager yarn-cluster
--yarncontainer 4
...
# flink-conf.yaml
...
metrics.reporter.prom.port: 5001-5100
...
Q. Can we list the endpoint addresses
before YARN’s scheduling?
A. No, impossible

Node
Manager
w1
Node
Manager
w2
Node
Manager
w3
Node
Manager
w4
Resource
Manager
#!/bin/bash
flink run
--yarncontainer 4
...
# flink-conf.yaml
...
...
TM
Prom. endpoint
w1:5002
TM
Prom. endpoint
w2:5001
JM
Prom. endpoint
w2:5001 TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5001
Possible world #1

Node
Manager
w1
Node
Manager
w2
Node
Manager
w3
Node
Manager
w4
Resource
Manager
#!/bin/bash
flink run
--yarncontainer 4
...
# flink-conf.yaml
...
...
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5002
JM
Prom. endpoint
w2:5001 TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5001
Possible world #2

Node
Manager
w1
Node
Manager
w2
Node
Manager
w3
Node
Manager
w4
Resource
Manager
#!/bin/bash
flink run
--yarncontainer 4
...
# flink-conf.yaml
...
...
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5001
JM
Prom. endpoint
w2:5001
TM
Prom. endpoint
w3:5002
TM
Prom. endpoint
w4:5001
Possible world #3

Node
Manager
w1
Node
Manager
w2
Node
Manager
w3
Node
Manager
w4
Resource
Manager
#!/bin/bash
flink run
--yarncontainer 4
...
# flink-conf.yaml
...
...
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5001
JM
Prom. endpoint
w2:5001TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5002
Possible world #4

Node
Manager
w1
Node
Manager
w2
Node
Manager
w3
Node
Manager
w4
Resource
Manager
#!/bin/bash
flink run
--yarncontainer 4
...
# flink-conf.yaml
...
...
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5002
JM
Prom. endpoint
w2:5001 TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5001
TM
Prom. endpoint
w1:5002
TM
Prom. endpoint
w2:5003
JM
Prom. endpoint
w3:5002
TM
Prom. endpoint
w3:5003
TM
Prom. endpoint
w4:5002

Where to scrape metrics form?
Endpoint addresses are available after a cluster is up
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5002
JobManager
Prom. endpoint : w2:5001
TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5001
TM
Prom. endpoint
w1:5002
TM
Prom. endpoint
w2:5003
JobManager
TM
Prom. endpoint
w3:5003
TM
Prom. endpoint
w4:5002
A per-job cluster
(YARN ID : application_1500000000000_0001)
Another per-job cluster

File-based service discovery mechanism
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5002
JobManager
TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5001
TM
Prom. endpoint
w1:5002
TM
Prom. endpoint
w2:5003
JobManager
TM
Prom. endpoint
w3:5003
TM
Prom. endpoint
w4:5002
A per-job cluster
/etc/prometheus/flink-service-discovery/
[
{
“targets”: [“w2:5001”, “w1:5001”, “w2:5002”, “w3:5001”, “w4:5001”],
}
]
application_1528160315197_0001.json
[
{
“targets”: [“w3:5002”, “w1:5002”, “w2:5003”, “w3:5003”, “w4:5002”],
}
]
🙄 watches file names matching a given pattern

File-based service discovery
scrape metrics
from known endpoints
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5002
JobManager
TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5001
TM
Prom. endpoint
w1:5002
TM
Prom. endpoint
w2:5003
JobManager
TM
Prom. endpoint
w3:5003
TM
Prom. endpoint
w4:5002
A per-job cluster
w2:5001, w1:5001, w2:5002, w3:5001, w4:5001
w3:5002, w1:5002, w2:5003, w3:5003, w4:5002

flink-service-discovery
https://github.com/eastcirclek/flink-service-discovery
YARN
Resource
Manager
discovery.py
param1) rmAddr
param2) targetDir
TM
Prom. endpoint
w1:5001
TM
Prom. endpoint
w2:5002
JobManager
TM
Prom. endpoint
w3:5001
TM
Prom. endpoint
w4:5001
1) watch a new Flink cluster
2) get the address of JM
3) get all TM identifiers
4) identify all endpoints
by scrapping JM/TM logs
[
{
“targets”: [“w2:5001”, “w1:5001”,
“w2:5002”, “w3:5001”, “w4:5001”],
}
]
/etc/prometheus/flink-service-discovery/
🙄
5) create a file
6) scrape metrics from JM and TMs

Overview & summary
• Dataflow design and trigger customization
• Instrumentation with Prometheus
Source
JSON
key-based
Bounded
OutOfOrderness
TimestampExtractor
(BOOTE)
messages
USER1 to ...
USER2 to ...
USER3 to ...
......
user ID +
destination
Session window
Define metrics Collect metrics Plot metrics

Flink Forward Berlin 2018: Dongwon Kim - "Real-time driving score service using Flink"

Recommended

Recommended

More Related Content

Similar to Flink Forward Berlin 2018: Dongwon Kim - "Real-time driving score service using Flink"

Similar to Flink Forward Berlin 2018: Dongwon Kim - "Real-time driving score service using Flink" (20)

More from Flink Forward

More from Flink Forward (20)

Recently uploaded

Recently uploaded (20)

Flink Forward Berlin 2018: Dongwon Kim - "Real-time driving score service using Flink"