This slide set was presented at UCSB on Sep. 30, 2017.
The talk covers an extended version of the slides from SoCC 2017 plus a quick overview of Apache Flink.
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of Apache Flink
1. On-Demand Data Streaming
from Sensor Nodes
(ACM SoCC 2017)
and
A quick overview of Apache Flink
Presentation at Sep. 30, 2017
University of California, Santa Barbara
2. About me
• Researcher and PhD candidate at
– Technische Universität Berlin (DIMA)
– German Research Center for Artificial Intelligence (DFKI) / (IAM)
• Working with Volker Markl
• Before
– Master’s degree in Computer Science (KTH Stockholm and TU Belin)
– Bachelor’s degree in Applied Computer Science (DHBW Stuttgart)
– Four years at IBM in Germany and the USA
Jonas Traub
jon@s-traub.com
Jonas.traub@tu-berlin.de
3. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Optimized On-Demand Data
Streaming from Sensor Nodes
Jonas Traub, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl
Extended Talk for . .
Santa Clara, California,
September 25-27, 2017
4. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud
Real-time
insights
4
5. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud
Real-time
insights
Billions of sensor nodes form a sensor cloud
and provide data streams to analysis systems.
5
6. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud
Real-time
insights
Billions of sensor nodes form a sensor cloud
and provide data streams to analysis systems.
6
7. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud
Real-time
insights
Billions of sensor nodes form a sensor cloud
and provide data streams to analysis systems.
7
8. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud
Real-time
insights
Billions of sensor nodes form a sensor cloud
and provide data streams to analysis systems.
8
9. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud – Problems
Real-time
insights
9
Billions of sensor nodes form a sensor cloud
and provide data streams to analysis systems.
10. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud – Problems
Real-time
insights
Streaming all data from billions
of sensors to all applications
with maximal frequencies is impossible
10
Billions of sensor nodes form a sensor cloud
and provide data streams to analysis systems.
11. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud – Problems
Real-time
insights
Streaming all data from billions
of sensors to all applications
with maximal frequencies is impossible
Increasing data rates
require expensive
system scale-out.
11
Billions of sensor nodes form a sensor cloud
and provide data streams to analysis systems.
12. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud – Solutions
12
Tailor Data Streams to the Demand of Applications
13. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud – Solutions
13
Tailor Data Streams to the Demand of Applications
• Provide an abstraction to define the data demand of applications.
User-Defined Sampling Functions (UDSFs)
14. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud – Solutions
14
Tailor Data Streams to the Demand of Applications
• Provide an abstraction to define the data demand of applications.
• Optimize communication costs while maintaining the result accuracy.
User-Defined Sampling Functions (UDSFs)
Read-Time Optimization
15. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
The Sensor Cloud – Solutions
15
Tailor Data Streams to the Demand of Applications
• Provide an abstraction to define the data demand of applications.
• Optimize communication costs while maintaining the result accuracy.
• Share sensor reads and data transfer among users and queries.
User-Defined Sampling Functions (UDSFs)
Read-Time Optimization
Multi-Query / Multi-User Optimization
16. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
16
17. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
17
18. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
18
Different Data Data Demands:
• Query 1 adaptively increases sampling rates when accelerating or braking.
19. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
19
Different Data Data Demands:
• Query 1 adaptively increases sampling rates when accelerating or braking.
20. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
20
Different Data Data Demands:
• Query 1 adaptively increases sampling rates when accelerating or braking.
• Query 2 requires a sample at least every 20 meters
21. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
21
Different Data Data Demands:
• Query 1 adaptively increases sampling rates when accelerating or braking.
• Query 2 requires a sample at least every 20 meters
22. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
22
Different Data Data Demands:
• Query 1 adaptively increases sampling rates when accelerating or braking.
• Query 2 requires a sample at least every 20 meters
• Query 3 requires a sample at least every 0.3s.
23. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
23
Different Data Data Demands:
• Query 1 adaptively increases sampling rates when accelerating or braking.
• Query 2 requires a sample at least every 20 meters
• Query 3 requires a sample at least every 0.3s.
24. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example
24
Different Data Data Demands:
• Query 1 adaptively increases sampling rates when accelerating or braking.
• Query 2 requires a sample at least every 20 meters
• Query 3 requires a sample at least every 0.3s.
25. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example - Evaluation
25
26. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example - Evaluation
26
-57%
27. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example - Evaluation
27
-57%
-72%
28. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A Motivating Example - Evaluation
28
1/3 because 3 values per tuple
-57%
-72%
29. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Architecture Overview
29
30. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Architecture Overview
30
31. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Architecture Overview
31
32. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Architecture Overview
32
33. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Architecture Overview
33
34. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Sensor Read Scheduling
34
35. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
User-Defined Sampling Functions
35
Input:
Sensor read time and value
Output:
Next Sensor Read Request
36. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
User-Defined Sampling Functions
36
Input:
Sensor read time and value
37. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
User-Defined Sampling Functions
37
Enable adaptive sampling techniques to reduce data transmission
e.g., Adam [Trihinas ‘15], FAST [Fan ‘14], L-SIP [Gaura ’13]
38. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
User-Defined Sampling Functions - Examples
38
39. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
User-Defined Sampling Functions - Examples
39
40. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
User-Defined Sampling Functions - Examples
40
41. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Sensor Read Fusion
41
42. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Sensor Read Fusion
42
1) Minimize Sensor Reads and Data Transfer:
Latest possible read time
43. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Read Time Optimization
43
2) Optimize Sensor Read Times:
● Minimize penalty while executing the minimum number of sensor reads only
● Challenge: assign read requests to sensor reads
44. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Assigning Read Requests to Sensor Reads
44
PostponeAssign to next Read
45. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Assigning Read Requests to Sensor Reads
45
PostponeAssign to next Read
46. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Assigning Read Requests to Sensor Reads
46
PostponeAssign to next Read
47. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Assigning Read Requests to Sensor Reads
47
PostponeAssign to next Read
48. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Local Filtering
48
49. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Local Filtering
49
● Enable adaptive filtering in combination with adaptive sampling
● Enable model-driven data acquisition
50. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Local Filtering
50
51. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Evaluation
●Replay sensor data
- from a football match [DEBS Grand Challenge ’13]
- formula 1 telementry data
●Random UDSFs:
- Read in a poisson process (also simulate load peaks)
- In average 1 read per query per second
- Exponentially distributed read time tolerance
- high probability for small tolerances
- small probability for large tolerances
- In average 0.04s read time tolerance
51
52. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17 52
Increasing the number of concurrent queries
• On-Demand scheduling reduces sensor reads and data transfer by up to 87%.
• The # of reads and transfers increases sub-linearly with the # of queries.
53. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17 53
Increasing the number of concurrent queries
• Our read-time optimizer reduces the deviation from desired read times
by up to 69% (preserving the min. # of reads and transfers).
54. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17 54
Increasing read time tolerances
55. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17 55
Increasing read time tolerances
56. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17 56
Query Prioritization (1/2)
57. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17 57
Query Prioritization (2/2)
58. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17 58
Slack Robustness of Adaptive Sampling
59. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
Optimized On-Demand Data
Streaming from Sensor Nodes
Wrap-Up:
Tailor Data Streams to the Demand of Applications
• Define data demand: User-Defined Sampling Functions
• Schedule sensor reads and data transfer on-demand
• Optimize read times globally - for all users and queries
Jonas Traub, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl
60. Traub et al., Optimized On-Demand Data Streaming from Sensor Nodes, SoCC ‘17
A quick overview of Apache Flink
- research summary -
Jonas Traub visiting September 30, 2017
61. Outline
Apache Flink Primer
• Stratosphere – The origin of Apache Flink
• What is Apache Flink? – Basic System Internals
• The Flink Community
An Apache Flink Research Summary
61
84. On-Demand Data Streaming
from Sensor Nodes
(ACM SoCC 2017)
and
A quick overview of Apache Flink
Presentation at Sep. 30, 2017
University of California, Santa Barbara