SlideShare a Scribd company logo
1 of 11
Download to read offline
1 Jonas Traub, BIRTE @ VLDB, 20181 Jonas Traub, BIRTE @ VLDB, 2018
Are we solving the core problems
in stream processing?
Jonas Traub
Technische Universität Berlin / DFKI IAM
www.dima.tu-berlin.de | jonas.traub@tu-berlin.de
Panel Discussion with:
Manpreet Singh (Google)
Karthik Ramasamy (Stremlio)
C. Mohan (IBM)
Badrish Chandramouli (Microsoft)
Neng Lu (Twitter)
Alok Pareek (Striim)
Jonas Traub (TU-Berlin)
2 Jonas Traub, BIRTE @ VLDB, 20182 Jonas Traub, BIRTE @ VLDB, 2018
Are we solving the core problems
in stream processing?
Jonas Traub
Technische Universität Berlin / DFKI IAM
www.dima.tu-berlin.de | jonas.traub@tu-berlin.de
3 Jonas Traub, BIRTE @ VLDB, 20183 Jonas Traub, BIRTE @ VLDB, 2018
Are we solving the core problems
in stream processing?
Yes, we do!
4 Jonas Traub, BIRTE @ VLDB, 20184 Jonas Traub, BIRTE @ VLDB, 2018
Are we solving the core problems
in stream processing?
Yes, we do!
Apache Flink and
its success story
What are the core problems
and how are we solving them?
Examples
5 Jonas Traub, BIRTE @ VLDB, 2018
5
5 Jonas Traub, BIRTE @ VLDB, 2018
Apache Flink Timeline
6 Jonas Traub, BIRTE @ VLDB, 2018
6
6 Jonas Traub, BIRTE @ VLDB, 2018
7 Jonas Traub, BIRTE @ VLDB, 2018
Apache Flink - Stateful Computations over Data Streams
source: flink.apache.org
• Event-driven Applications
• Stream & Batch Analytics
• Data Pipelines & ETL
• Exactly-once state consistency
• Event-time processing
• Sophisticated late data handling
• Scale-out architecture
• Support for very large state
• Incremental checkpointing
8 Jonas Traub, BIRTE @ VLDB, 20188 Jonas Traub, BIRTE @ VLDB, 2018
Examples:
What are core problems and
how are we solving them?
9 Jonas Traub, BIRTE @ VLDB, 20189 Jonas Traub, BIRTE @ VLDB, 2018
Examples:
Expressiveness: Event-time processing and sophisticated late data handling
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-
Scale, Unbounded, Out-of-Order Data Processing (Akidau et al.)
10 Jonas Traub, BIRTE @ VLDB, 201810 Jonas Traub, BIRTE @ VLDB, 2018
Examples:
Expressiveness: Event-time processing and sophisticated late data handling
Service: Common APIs and feature sets
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-
Scale, Unbounded, Out-of-Order Data Processing (Akidau et al.)
Apache Beam: An advanced unified programming model
“Implement batch and streaming data processing jobs that run on any
execution engine. (beam.apache.org)”
11 Jonas Traub, BIRTE @ VLDB, 201811 Jonas Traub, BIRTE @ VLDB, 2018
Examples:
Consistency: Exactly-once state consistency
Expressiveness: Event-time processing and sophisticated late data handling
Service: Common APIs and feature sets
Lightweight asynchronous snapshots for distributed dataflows
P Carbone, G Fóra, S Ewen, S Haridi, K Tzoumas
State management in Apache Flink: consistent stateful distributed stream processing
P Carbone, S Ewen, G Fóra, S Haridi, S Richter, K Tzoumas
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-
Scale, Unbounded, Out-of-Order Data Processing (Akidau et al.)
Apache Beam: An advanced unified programming model
“Implement batch and streaming data processing jobs that run on any
execution engine. (beam.apache.org)”

More Related Content

More from Jonas Traub

Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream SlicingFlink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
Jonas Traub
 
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream ProcessingScotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Jonas Traub
 
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
Jonas Traub
 

More from Jonas Traub (10)

Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream SlicingFlink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
 
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream ProcessingScotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
 
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
 
Efficient SIMD Vectorization for Hashing in OpenCL
Efficient SIMD Vectorization for Hashing in OpenCLEfficient SIMD Vectorization for Hashing in OpenCL
Efficient SIMD Vectorization for Hashing in OpenCL
 
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
 
JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...
JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...
JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...
 
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
 
I²: Interactive Real-Time Visualization for Streaming Data
I²: Interactive Real-Time Visualization for Streaming DataI²: Interactive Real-Time Visualization for Streaming Data
I²: Interactive Real-Time Visualization for Streaming Data
 
LWA 2015: The Apache Flink Platform (Poster)
LWA 2015: The Apache Flink Platform (Poster)LWA 2015: The Apache Flink Platform (Poster)
LWA 2015: The Apache Flink Platform (Poster)
 
LWA 2015: The Apache Flink Platform for Parallel Batch and Stream Analysis
LWA 2015: The Apache Flink Platform for Parallel Batch and Stream AnalysisLWA 2015: The Apache Flink Platform for Parallel Batch and Stream Analysis
LWA 2015: The Apache Flink Platform for Parallel Batch and Stream Analysis
 

Recently uploaded

Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...
Sérgio Sacani
 
Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...
Sérgio Sacani
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
University of Hertfordshire
 
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdfPests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
PirithiRaju
 
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Sérgio Sacani
 

Recently uploaded (20)

Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...
 
Microbial bio Synthesis of nanoparticles.pptx
Microbial bio Synthesis of nanoparticles.pptxMicrobial bio Synthesis of nanoparticles.pptx
Microbial bio Synthesis of nanoparticles.pptx
 
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
 
NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.
 
Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
 
INSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere UniversityINSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere University
 
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdfPests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 7) Microbiology in Everyday Life
GBSN - Microbiology (Unit 7) Microbiology in Everyday LifeGBSN - Microbiology (Unit 7) Microbiology in Everyday Life
GBSN - Microbiology (Unit 7) Microbiology in Everyday Life
 
Erythropoiesis- Dr.E. Muralinath-C Kalyan
Erythropoiesis- Dr.E. Muralinath-C KalyanErythropoiesis- Dr.E. Muralinath-C Kalyan
Erythropoiesis- Dr.E. Muralinath-C Kalyan
 
Hemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. MuralinathHemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. Muralinath
 
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
 
In-pond Race way systems for Aquaculture (IPRS).pptx
In-pond Race way systems for Aquaculture (IPRS).pptxIn-pond Race way systems for Aquaculture (IPRS).pptx
In-pond Race way systems for Aquaculture (IPRS).pptx
 
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
 
MODERN PHYSICS_REPORTING_QUANTA_.....pdf
MODERN PHYSICS_REPORTING_QUANTA_.....pdfMODERN PHYSICS_REPORTING_QUANTA_.....pdf
MODERN PHYSICS_REPORTING_QUANTA_.....pdf
 
A Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthA Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on Earth
 
Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...
Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...
Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...
 
SCHISTOSOMA HEAMATOBIUM life cycle .pdf
SCHISTOSOMA HEAMATOBIUM life cycle  .pdfSCHISTOSOMA HEAMATOBIUM life cycle  .pdf
SCHISTOSOMA HEAMATOBIUM life cycle .pdf
 
The Scientific names of some important families of Industrial plants .pdf
The Scientific names of some important families of Industrial plants .pdfThe Scientific names of some important families of Industrial plants .pdf
The Scientific names of some important families of Industrial plants .pdf
 
Lec 1.b Totipotency and birth of tissue culture.ppt
Lec 1.b Totipotency and birth of tissue culture.pptLec 1.b Totipotency and birth of tissue culture.ppt
Lec 1.b Totipotency and birth of tissue culture.ppt
 

BIRTE Panel at VLDB: Are we solving the core Problems in stream processing?

  • 1. 1 Jonas Traub, BIRTE @ VLDB, 20181 Jonas Traub, BIRTE @ VLDB, 2018 Are we solving the core problems in stream processing? Jonas Traub Technische Universität Berlin / DFKI IAM www.dima.tu-berlin.de | jonas.traub@tu-berlin.de Panel Discussion with: Manpreet Singh (Google) Karthik Ramasamy (Stremlio) C. Mohan (IBM) Badrish Chandramouli (Microsoft) Neng Lu (Twitter) Alok Pareek (Striim) Jonas Traub (TU-Berlin)
  • 2. 2 Jonas Traub, BIRTE @ VLDB, 20182 Jonas Traub, BIRTE @ VLDB, 2018 Are we solving the core problems in stream processing? Jonas Traub Technische Universität Berlin / DFKI IAM www.dima.tu-berlin.de | jonas.traub@tu-berlin.de
  • 3. 3 Jonas Traub, BIRTE @ VLDB, 20183 Jonas Traub, BIRTE @ VLDB, 2018 Are we solving the core problems in stream processing? Yes, we do!
  • 4. 4 Jonas Traub, BIRTE @ VLDB, 20184 Jonas Traub, BIRTE @ VLDB, 2018 Are we solving the core problems in stream processing? Yes, we do! Apache Flink and its success story What are the core problems and how are we solving them? Examples
  • 5. 5 Jonas Traub, BIRTE @ VLDB, 2018 5 5 Jonas Traub, BIRTE @ VLDB, 2018 Apache Flink Timeline
  • 6. 6 Jonas Traub, BIRTE @ VLDB, 2018 6 6 Jonas Traub, BIRTE @ VLDB, 2018
  • 7. 7 Jonas Traub, BIRTE @ VLDB, 2018 Apache Flink - Stateful Computations over Data Streams source: flink.apache.org • Event-driven Applications • Stream & Batch Analytics • Data Pipelines & ETL • Exactly-once state consistency • Event-time processing • Sophisticated late data handling • Scale-out architecture • Support for very large state • Incremental checkpointing
  • 8. 8 Jonas Traub, BIRTE @ VLDB, 20188 Jonas Traub, BIRTE @ VLDB, 2018 Examples: What are core problems and how are we solving them?
  • 9. 9 Jonas Traub, BIRTE @ VLDB, 20189 Jonas Traub, BIRTE @ VLDB, 2018 Examples: Expressiveness: Event-time processing and sophisticated late data handling The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive- Scale, Unbounded, Out-of-Order Data Processing (Akidau et al.)
  • 10. 10 Jonas Traub, BIRTE @ VLDB, 201810 Jonas Traub, BIRTE @ VLDB, 2018 Examples: Expressiveness: Event-time processing and sophisticated late data handling Service: Common APIs and feature sets The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive- Scale, Unbounded, Out-of-Order Data Processing (Akidau et al.) Apache Beam: An advanced unified programming model “Implement batch and streaming data processing jobs that run on any execution engine. (beam.apache.org)”
  • 11. 11 Jonas Traub, BIRTE @ VLDB, 201811 Jonas Traub, BIRTE @ VLDB, 2018 Examples: Consistency: Exactly-once state consistency Expressiveness: Event-time processing and sophisticated late data handling Service: Common APIs and feature sets Lightweight asynchronous snapshots for distributed dataflows P Carbone, G Fóra, S Ewen, S Haridi, K Tzoumas State management in Apache Flink: consistent stateful distributed stream processing P Carbone, S Ewen, G Fóra, S Haridi, S Richter, K Tzoumas The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive- Scale, Unbounded, Out-of-Order Data Processing (Akidau et al.) Apache Beam: An advanced unified programming model “Implement batch and streaming data processing jobs that run on any execution engine. (beam.apache.org)”