Event Streaming in Academia With John Desjardins | Current 2022
A systematic review and meta-analysis of event streaming and real-time data applications using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement were performed from an original inquiry of 540 articles from 12 digital libraries. An independent review of selected studies assessed the risk of bias and obtained data from research and academic studies.
The talk will cover the systematic review workflow and obtained results from the academic literature. It will demonstrate best practices of event streaming and real-time applications in academia and research communities using Google Scholar for scholarly literature search. The findings will be made available publicly on Github and arXiv.org
18. Anti-Pattern for Real-time Stream Processing
Data Lake Pattern
✦ Complex Architecture
✦ Long Lag from Data Birth to Value Creation
✦ Often There Are Changes to Data as It Is Consolidated
✦ Doesn’t Support Zero Downtime
✦ Costly to Implement and Operate
Ingest
“Raw”
Refine &
Enrich Cleansed
Real-time
Analytics &
Machine
Learning
Data Lakes
Batch
Loads
Data Is
Born
Action
Taken,
Value
Created
19. Source
Source
Source
Data Processing
Live Events
Analytics
Live Events
Analytics
Live Events
Analytics
Live Events
Analytics
Sink or Client
App
Unified Architecture
Scale Ingest/Data/Compute/ML - Together
Streaming Ingest Queries, Logic &
Machine Learning
Streaming Ingest
Queries, Logic &
Machine Learning
Streaming Ingest
Queries, Logic &
Machine Learning
Streaming Ingest Queries, Logic &
Machine Learning
Event Stream Input
MLOps
Runtimes
Both Compute & Data Are Partitioned. Compute Is Partition Aware.
20. Hazelcast Advantages
✦ Partitioning of Both Compute & Data = Maximizing Parallel & Distributed Architecture
✦ Data-Aware Processing = Data Locality – Less Movement of Data
✦ Collaborative Worksharing = Simplifies Scalability
✦ In-Memory Optimized = Further Drive Down Latency
✦ Simple Peer to Peer Architecture, Cloud-Native
✦ Multi-Region DR, Live Job Upgrades – Do Not Require Restarts – ZERO DOWNTIME
Additional Unique Features:
✦ Easier to Operate across Dev & Ops
✦ Great Developer Experience
✦ Cooperative Multi-threading - Efficiently Uses Multi-Core Processors