SlideShare a Scribd company logo
1 of 100
Download to read offline
Kai Wähner
Technology Evangelist
kontakt@kai-waehner.de
LinkedIn
@KaiWaehner
www.kai-waehner.de
Big Data Spain @ Madrid (November 2016)
Comparison of Streaming Analytics Frameworks
© Copyright 2000-2016 TIBCO Software Inc.
Key Take-Aways
• Streaming Analytics processes Data while it is in Motion!
• Automation and Proactive Human Interaction are BOTH needed!
• Streaming Analytics is Complementary to Hadoop and Machine Learning!
© Copyright 2000-2016 TIBCO Software Inc.
Agenda
• Real World Use Cases
• Introduction to Streaming Analytics
• Market Overview
• Relation to other Big Data Components
• Live Demo
© Copyright 2000-2016 TIBCO Software Inc.
Agenda
• Real World Use Cases
• Introduction to Streaming Analytics
• Market Overview
• Relation to other Big Data Components
• Live Demo
© Copyright 2000-2016 TIBCO Software Inc.
Analyze and Act on Critical Business Moments
© Copyright 2000-2016 TIBCO Software Inc.
Success Story
Predictive
Fault Management
© Copyright 2000-2013 TIBCO Software Inc.
“An outage on one well can cost $10M per
hour. We have 20-100 outages per year.“
- Drilling operations VP, major oil company
Data Monitoring
• Motor temperature
• Motor vibration
• Current
• Intake pressure
• Intake
temperature
Ø Flow
Electrical power cable
Pump
Intake
Protector
ESP motor
Pump monitoring unit
Electric Submersible
Pumps (ESP)
Predictive Analytics - Fault Management
Voltage
Temperature
Vibration
Device
history
Temporal analytic: “If vibration spike is followed by temp spike then
voltage spike [within 4 hours] then flag high severity alert.”
Predictive Analytics - Fault Management
© Copyright 2000-2016 TIBCO Software Inc.
Live Surveillance of Equipment
Continuous, live geospatial display of pump health
and predictive signal breeches
Alerts based on
predictive signals
Compare live readings and signals
to historical average and means
Continuous, live visualization of
stats per 100’s of wells
© Copyright 2000-2016 TIBCO Software Inc.
Success Story
Crowd Management
© Copyright 2000-2013 TIBCO Software Inc.
“Turn the customer into a fan and increase
revenue significantly.“
© Copyright 2000-2016 TIBCO Software Inc.
World’s Smartest Building
© Copyright 2000-2015 TIBCO Software Inc.
© Copyright 2000-2016 TIBCO Software Inc.
All Customers are different… Treat them that way…
14
Capture – Engage – Expand - Monetize
Patterns – Real time
MOREPERSONAL
MORE CONTEXT
social
CRM
POS
mobileweb
e-mails
© Copyright 2000-2016 TIBCO Software Inc.
Success Story
Smart Manufacturing
© Copyright 2000-2013 TIBCO Software Inc.
““For every 1% increase in shipped
product, we make $11MM in profit. The
demand is there, we just need to fulfill it.“
- Head of Quality, Solar Panel Manufacturer
Scenario: Predictive Scrapping of Parts in an Assembly Line
Goal: Scrap parts as early as possible automatically to reduce costs in a manufacturing process.
Question: When to scrap a part in Station 1 instead of doing re-work or sending it to Station 2?
Station 1 Station 2
Cost Before
9€
7€ 13€
Total Cost
29€
(or more)
Scrap? Scrap?
Machine Learning Applied to Sensor Events in Real Time
© Copyright 2000-2016 TIBCO Software Inc.
Example: Predictive Analytics for Manufacturing (“scrap parts as early as possible”)
© Copyright 2000-2016 TIBCO Software Inc.
Great success stories, but …
… how to realize these use cases?
© Copyright 2000-2016 TIBCO Software Inc.
Agenda
• Real World Use Cases
• Introduction to Streaming Analytics
• Market Overview
• Relation to other Big Data Components
• Live Demo
© Copyright 2000-2016 TIBCO Software Inc.
Traditional Data Processing: ”Request – Response”
Store
Analyze
Act
© Copyright 2000-2016 TIBCO Software Inc.
Traditional Data Processing: ”Request – Response”
• Data is collected from a variety of
sources, and placed in a persistent
store.
– Relational database.
– NoSQL store.
– Hadoop environment.
• Analytical processes are executed
against the stored data to detect
opportunities or threats.
• Actions are identified, delivered,
and executed across various
business channels.
Store
Analyze
Act
© Copyright 2000-2016 TIBCO Software Inc.
Traditional Data Processing: Challenges
Store
Analyze
Act
• Introduces too much “decision
latency” into the business.
• Responses are delivered “after-the-
fact”.
• Maximum value of the identified
situation is lost.
– Cross-sell / up-sell opportunities are
lost, impending equipment failure is
missed, business processes are slow
to respond and lack timely context.
• Decisions are made on old and stale
data.
© Copyright 2000-2016 TIBCO Software Inc.
Event Value Decreases Over TimeValue
Time
© Copyright 2000-2016 TIBCO Software Inc.
Event Value Decreases Over TimeValue
Time
• Events are often most
valuable “close to” the
point of collection.
• As time passes, events tend
to lose their value.
• The ability to proactively
identify “threats” or
“opportunities” will typically
decrease.
• Real-time capability is
needed to maximize event
value.
© Copyright 2000-2016 TIBCO Software Inc.
The New Era: Streaming Analytics
Act &
Monitor
Analyze
Store
© Copyright 2000-2016 TIBCO Software Inc.
The New Era: Streaming Analytics
• Events are analyzed and processed in
real-time as they arrive.
• Decisions are timely, contextual, and
based on fresh data.
• Decision latency is eliminated, resulting
in:
ü Superior Customer Experience
ü Operational Excellence
ü Instant Awareness and Timely Decisions
Act &
Monitor
Analyze
Store
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics: What Is A “Stream”?
Clickstream
Sensors
Social Data
Logs
• Consists of pieces of data
typically generated due to a
change of state.
• One or more identifiers
• Timestamp & payload
• Immutable
• Typically unbounded; there is no
end to the data.
• Batch dataset: “bounded”.
• Can be raw or derived.
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics Processing Pipeline
APIs
Adapters /
Channels
Integration
Messaging
Stream Ingest
Transformation
Aggregation
Enrichment
Filtering
Stream
Preprocessing
Process
Management
Analytics
(Real Time)
Applications
& APIs
Analytics /
DW Reporting
Stream
Outcomes
• Contextual Rules
• Windowing
• Patterns
• Deep ML
• Analytics
• …
Stream Analytics &
Processing
Index / SearchNormalization
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics Processing Pipeline
Separation of concerns
to easily adjust one part in response to
changing business requirements
without the need for rewriting other parts!
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics: Ingest
APIs
Adapters /
Channels
Integration
Messaging
Stream Ingest
• Stream data may come from a number sources,
either at the edge, in the data center, or via the
cloud.
• Need to handle a variety of data formats and protocols, all at global
scale.
• Pay attention to “event time” vs. “processing time”
!!
• Event Time: Time the event was created.
• Processing Time: Time the event was received or processed.
• Event time is typically more relevant, and will lead
to more predictable results.
• Eliminate time skew associated with clock synchronization, system
outages, processing latency, network issues, etc.
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics: Preprocessing
Transformation
Aggregation
Enrichment
Filtering
Stream
Preprocessing
Normalization • Stream data often needs to be manipulated before it is
processed by downstream components.
• Normalization
• Transformation
• May filter unwanted events close to the source to
eliminate “noise”.
• Events may also be enriched with additional context to
provide additional data for further processing.
• Customer details, equipment details, location information, etc.
• Data may be stored in a high-speed cache or other store for rapid
access.
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics: Processing
Batch
• Transform
• Deep ML
• Analytics
• Data Lake
• …
Stream Analytics &
Processing
Real-Time
• RT Analytics
• Contextual
Rules
• Windowing
• Patterns
• …
• Streams may be immediately pushed to a data lake.
• May be raw or preprocessed.
• Used for subsequent analysis as part of an immutable data layer.
• Typically processed in batch in this part of the architecture.
• In parallel, streams may be processed in real-time
against a number of constructs.
• Real-time analytics.
• Graph analysis / Geo Analysis
• Rules.
• Results from the real-time processing may be fed into
the batch component.
• The results of batch processing may also be pushed into the real-
time layer.
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics Processing Pipeline
APIs
Adapters /
Channels
Integration
Messaging
Stream Ingest
Transformation
Aggregation
Enrichment
Filtering
Stream
Preprocessing
Process
Management
Analytics
(Real Time)
Applications
& APIs
Analytics /
DW Reporting
Stream
Outcomes
• Contextual Rules
• Windowing
• Patterns
• Deep ML
• Analytics
• …
Stream Analytics &
Processing
Index / SearchNormalization
© Copyright 2000-2016 TIBCO Software Inc.
Dataflow Streaming Pipeline – Extract, Transform, Load in Real Time
https://www.linkedin.com/pulse/data-pipeline-hadoop-part-1-2-birender-saini
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics Processing Pipeline
APIs
Adapters /
Channels
Integration
Messaging
Stream Ingest
Transformation
Aggregation
Enrichment
Filtering
Stream
Preprocessing
Process
Management
Analytics
(Real Time)
Applications
& APIs
Analytics /
DW Reporting
Stream
Outcomes
• Contextual Rules
• Windowing
• Patterns
• Deep ML
• Analytics
• …
Stream Analytics &
Processing
Index / SearchNormalization
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics: “Windows”
https://www.oreilly.com/ideas/the-world-beyond-batch-streaming-101
© Copyright 2000-2016 TIBCO Software Inc.
Automation and Augmented Intelligence for Humans
Actions by Operations
Human	decisions	in	real	time	informed	
by	up	to	date	information
38
Automated	action	based	on	models	of	history	
combined	with	live	context	and	business	rules
Machine-to-Machine Automation
Big Data Reference Architecture
Augmented	Intelligence
Operations
SENSOR DATA
TRANSACTIONS
MESSAGE BUS
MACHINE DATA
SOCIAL DATA
Streaming	AnalyticsAction
Aggregate
Rules
Stream	Processing
Analytics
Correlate
Continuous	query	
processing
Alerts
Manual	action,	
escalation
Data	Discovery
Python
R
Data	
Scientists
Cleansed
Data
History
Visual	Analytics
Spark
Integration
ERP MDM DB WMS
SOA	/	Microservices
BIG	DATA
Data	Warehouse,	Hadoop
Internal	Data
Integration	Bus
API
Event	Server
H2O.ai
Live	UI
© Copyright 2000-2016 TIBCO Software Inc.
Agenda
• Real World Use Cases
• Introduction to Streaming Analytics
• Market Overview
• Relation to other Big Data Components
• Live Demo
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics Market Growing Significantly
“Everything Flows:
The value of stream processing
and streaming integration”
(September 2016)
http://hortonworks.com/info/value-streaming-integration/
© Copyright 2000-2016 TIBCO Software Inc.
Alternatives for Stream Processing
Time
to
Market
Streaming
Frameworks
Streaming
Products
Slow Fast
Streaming
Concepts
IncludesIncludes
© Copyright 2000-2016 TIBCO Software Inc.
Alternatives for Stream Processing
Concepts (Continuous Queries, Sliding Windows)
Patterns (Counting, Sequencing, Tracking, Trends)
Build everything by yourself! L
Time
to
Market
Streaming
Frameworks
Streaming
Products
Slow Fast
Streaming
Concepts
© Copyright 2000-2016 TIBCO Software Inc.
Usually not an option ...
… as there are a lot of
Frameworks and
Products available!
© Copyright 2000-2016 TIBCO Software Inc.
Alternatives for Stream Processing
Library (Java, .NET, Python)
Query Language (often similar to SQL)
Scalability (horizontal and vertical, fail over)
Connectivity (technologies, markets, products)
Operators (Filter, Sort, Aggregate)
Time
to
Market
Streaming
Frameworks
Streaming
Products
Slow Fast
Streaming
Concepts
Different frameworks
(ingest, preprocess, analytics)
combined!
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics Processing Pipeline
APIs
Adapters /
Channels
Integration
Messaging
Stream Ingest
Transformation
Aggregation
Enrichment
Filtering
Stream
Preprocessing
Process
Management
Analytics
(Real Time)
Applications
& APIs
Analytics /
DW Reporting
Stream
Outcomes
• Contextual Rules
• Windowing
• Patterns
• Deep ML
• Analytics
• …
Stream Analytics &
Processing
Index / SearchNormalization
© Copyright 2000-2016 TIBCO Software Inc.
Example for an Open Source Streaming Pipeline
http://hortonworks.com/hadoop-tutorial/realtime-event-processing-nifi-kafka-storm
“Realtime Event Processing in Hadoop with Apache NiFi, Kafka and Storm”
Dataflow Streaming Pipeline (Ingest, Preprocess)
Augmented	Intelligence
Operations
SENSOR DATA
TRANSACTIONS
MESSAGE BUS
MACHINE DATA
SOCIAL DATA
Streaming	AnalyticsAction
Aggregate
Rules
Stream	Processing
Analytics
Correlate
Continuous	query	
processing
Alerts
Manual	action,	
escalation
Data	Discovery
Python
R
Data	
Scientists
Cleansed
Data
History
Visual	Analytics
Spark
Integration
ERP MDM DB WMS
SOA	/	Microservices
BIG	DATA
Data	Warehouse,	Hadoop
Internal	Data
Integration	Bus
API
Event	Server
H2O.ai
Live	UI
© Copyright 2000-2016 TIBCO Software Inc.
Open Source Dataflow Streaming Pipelines
Streaming Analytics
Augmented	Intelligence
Operations
SENSOR DATA
TRANSACTIONS
MESSAGE BUS
MACHINE DATA
SOCIAL DATA
Streaming	AnalyticsAction
Aggregate
Rules
Stream	Processing
Analytics
Correlate
Continuous	query	
processing
Alerts
Manual	action,	
escalation
Data	Discovery
Python
R
Data	
Scientists
Cleansed
Data
History
Visual	Analytics
Spark
Integration
ERP MDM DB WMS
SOA	/	Microservices
BIG	DATA
Data	Warehouse,	Hadoop
Internal	Data
Integration	Bus
API
Event	Server
H2O.ai
Live	UI
© Copyright 2000-2016 TIBCO Software Inc.
Frameworks and Products (no complete list!)
OPEN SOURCE CLOSED SOURCE
PRODUCT
FRAMEWORK
Azure Microsoft
Stream Analytics
Google Cloud
Dataflow
© Copyright 2000-2016 TIBCO Software Inc.
Frameworks and Products (no complete list!)
OPEN SOURCE CLOSED SOURCE
PRODUCT
FRAMEWORK
Azure Microsoft
Stream Analytics
Google Cloud
Dataflow
© Copyright 2000-2016 TIBCO Software Inc.
Apache Storm
Spout Bolt
© Copyright 2000-2016 TIBCO Software Inc.
Apache Storm – Hello World
http://wpcertification.blogspot.ch/2014/02/helloworld-apache-storm-word-counter.html
© Copyright 2000-2016 TIBCO Software Inc.
AWS Kinesis – Integration with other AWS Components
https://aws.amazon.com/kinesis/
AWS S3 RedShift DynamoDB
© Copyright 2000-2016 TIBCO Software Inc.
AWS Kinesis – Hello World
© Copyright 2000-2016 TIBCO Software Inc.
AWS Kinesis – Public Cloud Trade-Off
… is easy to setup and scale.
But you do not have full control! L
• Any data that is older than 24 hours is automatically deleted
• Every Kinesis application consists of just one procedure, so you can’t use Kinesis
to perform complex stream processing unless you connect multiple applications
• Kinesis can only support a maximum size of 50KB for each data item
http://diamondstream.com/amazon-kinesis-big-real-time-data-processing-solution/
(blog post from 2014, might be outdated, but shows that you do not have full control over a cloud service)
© Copyright 2000-2016 TIBCO Software Inc.
Apache Spark
General Data-processing Framework
à However, focus is especially on Analytics (these days)
x
© Copyright 2000-2016 TIBCO Software Inc.
Apache Spark – Focus on Analytics
http://aptuz.com/blog/is-apache-spark-going-to-replace-hadoop/
http://fortune.com/2015/09/09/cloudera-spark-mapreduce/
http://www.ebaytechblog.com/2014/05/28/using-spark-to-ignite-data-analytics/
http://www.forbes.com/sites/paulmiller/2015/06/15/ibm-backs-apache-spark-for-big-data-analytics/
“[IBM’s initiatives] include:
• deepening the integration between Apache
Spark and existing IBM products like the
Watson Health Cloud;
• open sourcing IBM’s existing SystemML
machine learning technology;
© Copyright 2000-2016 TIBCO Software Inc.
Spark Streaming
Spark Streaming
• is no real streaming solution
• uses micro-batches
• cannot process data in real-time (i.e. no ultra-low latency)
• allows easy combination with other Spark components (SQL, Machine Learning, etc.)
© Copyright 2000-2016 TIBCO Software Inc.
Apache Spark – Hello World
Spark Streaming API
Spark Core API
© Copyright 2000-2016 TIBCO Software Inc.
Apache Spark – as a Cloud Service
© Copyright 2000-2016 TIBCO Software Inc.
Apache Flink
Spark Streaming
• „Newcomer“
• Looks very similar to Spark
• But „Streaming First“ concept
© Copyright 2000-2016 TIBCO Software Inc.
Apache Beam
Generic API with unified programming model for stream processing frameworks
http://www.slideshare.net/DataTorrent/apache-beam-incubating-67428372
© Copyright 2000-2016 TIBCO Software Inc.
Frameworks and Products (no complete list!)
OPEN SOURCE CLOSED SOURCE
PRODUCT
FRAMEWORK
Azure Microsoft
Stream Analytics
Google Cloud
Dataflow
Alternatives for Stream Processing
Library (Java, .NET, Python)
Query Language (often similar to SQL)
Scalability (horizontal and vertical, fail over)
Connectivity (technologies, markets, products)
Operators (Filter, Sort, Aggregate)
Time
to
Market
Streaming
Frameworks
Streaming
Products
Slow Fast
Streaming
Concepts
Single Tool (Complete Processing Pipeline)
Visual IDE (Dev, Test, Debug)
Simulation (Feed Testing, Test Generation)
Live UI (monitoring, proactive interaction)
Maturity (24/7 support, consulting)
Integration (out-of-the-box: ESB, MDM, Analytics, etc.)
© Copyright 2000-2016 TIBCO Software Inc.
Streaming Analytics Processing Pipeline
APIs
Adapters /
Channels
Integration
Messaging
Stream Ingest
Transformation
Aggregation
Enrichment
Filtering
Stream
Preprocessing
Process
Management
Analytics
(Real Time)
Applications
& APIs
Analytics /
DW Reporting
Stream
Outcomes
• Contextual Rules
• Windowing
• Patterns
• Deep ML
• Analytics
• …
Stream Analytics &
Processing
Index / SearchNormalization
Dataflow Streaming Pipeline + Streaming Analytics
Augmented	Intelligence
Operations
SENSOR DATA
TRANSACTIONS
MESSAGE BUS
MACHINE DATA
SOCIAL DATA
Streaming	AnalyticsAction
Aggregate
Rules
Stream	Processing
Analytics
Correlate
Continuous	query	
processing
Alerts
Manual	action,	
escalation
Data	Discovery
Python
R
Data	
Scientists
Cleansed
Data
History
Visual	Analytics
Spark
Integration
ERP MDM DB WMS
SOA	/	Microservices
BIG	DATA
Data	Warehouse,	Hadoop
Internal	Data
Integration	Bus
API
Event	Server
H2O.ai
Live	UI
© Copyright 2000-2016 TIBCO Software Inc.
IBM Streams
© Copyright 2000-2016 TIBCO Software Inc.
TIBCO StreamBase
• Performance: Latency, Throughput, Scalability
• Multi-threaded and clustered server from version 1
• High throughput: Millions of messages, 100,000s of quotes, 10,000s of orders
• Low-latency: microsecond latency for algo trading, pre-trade risk, market data
• Take Advantage of High Performance Hardware
• Multicore (12, 24, 32 core) large memory (10s of gigabytes)
• 64-bit Linux, Windows, Solaris deployment
• Hardware acceleration (GPU, Solace, Tervela)
• Enterprise Deployment
• High availability and fault tolerance
• Distributed state management for large data sets
• Management and monitoring tools
• Security and entitlements Integration
• Continuous deployment and QA Process Support
StreamSQL	compiler	
and	static	optimizer
In	process,	in	thread	
adapter	architecture
Visual	parallelism	
and	scaling
In-Memory	Data	Grid	
integration	for	
distributed	shared	state
Data	parallelism	
and	dispatch
StreamBase	Server	
Innovations
© Copyright 2000-2016 TIBCO Software Inc.
TIBCO StreamBase - Visual Programming
Aggregate
Capture	card	activations	per	
location
Sales	too	high
à Fraud
Log	to	any	
database
No	Fraud
Sales	too	high?
Visual Debugger
Feed Simulation
Unit Testing
StreamBase Development StudioTIBCO StreamBase - Visual Programming
Live UI for Augmented Intelligence
Augmented	Intelligence
Operations
SENSOR DATA
TRANSACTIONS
MESSAGE BUS
MACHINE DATA
SOCIAL DATA
Streaming	AnalyticsAction
Aggregate
Rules
Stream	Processing
Analytics
Correlate
Continuous	query	
processing
Alerts
Manual	action,	
escalation
Data	Discovery
Python
R
Data	
Scientists
Cleansed
Data
History
Visual	Analytics
Spark
Integration
ERP MDM DB WMS
SOA	/	Microservices
BIG	DATA
Data	Warehouse,	Hadoop
Internal	Data
Integration	Bus
API
Event	Server
H2O.ai
Live	UI
© Copyright 2000-2016 TIBCO Software Inc.
Live User Interface
Live UI
Continuous Query Processor Alerts
CEP
MQTT
JMS
In-Memory	Data	Grid
Integration
Social	Media	Data
Market	Data
Sensor	Data
Historical	
Data
In-Memory	Data	Grid
Enterprise	
dataMarket Data
IoT
Mobile
Social
Browser / App
Command & Control
ACTION
Continuous	Query
© Copyright 2000-2016 TIBCO Software Inc.
Live UI in Desktop / Web Browser / Mobile App
Dynamic aggregation
Live visualization
Ad-hoc continuous query
Alerts
Action
© Copyright 2000-2016 TIBCO Software Inc.
Live UI - Products
Characteristics to Check
• Alternative clients (rich client, browser,
mobile app)
• Maturity for enterprise use cases
• Performance and scalability
• “Big data native” deployment (YARN, Mesos)
• Monitoring and proactive actions
• Streaming engine under the hood (not just
visualization layer)
• New Ad-hoc queries by the business user
(without the help of IT department)
• Various visual components
• Extendibility (graphical designer vs. coding)
… or build your own solution using Websockets, Angular JS, etc.
© Copyright 2000-2016 TIBCO Software Inc.
Spoilt for Choice
Does it make sense to
combine frameworks
and products?
© Copyright 2000-2016 TIBCO Software Inc.
Customer Example: Apache Storm + TIBCO Live Datamart
External
Data
Snapshot
Results
Continuous Query Processor
Query
TIBCO Live Datamart
Continuous
Alerting
Active Tables Active Tables
Continuous
Updates
Clients
Message
Bus
Public
Data
Customer
Data
StreamBase
Bolt
StreamBase
Spout
Operational
Data
StreamBase Bolt and Spout connect
Apache Storm to StreamBase to provide
real-time analytics on operational data
© Copyright 2000-2016 TIBCO Software Inc.
Agenda
• Real World Use Cases
• Introduction to Streaming Analytics
• Market Overview
• Relation to other Big Data Components
• Live Demo
© Copyright 2000-2016 TIBCO Software Inc.
Closed Loop: Understand – Anticipate – Act
© Copyright 2000-2016 TIBCO Software Inc.
Closed Loop: Understand – Anticipate – Act
Insights Actions
MONITOR
PREDICT
ACT
DECIDE
MODEL
ORGANIZE
ANALYZE
WRANGLE
Data Discovery via Visual Analytics, Big Data and Machine Learning
Augmented	Intelligence
Operations
SENSOR DATA
TRANSACTIONS
MESSAGE BUS
MACHINE DATA
SOCIAL DATA
Streaming	AnalyticsAction
Aggregate
Rules
Stream	Processing
Analytics
Correlate
Continuous	query	
processing
Alerts
Manual	action,	
escalation
Data	Discovery
Python
R
Data	
Scientists
Cleansed
Data
History
Visual	Analytics
Spark
Integration
ERP MDM DB WMS
SOA	/	Microservices
BIG	DATA
Data	Warehouse,	Hadoop
Internal	Data
Integration	Bus
API
Event	Server
H2O.ai
Live	UI
Find Insights and Patterns in Historical Data
Visual Analytics + Machine Learning
Apply Insights and Analytic Models to Proactive Actions
Streaming
AnalyticsH20.ai
Open Source
R
TERR
Spark ML
MATLAB
SAS
PMML
© Copyright 2000-2013 TIBCO Software Inc.
80% of betting happens
AFTER the game begins
TODAY
Case Study: Streaming Analytics for Betting
• Situation: Today, 80% of Betting is Done After the
Game Starts
• It’s not your father’s bookie anymore!
• Problem: How to Analyze Big Betting Data?
• Thousands of concurrent games, constantly adjusting odds, dozens of
betting networks – firms must correlate millions of events a day to
find the best betting opportunities in real-time
• Solution: TIBCO for Fast Data Architecture
• TXOdds uses TIBCO to correlate, aggregate, and analyze large
volumes of streaming betting data in real-time and publish innovative
predictive betting analytics to their customers
• Result: TXOdds First to Market with Innovative Zero
Latency Betting Analytics
• Innovative real-time analytics help players who can process electronic
data in real-time the edge
“With StreamBase, in two
months we had our first
betting analytics feed live,
and we continually deploy
new ideas and evolve our
old ones.”
- Alex Kozlenkov, VP of technology,
TXOdds
© Copyright 2000-2016 TIBCO Software Inc.
Big Data Architecture for Streaming Betting Analytics
Event Processing
MONITOR
REAL-TIME ANALYTICS
AGGREGATE
HISTORICAL COMPARISON
Predictive
odds analytics
Zero Latency
Betting Analytics
GLOBAL, DISTRIBUTED INFRASTRUCTURE
Historical odds
deviations
B
U
S
BETTING LINES
SCORES
NEWS
HADOOP
Context:
Historical Betting
Data, Odds,
Outcomes
B
U
S
CACHE CACHE CACHE
Real-Time Analytics
CORRELATE
Live Datamart
SOCIAL
Real-Time Social Media Analytics
Twitter
(#TomBradyBrokenLeg)
Twitter (#Boston)
Brady’s
Stats
Actionable
Insights
Twitter (#NFL)
Something relevant happening?
Every second counts!
Change Odds (automated or manually triggered):
Stop live-betting for the current running game?
• Who will win the game?
• How many interceptions will the Quarterback throw?
• Will the Patriots win the Super Bowl?
• …
© Copyright 2000-2016 TIBCO Software Inc.
Real-Time Social Media Analytics
© Copyright 2000-2016 TIBCO Software Inc.
Agenda
• Real World Use Cases
• Introduction to Streaming Analytics
• Market Overview
• Relation to other Big Data Components
• Live Demo
Scenario: Predictive Scrapping of Parts in an Assembly Line
Goal: Scrap parts as early as possible automatically to reduce costs in a manufacturing process.
Question: When to scrap a part in Station 1 instead of doing re-work or sending it to Station 2?
Station 1 Station 2
Cost Before
9€
7€ 13€
Total Cost
29€
(or more)
Scrap? Scrap?
Big Data Architecture for Predictive Maintenance
Operational	Analytics
Operations
Live	UI
CSV Batch
JSON Real Time
XML Real Time
Streaming	AnalyticsAction
Aggregate
Rules
Analytics
Correlate
Live	Datamart
Continuous	query	
processing
Alerts
Manual	action,	
escalation
HISTORICAL	ANALYSIS Data	
Scientists
Flume
HDFS
Spotfire
R	/	TERR
HDFS
Hadoop (Cloudera)
StreamBase
TIBCO Fast Data Platform
H2O
Oracle	RDBMS
Avro Parquet … PMML
Internal	Data
Find Patterns à TIBCO Spotfire with H2O Integration
© Copyright 2000-2016 TIBCO Software Inc.
Example: Predictive Analytics for Manufacturing (“scrap parts as early as possible”)
© Copyright 2000-2016 TIBCO Software Inc.
Apply Patterns à TIBCO StreamBase Connector for H2O.ai
Monitor Patterns à TIBCO Live Datamart
Augmented Intelligence (“Monitor the manufacturing process and change rules in real time!”)
Live Dartmart Desktop Client
Monitor Patterns à TIBCO Live Datamart
Augmented Intelligence (“Monitor the manufacturing process and change rules in real time!”)
Live Dartmart Web API
TIBCO Spotfire + StreamBase + Live Datamart + H2O.ai
Live DemoLive Demo
© Copyright 2000-2016 TIBCO Software Inc.
Key Take-Aways
• Streaming Analytics processes Data while it is in Motion!
• Automation and Proactive Human Interaction are BOTH needed!
• Streaming Analytics is Complementary to Hadoop and Machine Learning!
Questions? Please contact me!
Kai Wähner
Technology Evangelist
kontakt@kai-waehner.de
@KaiWaehner
www.kai-waehner.de
LinkedIn

More Related Content

What's hot

Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...Big Data Spain
 
Druid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiDruid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiBrian Olsen
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on HadoopTyler Mitchell
 
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...Big Data Spain
 
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Databricks
 
VP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraVP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraBig Data Spain
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data ArchitectureGuido Schmutz
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationDatabricks
 
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...Altan Khendup
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016StampedeCon
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
End to End Supply Chain Control Tower
End to End Supply Chain Control TowerEnd to End Supply Chain Control Tower
End to End Supply Chain Control TowerDatabricks
 
Inside open metadata—the deep dive
Inside open metadata—the deep diveInside open metadata—the deep dive
Inside open metadata—the deep diveDataWorks Summit
 
Securing and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industrySecuring and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industryDataWorks Summit
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...DataWorks Summit
 
The modern analytics architecture
The modern analytics architectureThe modern analytics architecture
The modern analytics architectureJoseph D'Antoni
 
Optimizing industrial operations using the big data ecosystem
Optimizing industrial operations using the big data ecosystemOptimizing industrial operations using the big data ecosystem
Optimizing industrial operations using the big data ecosystemDataWorks Summit
 
Building Custom Big Data Integrations
Building Custom Big Data IntegrationsBuilding Custom Big Data Integrations
Building Custom Big Data IntegrationsPat Patterson
 

What's hot (20)

Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...
 
Druid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiDruid Overview by Rachel Pedreschi
Druid Overview by Rachel Pedreschi
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on Hadoop
 
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
 
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
 
VP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraVP of WW Partners by Alan Chhabra
VP of WW Partners by Alan Chhabra
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop Migration
 
Intuit Analytics Cloud 101
Intuit Analytics Cloud 101Intuit Analytics Cloud 101
Intuit Analytics Cloud 101
 
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
End to End Supply Chain Control Tower
End to End Supply Chain Control TowerEnd to End Supply Chain Control Tower
End to End Supply Chain Control Tower
 
Inside open metadata—the deep dive
Inside open metadata—the deep diveInside open metadata—the deep dive
Inside open metadata—the deep dive
 
Securing and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industrySecuring and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industry
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
 
The modern analytics architecture
The modern analytics architectureThe modern analytics architecture
The modern analytics architecture
 
Zero Downtime App Deployment using Hadoop
Zero Downtime App Deployment using HadoopZero Downtime App Deployment using Hadoop
Zero Downtime App Deployment using Hadoop
 
Optimizing industrial operations using the big data ecosystem
Optimizing industrial operations using the big data ecosystemOptimizing industrial operations using the big data ecosystem
Optimizing industrial operations using the big data ecosystem
 
Building Custom Big Data Integrations
Building Custom Big Data IntegrationsBuilding Custom Big Data Integrations
Building Custom Big Data Integrations
 

Viewers also liked

TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...Big Data Spain
 
Inferring the effect of an event using CausalImpact by Kay H. Brodersen
Inferring the effect of an event using CausalImpact by Kay H. BrodersenInferring the effect of an event using CausalImpact by Kay H. Brodersen
Inferring the effect of an event using CausalImpact by Kay H. BrodersenBig Data Spain
 
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...Big Data Spain
 
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan GatesApache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan GatesBig Data Spain
 
Managing Data Science by David Martínez Rego
Managing Data Science by David Martínez RegoManaging Data Science by David Martínez Rego
Managing Data Science by David Martínez RegoBig Data Spain
 
Growing Data Scientists by Amparo Alonso Betanzos
Growing Data Scientists by Amparo Alonso BetanzosGrowing Data Scientists by Amparo Alonso Betanzos
Growing Data Scientists by Amparo Alonso BetanzosBig Data Spain
 
RUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey Kharlamov
RUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey KharlamovRUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey Kharlamov
RUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey KharlamovBig Data Spain
 
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 Turning an idea into a Data-Driven Production System: An Energy Load Forecas... Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...Big Data Spain
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezBig Data Spain
 
Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...
Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...
Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...Big Data Spain
 

Viewers also liked (10)

TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
 
Inferring the effect of an event using CausalImpact by Kay H. Brodersen
Inferring the effect of an event using CausalImpact by Kay H. BrodersenInferring the effect of an event using CausalImpact by Kay H. Brodersen
Inferring the effect of an event using CausalImpact by Kay H. Brodersen
 
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
 
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan GatesApache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
 
Managing Data Science by David Martínez Rego
Managing Data Science by David Martínez RegoManaging Data Science by David Martínez Rego
Managing Data Science by David Martínez Rego
 
Growing Data Scientists by Amparo Alonso Betanzos
Growing Data Scientists by Amparo Alonso BetanzosGrowing Data Scientists by Amparo Alonso Betanzos
Growing Data Scientists by Amparo Alonso Betanzos
 
RUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey Kharlamov
RUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey KharlamovRUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey Kharlamov
RUNNING A PETASCALE DATA SYSTEM: GOOD, BAD, AND UGLY CHOICES by Alexey Kharlamov
 
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 Turning an idea into a Data-Driven Production System: An Energy Load Forecas... Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier Dominguez
 
Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...
Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...
Open data : from Insight to Visualisation with Google BigQuery and Carto.com ...
 

Similar to Stream Processing as Game Changer for Big Data and Internet of Things by Kai Wahner

TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...Nelson Petracek
 
Streaming Analytics - Comparison of Open Source Frameworks and Products
Streaming Analytics - Comparison of Open Source Frameworks and ProductsStreaming Analytics - Comparison of Open Source Frameworks and Products
Streaming Analytics - Comparison of Open Source Frameworks and ProductsKai Wähner
 
Smart Manufacturing and Industry 4.0 - Tibco PoV
Smart Manufacturing and Industry 4.0 - Tibco PoVSmart Manufacturing and Industry 4.0 - Tibco PoV
Smart Manufacturing and Industry 4.0 - Tibco PoVNicola Sandoli
 
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...Matt Stubbs
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...Chief Analytics Officer Forum
 
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025Nicola Sandoli
 
When Streaming Becomes Strategic
When Streaming Becomes StrategicWhen Streaming Becomes Strategic
When Streaming Becomes StrategicMapR Technologies
 
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...Codemotion
 
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...Kai Wähner
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessInside Analysis
 
Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...
Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...
Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...Kai Wähner
 
Digital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming EraDigital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming EraAttunity
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaKai Wähner
 
Findability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindwise
 
Assessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesAssessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesDATAVERSITY
 
Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...
Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...
Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...NoSQLmatters
 
Apply Machine Learning to Microservices
Apply Machine Learning to MicroservicesApply Machine Learning to Microservices
Apply Machine Learning to MicroservicesKai Wähner
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Inside Analysis
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...BigDataEverywhere
 
From an experiment to a real production environment
From an experiment to a real production environmentFrom an experiment to a real production environment
From an experiment to a real production environmentDataWorks Summit
 

Similar to Stream Processing as Game Changer for Big Data and Internet of Things by Kai Wahner (20)

TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
 
Streaming Analytics - Comparison of Open Source Frameworks and Products
Streaming Analytics - Comparison of Open Source Frameworks and ProductsStreaming Analytics - Comparison of Open Source Frameworks and Products
Streaming Analytics - Comparison of Open Source Frameworks and Products
 
Smart Manufacturing and Industry 4.0 - Tibco PoV
Smart Manufacturing and Industry 4.0 - Tibco PoVSmart Manufacturing and Industry 4.0 - Tibco PoV
Smart Manufacturing and Industry 4.0 - Tibco PoV
 
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
 
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
 
When Streaming Becomes Strategic
When Streaming Becomes StrategicWhen Streaming Becomes Strategic
When Streaming Becomes Strategic
 
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
 
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...
NoSQL in Practice with TIBCO: Real World Use Cases and Customer Success Stori...
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven Business
 
Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...
Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...
Real World Use Cases and Success Stories for In-Memory Data Grids (TIBCO Acti...
 
Digital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming EraDigital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming Era
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
 
Findability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learning
 
Assessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesAssessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use Cases
 
Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...
Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...
Kai Wähner – Real World Use Cases for Realtime In-Memory Computing - NoSQL ma...
 
Apply Machine Learning to Microservices
Apply Machine Learning to MicroservicesApply Machine Learning to Microservices
Apply Machine Learning to Microservices
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
From an experiment to a real production environment
From an experiment to a real production environmentFrom an experiment to a real production environment
From an experiment to a real production environment
 

More from Big Data Spain

Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017Big Data Spain
 
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...Big Data Spain
 
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
AI: The next frontier by Amparo Alonso at Big Data Spain 2017AI: The next frontier by Amparo Alonso at Big Data Spain 2017
AI: The next frontier by Amparo Alonso at Big Data Spain 2017Big Data Spain
 
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017Big Data Spain
 
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...Big Data Spain
 
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...Big Data Spain
 
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...Big Data Spain
 
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...Big Data Spain
 
State of the art time-series analysis with deep learning by Javier Ordóñez at...
State of the art time-series analysis with deep learning by Javier Ordóñez at...State of the art time-series analysis with deep learning by Javier Ordóñez at...
State of the art time-series analysis with deep learning by Javier Ordóñez at...Big Data Spain
 
Trading at market speed with the latest Kafka features by Iñigo González at B...
Trading at market speed with the latest Kafka features by Iñigo González at B...Trading at market speed with the latest Kafka features by Iñigo González at B...
Trading at market speed with the latest Kafka features by Iñigo González at B...Big Data Spain
 
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...Big Data Spain
 
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a... The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...Big Data Spain
 
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...Big Data Spain
 
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017Big Data Spain
 
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...Big Data Spain
 
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...Big Data Spain
 
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...Big Data Spain
 
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...Big Data Spain
 
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...Big Data Spain
 
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017Big Data Spain
 

More from Big Data Spain (20)

Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
 
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
 
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
AI: The next frontier by Amparo Alonso at Big Data Spain 2017AI: The next frontier by Amparo Alonso at Big Data Spain 2017
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
 
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
 
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
 
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
 
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
 
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
 
State of the art time-series analysis with deep learning by Javier Ordóñez at...
State of the art time-series analysis with deep learning by Javier Ordóñez at...State of the art time-series analysis with deep learning by Javier Ordóñez at...
State of the art time-series analysis with deep learning by Javier Ordóñez at...
 
Trading at market speed with the latest Kafka features by Iñigo González at B...
Trading at market speed with the latest Kafka features by Iñigo González at B...Trading at market speed with the latest Kafka features by Iñigo González at B...
Trading at market speed with the latest Kafka features by Iñigo González at B...
 
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
 
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a... The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
 
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
 
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
 
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
 
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
 
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
 
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
 
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
 

Recently uploaded

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 

Recently uploaded (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 

Stream Processing as Game Changer for Big Data and Internet of Things by Kai Wahner

  • 1.
  • 2. Kai Wähner Technology Evangelist kontakt@kai-waehner.de LinkedIn @KaiWaehner www.kai-waehner.de Big Data Spain @ Madrid (November 2016) Comparison of Streaming Analytics Frameworks
  • 3. © Copyright 2000-2016 TIBCO Software Inc. Key Take-Aways • Streaming Analytics processes Data while it is in Motion! • Automation and Proactive Human Interaction are BOTH needed! • Streaming Analytics is Complementary to Hadoop and Machine Learning!
  • 4. © Copyright 2000-2016 TIBCO Software Inc. Agenda • Real World Use Cases • Introduction to Streaming Analytics • Market Overview • Relation to other Big Data Components • Live Demo
  • 5. © Copyright 2000-2016 TIBCO Software Inc. Agenda • Real World Use Cases • Introduction to Streaming Analytics • Market Overview • Relation to other Big Data Components • Live Demo
  • 6. © Copyright 2000-2016 TIBCO Software Inc. Analyze and Act on Critical Business Moments
  • 7. © Copyright 2000-2016 TIBCO Software Inc. Success Story Predictive Fault Management
  • 8. © Copyright 2000-2013 TIBCO Software Inc. “An outage on one well can cost $10M per hour. We have 20-100 outages per year.“ - Drilling operations VP, major oil company
  • 9. Data Monitoring • Motor temperature • Motor vibration • Current • Intake pressure • Intake temperature Ø Flow Electrical power cable Pump Intake Protector ESP motor Pump monitoring unit Electric Submersible Pumps (ESP) Predictive Analytics - Fault Management
  • 10. Voltage Temperature Vibration Device history Temporal analytic: “If vibration spike is followed by temp spike then voltage spike [within 4 hours] then flag high severity alert.” Predictive Analytics - Fault Management
  • 11. © Copyright 2000-2016 TIBCO Software Inc. Live Surveillance of Equipment Continuous, live geospatial display of pump health and predictive signal breeches Alerts based on predictive signals Compare live readings and signals to historical average and means Continuous, live visualization of stats per 100’s of wells
  • 12. © Copyright 2000-2016 TIBCO Software Inc. Success Story Crowd Management
  • 13. © Copyright 2000-2013 TIBCO Software Inc. “Turn the customer into a fan and increase revenue significantly.“
  • 14. © Copyright 2000-2016 TIBCO Software Inc. World’s Smartest Building © Copyright 2000-2015 TIBCO Software Inc.
  • 15. © Copyright 2000-2016 TIBCO Software Inc. All Customers are different… Treat them that way… 14 Capture – Engage – Expand - Monetize Patterns – Real time MOREPERSONAL MORE CONTEXT social CRM POS mobileweb e-mails
  • 16. © Copyright 2000-2016 TIBCO Software Inc. Success Story Smart Manufacturing
  • 17. © Copyright 2000-2013 TIBCO Software Inc. ““For every 1% increase in shipped product, we make $11MM in profit. The demand is there, we just need to fulfill it.“ - Head of Quality, Solar Panel Manufacturer
  • 18. Scenario: Predictive Scrapping of Parts in an Assembly Line Goal: Scrap parts as early as possible automatically to reduce costs in a manufacturing process. Question: When to scrap a part in Station 1 instead of doing re-work or sending it to Station 2? Station 1 Station 2 Cost Before 9€ 7€ 13€ Total Cost 29€ (or more) Scrap? Scrap?
  • 19. Machine Learning Applied to Sensor Events in Real Time © Copyright 2000-2016 TIBCO Software Inc. Example: Predictive Analytics for Manufacturing (“scrap parts as early as possible”)
  • 20. © Copyright 2000-2016 TIBCO Software Inc. Great success stories, but … … how to realize these use cases?
  • 21. © Copyright 2000-2016 TIBCO Software Inc. Agenda • Real World Use Cases • Introduction to Streaming Analytics • Market Overview • Relation to other Big Data Components • Live Demo
  • 22. © Copyright 2000-2016 TIBCO Software Inc. Traditional Data Processing: ”Request – Response” Store Analyze Act
  • 23. © Copyright 2000-2016 TIBCO Software Inc. Traditional Data Processing: ”Request – Response” • Data is collected from a variety of sources, and placed in a persistent store. – Relational database. – NoSQL store. – Hadoop environment. • Analytical processes are executed against the stored data to detect opportunities or threats. • Actions are identified, delivered, and executed across various business channels. Store Analyze Act
  • 24. © Copyright 2000-2016 TIBCO Software Inc. Traditional Data Processing: Challenges Store Analyze Act • Introduces too much “decision latency” into the business. • Responses are delivered “after-the- fact”. • Maximum value of the identified situation is lost. – Cross-sell / up-sell opportunities are lost, impending equipment failure is missed, business processes are slow to respond and lack timely context. • Decisions are made on old and stale data.
  • 25. © Copyright 2000-2016 TIBCO Software Inc. Event Value Decreases Over TimeValue Time
  • 26. © Copyright 2000-2016 TIBCO Software Inc. Event Value Decreases Over TimeValue Time • Events are often most valuable “close to” the point of collection. • As time passes, events tend to lose their value. • The ability to proactively identify “threats” or “opportunities” will typically decrease. • Real-time capability is needed to maximize event value.
  • 27. © Copyright 2000-2016 TIBCO Software Inc. The New Era: Streaming Analytics Act & Monitor Analyze Store
  • 28. © Copyright 2000-2016 TIBCO Software Inc. The New Era: Streaming Analytics • Events are analyzed and processed in real-time as they arrive. • Decisions are timely, contextual, and based on fresh data. • Decision latency is eliminated, resulting in: ü Superior Customer Experience ü Operational Excellence ü Instant Awareness and Timely Decisions Act & Monitor Analyze Store
  • 29. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics: What Is A “Stream”? Clickstream Sensors Social Data Logs • Consists of pieces of data typically generated due to a change of state. • One or more identifiers • Timestamp & payload • Immutable • Typically unbounded; there is no end to the data. • Batch dataset: “bounded”. • Can be raw or derived.
  • 30. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Processing Pipeline APIs Adapters / Channels Integration Messaging Stream Ingest Transformation Aggregation Enrichment Filtering Stream Preprocessing Process Management Analytics (Real Time) Applications & APIs Analytics / DW Reporting Stream Outcomes • Contextual Rules • Windowing • Patterns • Deep ML • Analytics • … Stream Analytics & Processing Index / SearchNormalization
  • 31. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Processing Pipeline Separation of concerns to easily adjust one part in response to changing business requirements without the need for rewriting other parts!
  • 32. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics: Ingest APIs Adapters / Channels Integration Messaging Stream Ingest • Stream data may come from a number sources, either at the edge, in the data center, or via the cloud. • Need to handle a variety of data formats and protocols, all at global scale. • Pay attention to “event time” vs. “processing time” !! • Event Time: Time the event was created. • Processing Time: Time the event was received or processed. • Event time is typically more relevant, and will lead to more predictable results. • Eliminate time skew associated with clock synchronization, system outages, processing latency, network issues, etc.
  • 33. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics: Preprocessing Transformation Aggregation Enrichment Filtering Stream Preprocessing Normalization • Stream data often needs to be manipulated before it is processed by downstream components. • Normalization • Transformation • May filter unwanted events close to the source to eliminate “noise”. • Events may also be enriched with additional context to provide additional data for further processing. • Customer details, equipment details, location information, etc. • Data may be stored in a high-speed cache or other store for rapid access.
  • 34. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics: Processing Batch • Transform • Deep ML • Analytics • Data Lake • … Stream Analytics & Processing Real-Time • RT Analytics • Contextual Rules • Windowing • Patterns • … • Streams may be immediately pushed to a data lake. • May be raw or preprocessed. • Used for subsequent analysis as part of an immutable data layer. • Typically processed in batch in this part of the architecture. • In parallel, streams may be processed in real-time against a number of constructs. • Real-time analytics. • Graph analysis / Geo Analysis • Rules. • Results from the real-time processing may be fed into the batch component. • The results of batch processing may also be pushed into the real- time layer.
  • 35. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Processing Pipeline APIs Adapters / Channels Integration Messaging Stream Ingest Transformation Aggregation Enrichment Filtering Stream Preprocessing Process Management Analytics (Real Time) Applications & APIs Analytics / DW Reporting Stream Outcomes • Contextual Rules • Windowing • Patterns • Deep ML • Analytics • … Stream Analytics & Processing Index / SearchNormalization
  • 36. © Copyright 2000-2016 TIBCO Software Inc. Dataflow Streaming Pipeline – Extract, Transform, Load in Real Time https://www.linkedin.com/pulse/data-pipeline-hadoop-part-1-2-birender-saini
  • 37. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Processing Pipeline APIs Adapters / Channels Integration Messaging Stream Ingest Transformation Aggregation Enrichment Filtering Stream Preprocessing Process Management Analytics (Real Time) Applications & APIs Analytics / DW Reporting Stream Outcomes • Contextual Rules • Windowing • Patterns • Deep ML • Analytics • … Stream Analytics & Processing Index / SearchNormalization
  • 38. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics: “Windows” https://www.oreilly.com/ideas/the-world-beyond-batch-streaming-101
  • 39. © Copyright 2000-2016 TIBCO Software Inc. Automation and Augmented Intelligence for Humans Actions by Operations Human decisions in real time informed by up to date information 38 Automated action based on models of history combined with live context and business rules Machine-to-Machine Automation
  • 40. Big Data Reference Architecture Augmented Intelligence Operations SENSOR DATA TRANSACTIONS MESSAGE BUS MACHINE DATA SOCIAL DATA Streaming AnalyticsAction Aggregate Rules Stream Processing Analytics Correlate Continuous query processing Alerts Manual action, escalation Data Discovery Python R Data Scientists Cleansed Data History Visual Analytics Spark Integration ERP MDM DB WMS SOA / Microservices BIG DATA Data Warehouse, Hadoop Internal Data Integration Bus API Event Server H2O.ai Live UI
  • 41. © Copyright 2000-2016 TIBCO Software Inc. Agenda • Real World Use Cases • Introduction to Streaming Analytics • Market Overview • Relation to other Big Data Components • Live Demo
  • 42. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Market Growing Significantly “Everything Flows: The value of stream processing and streaming integration” (September 2016) http://hortonworks.com/info/value-streaming-integration/
  • 43. © Copyright 2000-2016 TIBCO Software Inc. Alternatives for Stream Processing Time to Market Streaming Frameworks Streaming Products Slow Fast Streaming Concepts IncludesIncludes
  • 44. © Copyright 2000-2016 TIBCO Software Inc. Alternatives for Stream Processing Concepts (Continuous Queries, Sliding Windows) Patterns (Counting, Sequencing, Tracking, Trends) Build everything by yourself! L Time to Market Streaming Frameworks Streaming Products Slow Fast Streaming Concepts
  • 45. © Copyright 2000-2016 TIBCO Software Inc. Usually not an option ... … as there are a lot of Frameworks and Products available!
  • 46. © Copyright 2000-2016 TIBCO Software Inc. Alternatives for Stream Processing Library (Java, .NET, Python) Query Language (often similar to SQL) Scalability (horizontal and vertical, fail over) Connectivity (technologies, markets, products) Operators (Filter, Sort, Aggregate) Time to Market Streaming Frameworks Streaming Products Slow Fast Streaming Concepts Different frameworks (ingest, preprocess, analytics) combined!
  • 47. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Processing Pipeline APIs Adapters / Channels Integration Messaging Stream Ingest Transformation Aggregation Enrichment Filtering Stream Preprocessing Process Management Analytics (Real Time) Applications & APIs Analytics / DW Reporting Stream Outcomes • Contextual Rules • Windowing • Patterns • Deep ML • Analytics • … Stream Analytics & Processing Index / SearchNormalization
  • 48. © Copyright 2000-2016 TIBCO Software Inc. Example for an Open Source Streaming Pipeline http://hortonworks.com/hadoop-tutorial/realtime-event-processing-nifi-kafka-storm “Realtime Event Processing in Hadoop with Apache NiFi, Kafka and Storm”
  • 49. Dataflow Streaming Pipeline (Ingest, Preprocess) Augmented Intelligence Operations SENSOR DATA TRANSACTIONS MESSAGE BUS MACHINE DATA SOCIAL DATA Streaming AnalyticsAction Aggregate Rules Stream Processing Analytics Correlate Continuous query processing Alerts Manual action, escalation Data Discovery Python R Data Scientists Cleansed Data History Visual Analytics Spark Integration ERP MDM DB WMS SOA / Microservices BIG DATA Data Warehouse, Hadoop Internal Data Integration Bus API Event Server H2O.ai Live UI
  • 50. © Copyright 2000-2016 TIBCO Software Inc. Open Source Dataflow Streaming Pipelines
  • 51. Streaming Analytics Augmented Intelligence Operations SENSOR DATA TRANSACTIONS MESSAGE BUS MACHINE DATA SOCIAL DATA Streaming AnalyticsAction Aggregate Rules Stream Processing Analytics Correlate Continuous query processing Alerts Manual action, escalation Data Discovery Python R Data Scientists Cleansed Data History Visual Analytics Spark Integration ERP MDM DB WMS SOA / Microservices BIG DATA Data Warehouse, Hadoop Internal Data Integration Bus API Event Server H2O.ai Live UI
  • 52. © Copyright 2000-2016 TIBCO Software Inc. Frameworks and Products (no complete list!) OPEN SOURCE CLOSED SOURCE PRODUCT FRAMEWORK Azure Microsoft Stream Analytics Google Cloud Dataflow
  • 53. © Copyright 2000-2016 TIBCO Software Inc. Frameworks and Products (no complete list!) OPEN SOURCE CLOSED SOURCE PRODUCT FRAMEWORK Azure Microsoft Stream Analytics Google Cloud Dataflow
  • 54. © Copyright 2000-2016 TIBCO Software Inc. Apache Storm Spout Bolt
  • 55. © Copyright 2000-2016 TIBCO Software Inc. Apache Storm – Hello World http://wpcertification.blogspot.ch/2014/02/helloworld-apache-storm-word-counter.html
  • 56. © Copyright 2000-2016 TIBCO Software Inc. AWS Kinesis – Integration with other AWS Components https://aws.amazon.com/kinesis/ AWS S3 RedShift DynamoDB
  • 57. © Copyright 2000-2016 TIBCO Software Inc. AWS Kinesis – Hello World
  • 58. © Copyright 2000-2016 TIBCO Software Inc. AWS Kinesis – Public Cloud Trade-Off … is easy to setup and scale. But you do not have full control! L • Any data that is older than 24 hours is automatically deleted • Every Kinesis application consists of just one procedure, so you can’t use Kinesis to perform complex stream processing unless you connect multiple applications • Kinesis can only support a maximum size of 50KB for each data item http://diamondstream.com/amazon-kinesis-big-real-time-data-processing-solution/ (blog post from 2014, might be outdated, but shows that you do not have full control over a cloud service)
  • 59. © Copyright 2000-2016 TIBCO Software Inc. Apache Spark General Data-processing Framework à However, focus is especially on Analytics (these days) x
  • 60. © Copyright 2000-2016 TIBCO Software Inc. Apache Spark – Focus on Analytics http://aptuz.com/blog/is-apache-spark-going-to-replace-hadoop/ http://fortune.com/2015/09/09/cloudera-spark-mapreduce/ http://www.ebaytechblog.com/2014/05/28/using-spark-to-ignite-data-analytics/ http://www.forbes.com/sites/paulmiller/2015/06/15/ibm-backs-apache-spark-for-big-data-analytics/ “[IBM’s initiatives] include: • deepening the integration between Apache Spark and existing IBM products like the Watson Health Cloud; • open sourcing IBM’s existing SystemML machine learning technology;
  • 61. © Copyright 2000-2016 TIBCO Software Inc. Spark Streaming Spark Streaming • is no real streaming solution • uses micro-batches • cannot process data in real-time (i.e. no ultra-low latency) • allows easy combination with other Spark components (SQL, Machine Learning, etc.)
  • 62. © Copyright 2000-2016 TIBCO Software Inc. Apache Spark – Hello World Spark Streaming API Spark Core API
  • 63. © Copyright 2000-2016 TIBCO Software Inc. Apache Spark – as a Cloud Service
  • 64. © Copyright 2000-2016 TIBCO Software Inc. Apache Flink Spark Streaming • „Newcomer“ • Looks very similar to Spark • But „Streaming First“ concept
  • 65. © Copyright 2000-2016 TIBCO Software Inc. Apache Beam Generic API with unified programming model for stream processing frameworks http://www.slideshare.net/DataTorrent/apache-beam-incubating-67428372
  • 66. © Copyright 2000-2016 TIBCO Software Inc. Frameworks and Products (no complete list!) OPEN SOURCE CLOSED SOURCE PRODUCT FRAMEWORK Azure Microsoft Stream Analytics Google Cloud Dataflow
  • 67. Alternatives for Stream Processing Library (Java, .NET, Python) Query Language (often similar to SQL) Scalability (horizontal and vertical, fail over) Connectivity (technologies, markets, products) Operators (Filter, Sort, Aggregate) Time to Market Streaming Frameworks Streaming Products Slow Fast Streaming Concepts Single Tool (Complete Processing Pipeline) Visual IDE (Dev, Test, Debug) Simulation (Feed Testing, Test Generation) Live UI (monitoring, proactive interaction) Maturity (24/7 support, consulting) Integration (out-of-the-box: ESB, MDM, Analytics, etc.)
  • 68. © Copyright 2000-2016 TIBCO Software Inc. Streaming Analytics Processing Pipeline APIs Adapters / Channels Integration Messaging Stream Ingest Transformation Aggregation Enrichment Filtering Stream Preprocessing Process Management Analytics (Real Time) Applications & APIs Analytics / DW Reporting Stream Outcomes • Contextual Rules • Windowing • Patterns • Deep ML • Analytics • … Stream Analytics & Processing Index / SearchNormalization
  • 69. Dataflow Streaming Pipeline + Streaming Analytics Augmented Intelligence Operations SENSOR DATA TRANSACTIONS MESSAGE BUS MACHINE DATA SOCIAL DATA Streaming AnalyticsAction Aggregate Rules Stream Processing Analytics Correlate Continuous query processing Alerts Manual action, escalation Data Discovery Python R Data Scientists Cleansed Data History Visual Analytics Spark Integration ERP MDM DB WMS SOA / Microservices BIG DATA Data Warehouse, Hadoop Internal Data Integration Bus API Event Server H2O.ai Live UI
  • 70. © Copyright 2000-2016 TIBCO Software Inc. IBM Streams
  • 71. © Copyright 2000-2016 TIBCO Software Inc. TIBCO StreamBase • Performance: Latency, Throughput, Scalability • Multi-threaded and clustered server from version 1 • High throughput: Millions of messages, 100,000s of quotes, 10,000s of orders • Low-latency: microsecond latency for algo trading, pre-trade risk, market data • Take Advantage of High Performance Hardware • Multicore (12, 24, 32 core) large memory (10s of gigabytes) • 64-bit Linux, Windows, Solaris deployment • Hardware acceleration (GPU, Solace, Tervela) • Enterprise Deployment • High availability and fault tolerance • Distributed state management for large data sets • Management and monitoring tools • Security and entitlements Integration • Continuous deployment and QA Process Support StreamSQL compiler and static optimizer In process, in thread adapter architecture Visual parallelism and scaling In-Memory Data Grid integration for distributed shared state Data parallelism and dispatch StreamBase Server Innovations
  • 72. © Copyright 2000-2016 TIBCO Software Inc. TIBCO StreamBase - Visual Programming Aggregate Capture card activations per location Sales too high à Fraud Log to any database No Fraud Sales too high?
  • 73. Visual Debugger Feed Simulation Unit Testing StreamBase Development StudioTIBCO StreamBase - Visual Programming
  • 74. Live UI for Augmented Intelligence Augmented Intelligence Operations SENSOR DATA TRANSACTIONS MESSAGE BUS MACHINE DATA SOCIAL DATA Streaming AnalyticsAction Aggregate Rules Stream Processing Analytics Correlate Continuous query processing Alerts Manual action, escalation Data Discovery Python R Data Scientists Cleansed Data History Visual Analytics Spark Integration ERP MDM DB WMS SOA / Microservices BIG DATA Data Warehouse, Hadoop Internal Data Integration Bus API Event Server H2O.ai Live UI
  • 75. © Copyright 2000-2016 TIBCO Software Inc. Live User Interface Live UI Continuous Query Processor Alerts CEP MQTT JMS In-Memory Data Grid Integration Social Media Data Market Data Sensor Data Historical Data In-Memory Data Grid Enterprise dataMarket Data IoT Mobile Social Browser / App Command & Control ACTION Continuous Query
  • 76. © Copyright 2000-2016 TIBCO Software Inc. Live UI in Desktop / Web Browser / Mobile App Dynamic aggregation Live visualization Ad-hoc continuous query Alerts Action
  • 77. © Copyright 2000-2016 TIBCO Software Inc. Live UI - Products Characteristics to Check • Alternative clients (rich client, browser, mobile app) • Maturity for enterprise use cases • Performance and scalability • “Big data native” deployment (YARN, Mesos) • Monitoring and proactive actions • Streaming engine under the hood (not just visualization layer) • New Ad-hoc queries by the business user (without the help of IT department) • Various visual components • Extendibility (graphical designer vs. coding) … or build your own solution using Websockets, Angular JS, etc.
  • 78. © Copyright 2000-2016 TIBCO Software Inc. Spoilt for Choice Does it make sense to combine frameworks and products?
  • 79. © Copyright 2000-2016 TIBCO Software Inc. Customer Example: Apache Storm + TIBCO Live Datamart External Data Snapshot Results Continuous Query Processor Query TIBCO Live Datamart Continuous Alerting Active Tables Active Tables Continuous Updates Clients Message Bus Public Data Customer Data StreamBase Bolt StreamBase Spout Operational Data StreamBase Bolt and Spout connect Apache Storm to StreamBase to provide real-time analytics on operational data
  • 80. © Copyright 2000-2016 TIBCO Software Inc. Agenda • Real World Use Cases • Introduction to Streaming Analytics • Market Overview • Relation to other Big Data Components • Live Demo
  • 81. © Copyright 2000-2016 TIBCO Software Inc. Closed Loop: Understand – Anticipate – Act
  • 82. © Copyright 2000-2016 TIBCO Software Inc. Closed Loop: Understand – Anticipate – Act Insights Actions MONITOR PREDICT ACT DECIDE MODEL ORGANIZE ANALYZE WRANGLE
  • 83. Data Discovery via Visual Analytics, Big Data and Machine Learning Augmented Intelligence Operations SENSOR DATA TRANSACTIONS MESSAGE BUS MACHINE DATA SOCIAL DATA Streaming AnalyticsAction Aggregate Rules Stream Processing Analytics Correlate Continuous query processing Alerts Manual action, escalation Data Discovery Python R Data Scientists Cleansed Data History Visual Analytics Spark Integration ERP MDM DB WMS SOA / Microservices BIG DATA Data Warehouse, Hadoop Internal Data Integration Bus API Event Server H2O.ai Live UI
  • 84. Find Insights and Patterns in Historical Data Visual Analytics + Machine Learning
  • 85. Apply Insights and Analytic Models to Proactive Actions Streaming AnalyticsH20.ai Open Source R TERR Spark ML MATLAB SAS PMML
  • 86. © Copyright 2000-2013 TIBCO Software Inc. 80% of betting happens AFTER the game begins TODAY
  • 87. Case Study: Streaming Analytics for Betting • Situation: Today, 80% of Betting is Done After the Game Starts • It’s not your father’s bookie anymore! • Problem: How to Analyze Big Betting Data? • Thousands of concurrent games, constantly adjusting odds, dozens of betting networks – firms must correlate millions of events a day to find the best betting opportunities in real-time • Solution: TIBCO for Fast Data Architecture • TXOdds uses TIBCO to correlate, aggregate, and analyze large volumes of streaming betting data in real-time and publish innovative predictive betting analytics to their customers • Result: TXOdds First to Market with Innovative Zero Latency Betting Analytics • Innovative real-time analytics help players who can process electronic data in real-time the edge “With StreamBase, in two months we had our first betting analytics feed live, and we continually deploy new ideas and evolve our old ones.” - Alex Kozlenkov, VP of technology, TXOdds
  • 88. © Copyright 2000-2016 TIBCO Software Inc. Big Data Architecture for Streaming Betting Analytics Event Processing MONITOR REAL-TIME ANALYTICS AGGREGATE HISTORICAL COMPARISON Predictive odds analytics Zero Latency Betting Analytics GLOBAL, DISTRIBUTED INFRASTRUCTURE Historical odds deviations B U S BETTING LINES SCORES NEWS HADOOP Context: Historical Betting Data, Odds, Outcomes B U S CACHE CACHE CACHE Real-Time Analytics CORRELATE Live Datamart SOCIAL
  • 89. Real-Time Social Media Analytics Twitter (#TomBradyBrokenLeg) Twitter (#Boston) Brady’s Stats Actionable Insights Twitter (#NFL) Something relevant happening? Every second counts! Change Odds (automated or manually triggered): Stop live-betting for the current running game? • Who will win the game? • How many interceptions will the Quarterback throw? • Will the Patriots win the Super Bowl? • …
  • 90. © Copyright 2000-2016 TIBCO Software Inc. Real-Time Social Media Analytics
  • 91. © Copyright 2000-2016 TIBCO Software Inc. Agenda • Real World Use Cases • Introduction to Streaming Analytics • Market Overview • Relation to other Big Data Components • Live Demo
  • 92. Scenario: Predictive Scrapping of Parts in an Assembly Line Goal: Scrap parts as early as possible automatically to reduce costs in a manufacturing process. Question: When to scrap a part in Station 1 instead of doing re-work or sending it to Station 2? Station 1 Station 2 Cost Before 9€ 7€ 13€ Total Cost 29€ (or more) Scrap? Scrap?
  • 93. Big Data Architecture for Predictive Maintenance Operational Analytics Operations Live UI CSV Batch JSON Real Time XML Real Time Streaming AnalyticsAction Aggregate Rules Analytics Correlate Live Datamart Continuous query processing Alerts Manual action, escalation HISTORICAL ANALYSIS Data Scientists Flume HDFS Spotfire R / TERR HDFS Hadoop (Cloudera) StreamBase TIBCO Fast Data Platform H2O Oracle RDBMS Avro Parquet … PMML Internal Data
  • 94. Find Patterns à TIBCO Spotfire with H2O Integration © Copyright 2000-2016 TIBCO Software Inc. Example: Predictive Analytics for Manufacturing (“scrap parts as early as possible”)
  • 95. © Copyright 2000-2016 TIBCO Software Inc. Apply Patterns à TIBCO StreamBase Connector for H2O.ai
  • 96. Monitor Patterns à TIBCO Live Datamart Augmented Intelligence (“Monitor the manufacturing process and change rules in real time!”) Live Dartmart Desktop Client
  • 97. Monitor Patterns à TIBCO Live Datamart Augmented Intelligence (“Monitor the manufacturing process and change rules in real time!”) Live Dartmart Web API
  • 98. TIBCO Spotfire + StreamBase + Live Datamart + H2O.ai Live DemoLive Demo
  • 99. © Copyright 2000-2016 TIBCO Software Inc. Key Take-Aways • Streaming Analytics processes Data while it is in Motion! • Automation and Proactive Human Interaction are BOTH needed! • Streaming Analytics is Complementary to Hadoop and Machine Learning!
  • 100. Questions? Please contact me! Kai Wähner Technology Evangelist kontakt@kai-waehner.de @KaiWaehner www.kai-waehner.de LinkedIn