SlideShare a Scribd company logo
The Internet of Analytics
Discovering actionable insights from
high-velocity streams of real-time IoT data
Sami Akbay, Founder and EVP, WebAction
In-Memory Computing Summit | San Francisco, CA | June 2015
Cheap and Efficient
Data Capture
Affordable sensors,
RFID, antennas,
aggregators,
cameras
Smaller footprint
Low energy
consumption
Continuous
Connectivity from
Everywhere
Wired networks /
wireless networks
Reliable, high
bandwidth
connectivity
Ubiquitous access
virtually from
anywhere
Abundant Compute
Power and Storage
Faster chips
Cheaper Memory
Hadoop / big data
stores
• ERP, CRM, Billing …
• Human generated/captured
• Stored in Databases
• Data inherently valuable
• Data already useful for operations
before analytics
Traditional
• Capture
only events
of interest
Internet of
Things
• Capture
Everything
• Sensor, log, location, …
• Machine generated/captured
• Stored in big data frameworks
• Most of the data has little inherent
value
• Value of data unknown until after
it is analyzed
The Internet of Analytics
Abundant
Compute
Power and
Storage
Continuous
Connectivity
from
Everywhere
Cheap and
efficient
Data
Capture
• Expensive to store in traditional data stores
• Much of it is not useful
IoT generates large volumes of data
• Requires adequate connectivity
• Uses significant network resources
Data comes in spikes or high-velocity continuous
streams
• Data transformation is required
• Data models need to facilitate analytics
Data arrives in a variety of different formats
• Requires platforms that can perform sophisticated Stream analytics
Data contains perishable insights
Source: Internet Of Things Applications Hunger for Hadoop
and Real-Time Analytics in the Cloud
by Mike Gualtieri and Rowan Curran , Forrester
Acquire Store Process
Acquire Process in Memory Deliver
BI /
Analytics
RDBMS EDW
Structured
Data
Machine
Data
LocationClick
Stream
Structured
Data
Machine
Data
LocationClick
Stream
IoT Analytics
Applications
Batch Reactive
R E ALT I M E B AR R I E R
ProactiveRealtime
Visualizations Store
Alerts Integrate
Reduce the Latency to Capture, Analyze, and
Ultimately Take Action to Increase Value
Events
Decision
latency
BusinessValue
Time to Action
Action taken
Data analyzed
Data captured
Based on concept developed by Richard Hackathorn, Bolder Technology
• Oil Rig Drill Sensor:
– Temperature up 10°C  continue drilling
– Temperature up 10°C + Viscosity down  stop drilling
• Hospital Bed:
– Blood O2 below 93%  Patient went to the bathroom
– Blood O2 below 93% and Pulse @ 150BPM  send a doctor
Perfect Storm: An event where
a rare combination of
circumstances aggravate a
situation drastically
You need realtime correlation of multiple data streams
to handle the perfect storm
Actionable insights come from combining
current events with context
Context Realtime Action+ =Event
Historical Context
+
Reference Data
Real-time Event Stream
e.g. Real-time Sensor Events
e.g. Shopper profile,
Store ID, Inventory,
Profitability
e.g. Present Next-Best-Action,
update current price, modify sourcing
Because actionable insights come from
combining current IoT events with context
In the last 30 minutes
a store has sold
$8,000
This store typically sells
$3,000 on Tuesdays in
June
Alert the store manager to
require ID at checkout
In the last hour, 2 visits
by shopper X in Store
Zone 3 for 16 minutes
Zone 3 has mobile
phones. Shopper X due
for device refresh.
Offer promotion package
for new device with 2 year
contract renewal
A mobile subscriber
drops 3 calls in 2
hours
A subscriber will drop 8
calls in a week before
becoming a churn risk
When a 611 call is made,
alert the agent NOT TO
offer a service discount
Context Realtime Action+ =Event
• Support Data in-Motion and Data at-Rest
– Process events and groups of events (data windows) as Streams
– Correlate multiple Streams in Realtime before disk storage
– Leverage analyzed context from historic data sources
– Store aggregate data, analyzed data, and raw payload on various storage
frameworks
• Implement an Easy-to-Use Development environment
– Allow users to quickly discover and analyze data
– Convert analysis patterns into IoT Analytics Applications
– Provide an easy-to-use development / deployment interface
• Address industrial and operational needs
– Offer linear scalability
– Run on commodity infrastructure / virtualized environments
– Provide redundancy, failover, recovery
RDBMS
JDBC/SQL
Oracle CDC
MS SQL CDC
NonStop
GoldenGate
Network
TCP/UDP
HTTP
SNMP/NetFlow
Files
CSV/TSV
JSON
XML
Apache
Free-form
BigData
HDFS
Log Flows
Flume
Collectd
Windows Events
Message Queues
JMS
Kafka
Sources Applications
DB Persistence
JDBC/SQL
NoSQL
Vertica
File Persistence
CSV/TSV
JSON
XML
Automated
Workflows
BigData
HDFS
Alerting
Email
SMS
External
Context
Distributed Results Cache
Sources
Streams
Windows
∞Queries
Caches
Targets
Distributed Continuous
Query Processor
Real-time
Dashboards
Delivery
Business-Level Logic
With Tungsten QL (extended SQL)
Message Queues
JaMS
Kafka
WebAction Solution
Structured and
unstructured data
Distributed,
in-memory, as data
is created
Correlated, enriched,
and filtered real-time
big data records
Deliver
Process
Assimilate
 Data from transactional sources is acquired
via redo or transaction logs
 Structured and non-Structured data
 No Production Impact
 No Application changes
Device Data
Industry Data
Social Feeds
Real-Time
Transaction Data
System/ IT Data
Common File
Format
TYPE EXAMPLE COMPLEXITY
CSV, JSON, XML
Facebook, Twitter
Syslogs, weblogs, event logs
SmartMeter, Medical Device, RFID, Netflow,
iBeacon, CDR
SWIFT, HL7, FIX, ASN
Oracle, DB2, SQLServer, MySQL, HP NonStop
SIMPLE
VERY HIGH
SIMPLE TO MEDIUM
MEDIUM
MEDIUM
HIGH
Structured and
unstructured data
Assimilate
Distributed,
in-memory, as data
is created
Process
 Enrich live Big Data with historical
data sources
 Process Big Data faster using
partitioned streams, caches, and
additional nodes
 Execute SQL-like queries of in-memory
Big Data
 Alert in real-time based on predictive
analytic model results
Structured and
unstructured data
Assimilate
Structured and
unstructured data
Distributed,
in-memory, as data
is created
Correlated, enriched,
and filtered real-time
big data records
Deliver
Process
Assimilate
 Continuous Big Data Records
 Realtime Drag & Drop Dashboards
 Predictive Alerts
 Business Trends
 Data Patterns
 Outliers
Device Data
Big Data
Infrastructure
Industry Data
Social Feeds
Transaction Data
Enterprise Apps
& Workflows
Enterprise Data
Warehouse
RDBMS
Stream Analytics Applications
System/ IT Data
HighSpeedDataAcquisition
Command Line Visual Designer
CREATE APPLICATION MultiLogApp;
CREATE FLOW MonitorLogs;
CREATE SOURCE AccessLogSource
USING…
CREATE TYPE AccessLogEntry …
CREATE STREAM AccessStream OF…
CREATE CQ ParseAccessLog …
W >
Results
Persistence
Context
Cache
Distributed
Results
Cache
Distributed Query
Processor
External
Targets &
Alerts
Event
Windows
Node: n
2
1
Drag & Drop
Stream
Dashboards
Component Definition
Source Access external data and provides realtime continuous events into streams
Stream Carries data between components and nodes
Window Provides moving snapshot/collection of events for aggregates and models
Cache External contextual data made available using distributed in-memory grid
CQ
(Continuous Query)
A Continuous Query emits big data records after processing realtime
streaming events (can process data from streams, windows, caches, event
tables, and stores)
WAction Store
(big data records)
Resulting big data records from processing (aggregates, correlates, anomalies,
predictions) - can be in-memory only or persisted to Elasticsearch / database
Target Outputs realtime big data records to external systems
Application A combination of the above components performing business logic
Dashboard A drag and drop realtime view into stores, caches, and streams
• Create Applications
• Add & Navigate through Flows
• Design Data Model
• Drag & Drop Components
• Configure Components
• Deploy Applications
• Start / Stop Applications
• View Alerts
• View Event Flow Rate
• Add a new application
• Add a source (CDC, structured, semi-structured, etc.)
• Configure Source
• Add a typed stream
• Add a CQ to transform data types
• Add a Cache and CQ for context enrichment
• Design dashboard
• Create multiple pages / drilldowns
• Define data through queries
• Drag and drop visualizations: Values / Icons / Gauges – Maps – Line Charts –
Scatter / Bubble Plots – Pie / Donut Charts – Bar Charts – Tables – Heat
Maps
• Add a new Dashboard
• Add a Visualization
• Configure Query and Visualization
• View Data Visualized
• Filter data in a page
• Drilldown to related and detail pages
• Realtime log / database CDC reading in addition to push sources like
TCP/JMS
• Bytecode generation for data types and query processing
• Scaling across multiple nodes with flexible deployment
• Auto failover of application components from one node to another
• Nodes can be added and removed while applications are running
• Recovery ensures no events are missed or processed twice
• Recovery takes window contents into account
• Role based security at the application through component level
• Integrated realtime dashboard visualizations using server push
WebAction Headquarters
info@webaction.com
+1 (650) 241-0680
LinkedIn.com/company/WebAction-Inc- facebook.com/WebActionSoftware Twitter.com/WebActionInc

More Related Content

What's hot

How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
Databricks
 
ParStream - Big Data for Business Users
ParStream - Big Data for Business UsersParStream - Big Data for Business Users
ParStream - Big Data for Business Users
ParStream Inc.
 
Extreme Analytics @ eBay
Extreme Analytics @ eBayExtreme Analytics @ eBay
Extreme Analytics @ eBay
DataWorks Summit/Hadoop Summit
 
Billions of Rows, Millions of Insights, Right Now
Billions of Rows, Millions of Insights, Right NowBillions of Rows, Millions of Insights, Right Now
Billions of Rows, Millions of Insights, Right Now
Rob Winters
 
4th Industrial Revolution
4th Industrial Revolution4th Industrial Revolution
4th Industrial Revolution
Rolando Rangel
 
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
Lucas Jellema
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE) Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Guido Schmutz
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services Sector
DataWorks Summit
 
Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
Impetus Technologies
 
Mastering MapReduce: MapReduce for Big Data Management and Analysis
Mastering MapReduce: MapReduce for Big Data Management and AnalysisMastering MapReduce: MapReduce for Big Data Management and Analysis
Mastering MapReduce: MapReduce for Big Data Management and Analysis
Teradata Aster
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
VMware Tanzu
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your Business
Chicago Hadoop Users Group
 
HP Discover: Real Time Insights from Big Data
HP Discover: Real Time Insights from Big DataHP Discover: Real Time Insights from Big Data
HP Discover: Real Time Insights from Big Data
Rob Winters
 
Why Finance Should Consider Agile Modern Data Delivery Platform
Why Finance Should Consider Agile Modern Data Delivery PlatformWhy Finance Should Consider Agile Modern Data Delivery Platform
Why Finance Should Consider Agile Modern Data Delivery Platform
syed_javed
 
Key Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShareKey Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShare
MapR Technologies
 
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
SoftServe
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
CCG
 
Overview of Business Intelligence
Overview of Business IntelligenceOverview of Business Intelligence
Overview of Business Intelligence
Parthiv Dixit
 
Transforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform StrategyTransforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform Strategy
Databricks
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Impetus Technologies
 

What's hot (20)

How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
 
ParStream - Big Data for Business Users
ParStream - Big Data for Business UsersParStream - Big Data for Business Users
ParStream - Big Data for Business Users
 
Extreme Analytics @ eBay
Extreme Analytics @ eBayExtreme Analytics @ eBay
Extreme Analytics @ eBay
 
Billions of Rows, Millions of Insights, Right Now
Billions of Rows, Millions of Insights, Right NowBillions of Rows, Millions of Insights, Right Now
Billions of Rows, Millions of Insights, Right Now
 
4th Industrial Revolution
4th Industrial Revolution4th Industrial Revolution
4th Industrial Revolution
 
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE) Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services Sector
 
Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
 
Mastering MapReduce: MapReduce for Big Data Management and Analysis
Mastering MapReduce: MapReduce for Big Data Management and AnalysisMastering MapReduce: MapReduce for Big Data Management and Analysis
Mastering MapReduce: MapReduce for Big Data Management and Analysis
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your Business
 
HP Discover: Real Time Insights from Big Data
HP Discover: Real Time Insights from Big DataHP Discover: Real Time Insights from Big Data
HP Discover: Real Time Insights from Big Data
 
Why Finance Should Consider Agile Modern Data Delivery Platform
Why Finance Should Consider Agile Modern Data Delivery PlatformWhy Finance Should Consider Agile Modern Data Delivery Platform
Why Finance Should Consider Agile Modern Data Delivery Platform
 
Key Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShareKey Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShare
 
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
 
Overview of Business Intelligence
Overview of Business IntelligenceOverview of Business Intelligence
Overview of Business Intelligence
 
Transforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform StrategyTransforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform Strategy
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
 

Similar to WebAction In-Memory Computing Summit 2015

WebAction-Sami Abkay
WebAction-Sami AbkayWebAction-Sami Abkay
WebAction-Sami Abkay
Inside Analysis
 
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
In-Memory Computing Summit
 
Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural Patterns
DataWorks Summit
 
Introduction to Stream Processing
Introduction to Stream ProcessingIntroduction to Stream Processing
Introduction to Stream Processing
Guido Schmutz
 
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon KinesisDay 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Amazon Web Services
 
Future Grid Overview 2018
Future Grid Overview 2018Future Grid Overview 2018
Future Grid Overview 2018
Chris J Law
 
Wikibon #IoT #HyperConvergence Presentation via @theCUBE
Wikibon #IoT #HyperConvergence Presentation via @theCUBE Wikibon #IoT #HyperConvergence Presentation via @theCUBE
Wikibon #IoT #HyperConvergence Presentation via @theCUBE
John Furrier
 
Hyper-Convergence CrowdChat
Hyper-Convergence CrowdChatHyper-Convergence CrowdChat
Hyper-Convergence CrowdChat
Wikibon Community
 
BDA307 Real-time Streaming Applications on AWS, Patterns and Use Cases
BDA307 Real-time Streaming Applications on AWS, Patterns and Use CasesBDA307 Real-time Streaming Applications on AWS, Patterns and Use Cases
BDA307 Real-time Streaming Applications on AWS, Patterns and Use Cases
Amazon Web Services
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
Amazon Web Services
 
Datawarehouse Overview
Datawarehouse OverviewDatawarehouse Overview
Datawarehouse Overview
ashok kumar
 
Spark meetup stream processing use cases
Spark meetup   stream processing use casesSpark meetup   stream processing use cases
Spark meetup stream processing use cases
punesparkmeetup
 
Driving Business Insights with a Modern Data Architecture AWS Summit SG 2017
Driving Business Insights with a Modern Data Architecture  AWS Summit SG 2017Driving Business Insights with a Modern Data Architecture  AWS Summit SG 2017
Driving Business Insights with a Modern Data Architecture AWS Summit SG 2017
Amazon Web Services
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
Amazon Web Services
 
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
Kafka and Stream Processing, Taking Analytics Real-time, Mike SpicerKafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
confluent
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
Nicolas Morales
 
Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from Pivotal
VMware Tanzu Korea
 
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Amazon Web Services
 
Real time data integration best practices and architecture
Real time data integration best practices and architectureReal time data integration best practices and architecture
Real time data integration best practices and architecture
Bui Kiet
 
Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017
Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017
Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017
Amazon Web Services
 

Similar to WebAction In-Memory Computing Summit 2015 (20)

WebAction-Sami Abkay
WebAction-Sami AbkayWebAction-Sami Abkay
WebAction-Sami Abkay
 
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
 
Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural Patterns
 
Introduction to Stream Processing
Introduction to Stream ProcessingIntroduction to Stream Processing
Introduction to Stream Processing
 
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon KinesisDay 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
 
Future Grid Overview 2018
Future Grid Overview 2018Future Grid Overview 2018
Future Grid Overview 2018
 
Wikibon #IoT #HyperConvergence Presentation via @theCUBE
Wikibon #IoT #HyperConvergence Presentation via @theCUBE Wikibon #IoT #HyperConvergence Presentation via @theCUBE
Wikibon #IoT #HyperConvergence Presentation via @theCUBE
 
Hyper-Convergence CrowdChat
Hyper-Convergence CrowdChatHyper-Convergence CrowdChat
Hyper-Convergence CrowdChat
 
BDA307 Real-time Streaming Applications on AWS, Patterns and Use Cases
BDA307 Real-time Streaming Applications on AWS, Patterns and Use CasesBDA307 Real-time Streaming Applications on AWS, Patterns and Use Cases
BDA307 Real-time Streaming Applications on AWS, Patterns and Use Cases
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 
Datawarehouse Overview
Datawarehouse OverviewDatawarehouse Overview
Datawarehouse Overview
 
Spark meetup stream processing use cases
Spark meetup   stream processing use casesSpark meetup   stream processing use cases
Spark meetup stream processing use cases
 
Driving Business Insights with a Modern Data Architecture AWS Summit SG 2017
Driving Business Insights with a Modern Data Architecture  AWS Summit SG 2017Driving Business Insights with a Modern Data Architecture  AWS Summit SG 2017
Driving Business Insights with a Modern Data Architecture AWS Summit SG 2017
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
Kafka and Stream Processing, Taking Analytics Real-time, Mike SpicerKafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from Pivotal
 
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
 
Real time data integration best practices and architecture
Real time data integration best practices and architectureReal time data integration best practices and architecture
Real time data integration best practices and architecture
 
Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017
Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017
Amazon Kinesis Platform – The Complete Overview - Pop-up Loft TLV 2017
 

Recently uploaded

Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 

Recently uploaded (20)

Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 

WebAction In-Memory Computing Summit 2015

  • 1. The Internet of Analytics Discovering actionable insights from high-velocity streams of real-time IoT data Sami Akbay, Founder and EVP, WebAction In-Memory Computing Summit | San Francisco, CA | June 2015
  • 2. Cheap and Efficient Data Capture Affordable sensors, RFID, antennas, aggregators, cameras Smaller footprint Low energy consumption Continuous Connectivity from Everywhere Wired networks / wireless networks Reliable, high bandwidth connectivity Ubiquitous access virtually from anywhere Abundant Compute Power and Storage Faster chips Cheaper Memory Hadoop / big data stores
  • 3. • ERP, CRM, Billing … • Human generated/captured • Stored in Databases • Data inherently valuable • Data already useful for operations before analytics Traditional • Capture only events of interest Internet of Things • Capture Everything • Sensor, log, location, … • Machine generated/captured • Stored in big data frameworks • Most of the data has little inherent value • Value of data unknown until after it is analyzed
  • 4. The Internet of Analytics Abundant Compute Power and Storage Continuous Connectivity from Everywhere Cheap and efficient Data Capture
  • 5. • Expensive to store in traditional data stores • Much of it is not useful IoT generates large volumes of data • Requires adequate connectivity • Uses significant network resources Data comes in spikes or high-velocity continuous streams • Data transformation is required • Data models need to facilitate analytics Data arrives in a variety of different formats • Requires platforms that can perform sophisticated Stream analytics Data contains perishable insights Source: Internet Of Things Applications Hunger for Hadoop and Real-Time Analytics in the Cloud by Mike Gualtieri and Rowan Curran , Forrester
  • 6. Acquire Store Process Acquire Process in Memory Deliver BI / Analytics RDBMS EDW Structured Data Machine Data LocationClick Stream Structured Data Machine Data LocationClick Stream IoT Analytics Applications Batch Reactive R E ALT I M E B AR R I E R ProactiveRealtime Visualizations Store Alerts Integrate
  • 7. Reduce the Latency to Capture, Analyze, and Ultimately Take Action to Increase Value Events Decision latency BusinessValue Time to Action Action taken Data analyzed Data captured Based on concept developed by Richard Hackathorn, Bolder Technology
  • 8. • Oil Rig Drill Sensor: – Temperature up 10°C  continue drilling – Temperature up 10°C + Viscosity down  stop drilling • Hospital Bed: – Blood O2 below 93%  Patient went to the bathroom – Blood O2 below 93% and Pulse @ 150BPM  send a doctor Perfect Storm: An event where a rare combination of circumstances aggravate a situation drastically You need realtime correlation of multiple data streams to handle the perfect storm
  • 9. Actionable insights come from combining current events with context Context Realtime Action+ =Event Historical Context + Reference Data Real-time Event Stream e.g. Real-time Sensor Events e.g. Shopper profile, Store ID, Inventory, Profitability e.g. Present Next-Best-Action, update current price, modify sourcing
  • 10. Because actionable insights come from combining current IoT events with context In the last 30 minutes a store has sold $8,000 This store typically sells $3,000 on Tuesdays in June Alert the store manager to require ID at checkout In the last hour, 2 visits by shopper X in Store Zone 3 for 16 minutes Zone 3 has mobile phones. Shopper X due for device refresh. Offer promotion package for new device with 2 year contract renewal A mobile subscriber drops 3 calls in 2 hours A subscriber will drop 8 calls in a week before becoming a churn risk When a 611 call is made, alert the agent NOT TO offer a service discount Context Realtime Action+ =Event
  • 11. • Support Data in-Motion and Data at-Rest – Process events and groups of events (data windows) as Streams – Correlate multiple Streams in Realtime before disk storage – Leverage analyzed context from historic data sources – Store aggregate data, analyzed data, and raw payload on various storage frameworks • Implement an Easy-to-Use Development environment – Allow users to quickly discover and analyze data – Convert analysis patterns into IoT Analytics Applications – Provide an easy-to-use development / deployment interface • Address industrial and operational needs – Offer linear scalability – Run on commodity infrastructure / virtualized environments – Provide redundancy, failover, recovery
  • 12. RDBMS JDBC/SQL Oracle CDC MS SQL CDC NonStop GoldenGate Network TCP/UDP HTTP SNMP/NetFlow Files CSV/TSV JSON XML Apache Free-form BigData HDFS Log Flows Flume Collectd Windows Events Message Queues JMS Kafka Sources Applications DB Persistence JDBC/SQL NoSQL Vertica File Persistence CSV/TSV JSON XML Automated Workflows BigData HDFS Alerting Email SMS External Context Distributed Results Cache Sources Streams Windows ∞Queries Caches Targets Distributed Continuous Query Processor Real-time Dashboards Delivery Business-Level Logic With Tungsten QL (extended SQL) Message Queues JaMS Kafka
  • 14. Structured and unstructured data Distributed, in-memory, as data is created Correlated, enriched, and filtered real-time big data records Deliver Process Assimilate
  • 15.  Data from transactional sources is acquired via redo or transaction logs  Structured and non-Structured data  No Production Impact  No Application changes Device Data Industry Data Social Feeds Real-Time Transaction Data System/ IT Data Common File Format TYPE EXAMPLE COMPLEXITY CSV, JSON, XML Facebook, Twitter Syslogs, weblogs, event logs SmartMeter, Medical Device, RFID, Netflow, iBeacon, CDR SWIFT, HL7, FIX, ASN Oracle, DB2, SQLServer, MySQL, HP NonStop SIMPLE VERY HIGH SIMPLE TO MEDIUM MEDIUM MEDIUM HIGH Structured and unstructured data Assimilate
  • 16. Distributed, in-memory, as data is created Process  Enrich live Big Data with historical data sources  Process Big Data faster using partitioned streams, caches, and additional nodes  Execute SQL-like queries of in-memory Big Data  Alert in real-time based on predictive analytic model results Structured and unstructured data Assimilate
  • 17. Structured and unstructured data Distributed, in-memory, as data is created Correlated, enriched, and filtered real-time big data records Deliver Process Assimilate  Continuous Big Data Records  Realtime Drag & Drop Dashboards  Predictive Alerts  Business Trends  Data Patterns  Outliers
  • 18. Device Data Big Data Infrastructure Industry Data Social Feeds Transaction Data Enterprise Apps & Workflows Enterprise Data Warehouse RDBMS Stream Analytics Applications System/ IT Data HighSpeedDataAcquisition Command Line Visual Designer CREATE APPLICATION MultiLogApp; CREATE FLOW MonitorLogs; CREATE SOURCE AccessLogSource USING… CREATE TYPE AccessLogEntry … CREATE STREAM AccessStream OF… CREATE CQ ParseAccessLog … W > Results Persistence Context Cache Distributed Results Cache Distributed Query Processor External Targets & Alerts Event Windows Node: n 2 1 Drag & Drop Stream Dashboards
  • 19. Component Definition Source Access external data and provides realtime continuous events into streams Stream Carries data between components and nodes Window Provides moving snapshot/collection of events for aggregates and models Cache External contextual data made available using distributed in-memory grid CQ (Continuous Query) A Continuous Query emits big data records after processing realtime streaming events (can process data from streams, windows, caches, event tables, and stores) WAction Store (big data records) Resulting big data records from processing (aggregates, correlates, anomalies, predictions) - can be in-memory only or persisted to Elasticsearch / database Target Outputs realtime big data records to external systems Application A combination of the above components performing business logic Dashboard A drag and drop realtime view into stores, caches, and streams
  • 20.
  • 21. • Create Applications • Add & Navigate through Flows • Design Data Model • Drag & Drop Components • Configure Components • Deploy Applications • Start / Stop Applications • View Alerts • View Event Flow Rate
  • 22. • Add a new application • Add a source (CDC, structured, semi-structured, etc.) • Configure Source
  • 23. • Add a typed stream • Add a CQ to transform data types • Add a Cache and CQ for context enrichment
  • 24.
  • 25. • Design dashboard • Create multiple pages / drilldowns • Define data through queries • Drag and drop visualizations: Values / Icons / Gauges – Maps – Line Charts – Scatter / Bubble Plots – Pie / Donut Charts – Bar Charts – Tables – Heat Maps
  • 26. • Add a new Dashboard • Add a Visualization • Configure Query and Visualization
  • 27. • View Data Visualized • Filter data in a page • Drilldown to related and detail pages
  • 28. • Realtime log / database CDC reading in addition to push sources like TCP/JMS • Bytecode generation for data types and query processing • Scaling across multiple nodes with flexible deployment • Auto failover of application components from one node to another • Nodes can be added and removed while applications are running • Recovery ensures no events are missed or processed twice • Recovery takes window contents into account • Role based security at the application through component level • Integrated realtime dashboard visualizations using server push
  • 29. WebAction Headquarters info@webaction.com +1 (650) 241-0680 LinkedIn.com/company/WebAction-Inc- facebook.com/WebActionSoftware Twitter.com/WebActionInc

Editor's Notes

  1. Added windows to event streams – feel free to tweak. Here was what we wanted to get across on the Context topic: Combine multiple realtime streams for context. If A and B happen. Event correlation. Include time and windowing.  Historic context - same store sales from last year.  Reference data - Store ID. Combined through Multi-load correlation and lookups. Need to flesh out the context piece of this slide without mentioning WebAction. Update use case references for IoT.
  2. Edit this slide to include an IoT use case in the first position. You may also want to take an IoT angle on the other use cases. John Schaeffer has a couple of different use cases for this slide you may wish to plagiarize from.
  3. We need to add in all of the IoT Data Sources to this graphic. I don’t seem to have that particular data source slide that includes the IoT sources. Is this where you’d like to talk about “Enables data that was previously uncapturable - ability to capture the data”. If not, please be sure to mention this concept where appropriate, or create a slide to speak to it. I added Automated Workflows to Delivery.