SlideShare a Scribd company logo
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Make Streaming Analytics
Work for you: The Devil is in
the Details
Kanishk Mahajan
June 2016
Director Product Management, Hortonworks
Ryan Medlin
Director Software Engineering, Neustar
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
• Building a Stream Analytics Platform:
Requirements
• Building a Stream Analytics Platform: Capabilities
• Use Case -
• CyberSecurity - Threat Protection
• Q&A
Building a Stream
Analytics Platform
Requirements
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
5 Attributes of a Streaming Platform
 Ingest
 Process
 Analyze
 Visualize
 Respond
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Data Flow versus Stream Analytics
 Data Flow
–Ingest and route terabytes of data into a ”unified firehose”
–Actively performance manage the latency and quality of these data flows -
–across high variability of data formats, size of data and speed of data
–Produces Streams - ”unbounded sequence of events”
Real Time Stream Analytics
–Sub second event processing with linear scalability to billions of events
–Predictive Analytics at Scale
– Real time data aggregation across edge nodes while processing 10s of millions of events and
100s of gigabytes per second
–Guaranteed no data loss and events processed in order
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
6 key focus areas for building a Streaming Platform
 Common Abstraction Layer
 Latency
 Lambda Architecture
–“Orchestrate” over static and real time data
 Scale-out
 Rapid Application Development
 Data Visualization
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform
Common Abstraction Layer
–Select one or more streaming engines
–Select one or more cloud providers
–Select one or more event sources
–…Future Proof
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform
Latency
–500 millisecond or less- Dashboards, Security Incidents, Asset Performance
–20 milliseconds or less - Ad Networks, Preventive Maintenance
–…
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform
Lambda Architecture
–Integrate static and real time data
–Enrichment
–Orchestration of Batch workflows
–Predictive Classification and Scoring
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform
Scale Out
–Linear Scale out or scale down
–Resource Management
–Handle Transient workloads
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform
Rapid Application Development
–Aggregations
–Filters
–Multi Stream Correlations
–Splits
–Joins
–Normalizations
–Business Rules Editor
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Focus Areas for Streaming Platform
Data Visualization
–Time Series Visualizations
–Metrics Dashboarding
–Trends
–Comparisons
–Thresholds
–Custom UI extensions
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Real Time Stream Analytics versus Streaming Engine
Abstracts underlying Streaming Engine- Storm, Spark Streaming, Flink..
OOTB Support for multiple (cloud) event sources - Kafka, AWS Kinesis, Azure Event
Hub
Built-in Operators for Complex Event Processing
Built-in Real Time Dashboarding- Metrics and Events
Pluggable Workflow Management
Business Rules Editor and Rapid Application Development Framework
Cloud Deployment
Scalable Architecture
 Handles different latency requirements
Hortonworks Inc Confidential 2016
Building a Stream
Analytics Platform
Capabilities
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stream Analytics Platform - Capabilities
• Continuous, Real Time Processing
–Continuous Query and near real time processing of event data
–Distributed, scalable platform for processing continuous unbounded streams of data
–Abstraction across multiple open source stream engine technologies
–Large Scale Event Processing
–No coding approach for event processing logic
• Event Source
–Multiple event source support
–Multiple data format support
–Drag and Drop, no coding approach
• Event Storage
–Multiple event storage support
–Event Retention time period support
–Drag and Drop, no coding approach
Page 15
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Streaming Platform Capabilities - Analytics
Detect Trends
Alert on probable Failures
Descriptive Proactive Predictive Prescriptive
• Rules Calculations on
predefined settings
• High Throughput, Low
Latency Event Processing
Framework
• Train Models Offline
• Provide Query interface and
storage of Models
• Integrate Real Time Data with
Batch Data
• Connect with Enterprise
Applications
Reporting and Dashboarding
Visual Pattern Recognition
Monitor System
Alert before Failure
Take Action on Data
Avoid Failure and
Drive Operational Efficiency
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stream Analytics Platform - Capabilities
• Filtering
–Event Selection based on pre-defined criteria
• Transformations
–Aggregations
–Event Prioritization
–Event Deduplication
–Event Time Stamping
–Custom User Defined Functions
• Enrichment
–Pull in reference data for additional context
Page 17
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stream Analytics Platform - Capabilities
• Correlations
–Correlation of in-flight event data from multiple event streams
• Temporal Calculations
–Time Based Calculations on in-flight data in an event stream for e.g. a windowing
computation with a time based window
• Pattern Detection
–Calculating trends on in-flight data in an event stream
–Outlier detection : Pattern Matching with historical data
Page 18
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stream Analytics Platform - Capabilities
• Serialization/Deserialization
–Marshalling/Unmarshalling event data into a series of bytes
–Drag and Drop, no coding support
• Rule Engine
–Support for business rules (business directives that impact a decision)
–Support for invoking external business rule engine
• Visualization
–Reporting and Dashboarding
–Visual Pattern Recognition
Page 19
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Stream Analytics - Capabilities Summary
# Capability
1. Filtering
2. Transformations
3. Enrichment
4. Correlations
5. Temporal Calculations
6. Pattern Detection
# Capability
1. Continuous Real Time
Processing
2. Event Sources
3. Event Storage
4. Temporal Calculations
5. Rules Engine
6. Visualization
# Capability
7. Filtering
8. Transformation
9. Event Storage
10. Enrichment
11. Correlations
12. Pattern Detection
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Use Cases
CyberSecurity - Threat Protection
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
CyberSecurity - Threat Protection Use Case
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
CyberSecurity - Threat Protection - Steps
Applying Streaming Analytics
• Expose
• Provide Visibility into your enterprise
• Passively watch network traffic and construct events to describe the activity it sees.
• “Ingest”
• Transmit
• Compress, Encrypt, De-identify event data
• Send to the cloud for centralized log retention, real-time threat analysis and incident investigation
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
CyberSecurity - Threat Protection - Steps
Applying Streaming Analytics
• Prioritize
• Aggregate intelligence across all control points to identify and prioritize those systems that remain
compromised and require immediate remediation.
• Allows analysts to “zero in” on just those events of most importance.
• Significantly reduce the number of incidents that security analysts need to investigate
• Combine global telemetry from cyber intelligence networks with local customer context across
endpoints, networks and email, to uncover attacks that would otherwise evade detection.
• “Correlate” and “Enrich”
25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
CyberSecurity - Threat Protection - Steps
Applying Streaming Analytics
• Analyze
• Create comprehensive detection rules, behavioral analytics and guided investigations to detect the
latest threats.
• Enable quick and nimble data exploration across billions of events to proactively hunt for hidden
indicators of compromise (IOCs).
• Provide agile investigation tooling/UI to pivot from one indicator to the next, reconstruct the attack
storyline and plan a forceful response to disrupt the attack.
• Allow Security analysts to visualize all related attack components, e.g., all files used in the attack, all
email addresses, all malicious IP addresses, etc.
• “Proactive”, “Proactive”, “Prescriptive” and “Descriptive” Analytics
• “Real Time Analytics across large volumes of data at scale”
26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
CyberSecurity - Threat Protection - Steps
Applying Streaming Analytics
• Remediate
• Generate alerts to expedite validating and scoping the incident.
• Enrich alerts with supporting data. This data could include threat intelligence, point-in-time
context regarding users impacted, actions taken and hosts involved help you validate and scope the
incident.
• Allow a Security Investigator to hunt for Indicators-of-Compromise across all your endpoints from a
single console.
• Leverage your existing installations of Endpoint Protection and Email Security for enforcement
without having to install any new endpoint agents.
• Export rich security intelligence into third-party security incident and event management systems
(SIEMs).
• “Notifications”, “Event Storage”, “Workflow”
27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Make Streaming Analytics
Work for you: The Devil is in
the Details
Kanishk Mahajan
June 2016
Director Product Management, Hortonworks
Ryan Medlin
Director Software Engineering, Neustar
Copyright © 2015 Neustar, Inc. All Rights Reserved 28
AREAS OF FOCUS AND INNOVATION FOR NEUSTAR
 Security
 User
 Device
 Anomaly Detection
 Multi-Factor Authentication (PKI, etc)
 Policy Creation/Decisioning/Enforcement
 Local / Edge / Distributed functionality
 Openness and Standardization
 (OCF/OneM2M/MQTT/CoAP/GitHub)
 Partnerships
Copyright © 2015 Neustar, Inc. All Rights Reserved 29
DEVICE AND IOT PLATFORM
Devices
Data Processing Services
Local & Offline Functionality
Device and User Authentication
Device Directory & Identity
User and Device Policy
Device Certification & Public /
Private Key Distribution/MFA
OIC/OCF and IoTivity Standards Based Platform for Interoperability & device definition
Absolute, Group/Collection,
Local & cloud
Verification /
Fraud /
Anomaly
detection
Location
Services
Future
Peer-to-Peer communication &
Discovery. decentralization
Secure Discovery & Communication
OCF based security &
communication protocols
Data pipeline, enrichment and
analytics
CORE DATA PLATFORM REQUIREMENTS
 Provide Ingestion for all Machine Data Exhaust from all system components. (ex:
Anomaly Detection, Monitoring/Alerting)
 Deserializing avro messages using given schema(s) and by default store a
message in hdfs.
 Given a message attribute (for example device type and customer Id or device Id)
execute a rule to determine next course of action which should comprise of one
or more of these actions:
– Notify a user via email or sms or 3rd party api
– Notify or pass message to registry service/API
– Store data in hdfs
– Store data in a relational data store (hbase phoenix or other?)
Copyright © 2015 Neustar, Inc. All Rights Reserved 31
Copyright © 2015 Neustar, Inc. All Rights Reserved 32
Copyright © 2015 Neustar, Inc. All Rights Reserved 33
Copyright © 2015 Neustar, Inc. All Rights Reserved 34
Outgoing Messages
Copyright © 2015 Neustar, Inc. All Rights Reserved 35
Turn a Light on Manually
Inbound Message Flow
Copyright © 2015 Neustar, Inc. All Rights Reserved 36
Incoming Messages
INTERESTING PROBLEMS
 We need to work in offline and temporary offline mode so data replication is the hardest
problem to solve and is based on unique use cases.
 Also need to run a subset of the Cloud Inbound real time service rules and processing.
How to reuse this logic both in the cloud and locally on a Local Gateway is a design
challenge with tradeoffs.
Copyright © 2015 Neustar, Inc. All Rights Reserved 37
38 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Q & A

More Related Content

What's hot

Design a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFDesign a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDF
Hortonworks
 
HPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare Transformation
Hortonworks
 
Real Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with SparkReal Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with Spark
DataWorks Summit/Hadoop Summit
 
Interactive Analytics at Scale in Apache Hive Using Druid
Interactive Analytics at Scale in Apache Hive Using DruidInteractive Analytics at Scale in Apache Hive Using Druid
Interactive Analytics at Scale in Apache Hive Using Druid
DataWorks Summit
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
DataWorks Summit/Hadoop Summit
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
Hortonworks
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
Data Con LA
 
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Hortonworks
 
Intro to Spark & Zeppelin - Crash Course - HS16SJ
Intro to Spark & Zeppelin - Crash Course - HS16SJIntro to Spark & Zeppelin - Crash Course - HS16SJ
Intro to Spark & Zeppelin - Crash Course - HS16SJ
DataWorks Summit/Hadoop Summit
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
Cécile Poyet
 
Visualizing Big Data in Realtime
Visualizing Big Data in RealtimeVisualizing Big Data in Realtime
Visualizing Big Data in Realtime
DataWorks Summit
 
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks
 
Building a Scalable Data Science Platform with R
Building a Scalable Data Science Platform with RBuilding a Scalable Data Science Platform with R
Building a Scalable Data Science Platform with R
DataWorks Summit/Hadoop Summit
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
DataWorks Summit
 
Make Streaming IoT Analytics Work for You
Make Streaming IoT Analytics Work for YouMake Streaming IoT Analytics Work for You
Make Streaming IoT Analytics Work for You
Hortonworks
 
Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications
Hortonworks
 
Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4
Hortonworks
 
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
DataWorks Summit
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
 
The Next Generation of Data Processing and Open Source
The Next Generation of Data Processing and Open SourceThe Next Generation of Data Processing and Open Source
The Next Generation of Data Processing and Open Source
DataWorks Summit/Hadoop Summit
 

What's hot (20)

Design a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFDesign a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDF
 
HPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare Transformation
 
Real Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with SparkReal Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with Spark
 
Interactive Analytics at Scale in Apache Hive Using Druid
Interactive Analytics at Scale in Apache Hive Using DruidInteractive Analytics at Scale in Apache Hive Using Druid
Interactive Analytics at Scale in Apache Hive Using Druid
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
 
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
 
Intro to Spark & Zeppelin - Crash Course - HS16SJ
Intro to Spark & Zeppelin - Crash Course - HS16SJIntro to Spark & Zeppelin - Crash Course - HS16SJ
Intro to Spark & Zeppelin - Crash Course - HS16SJ
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Visualizing Big Data in Realtime
Visualizing Big Data in RealtimeVisualizing Big Data in Realtime
Visualizing Big Data in Realtime
 
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
 
Building a Scalable Data Science Platform with R
Building a Scalable Data Science Platform with RBuilding a Scalable Data Science Platform with R
Building a Scalable Data Science Platform with R
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
 
Make Streaming IoT Analytics Work for You
Make Streaming IoT Analytics Work for YouMake Streaming IoT Analytics Work for You
Make Streaming IoT Analytics Work for You
 
Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications
 
Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4
 
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
 
The Next Generation of Data Processing and Open Source
The Next Generation of Data Processing and Open SourceThe Next Generation of Data Processing and Open Source
The Next Generation of Data Processing and Open Source
 

Similar to Make Streaming Analytics work for you: The Devil is in the Details

Unlocking insights in streaming data
Unlocking insights in streaming dataUnlocking insights in streaming data
Unlocking insights in streaming data
Carolyn Duby
 
HDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi IntroductionHDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi Introduction
Milind Pandit
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Data Con LA
 
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
Splunk
 
What’s New: Splunk App for Stream and Splunk MINT
What’s New: Splunk App for Stream and Splunk MINTWhat’s New: Splunk App for Stream and Splunk MINT
What’s New: Splunk App for Stream and Splunk MINT
Splunk
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
Hortonworks
 
StreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics PlatformStreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics Platform
Atul Sharma
 
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
Splunk
 
Druid: Sub-Second OLAP queries over Petabytes of Streaming Data
Druid: Sub-Second OLAP queries over Petabytes of Streaming DataDruid: Sub-Second OLAP queries over Petabytes of Streaming Data
Druid: Sub-Second OLAP queries over Petabytes of Streaming Data
DataWorks Summit
 
Splunk MINT and Stream Breakout
Splunk MINT and Stream BreakoutSplunk MINT and Stream Breakout
Splunk MINT and Stream Breakout
Splunk
 
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Splunk
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at Scale
DataWorks Summit
 
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFIHarnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Haimo Liu
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
Hortonworks
 
Apache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming AnalyticsApache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming Analytics
Slim Baltagi
 
SplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT BreakoutSplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT Breakout
Splunk
 
Introduction to Streaming Analytics
Introduction to Streaming AnalyticsIntroduction to Streaming Analytics
Introduction to Streaming Analytics
Guido Schmutz
 
Apache NiFi Toronto Meetup
Apache NiFi Toronto MeetupApache NiFi Toronto Meetup
Apache NiFi Toronto Meetup
Hortonworks
 
Streaming Visualization
Streaming VisualizationStreaming Visualization
Streaming Visualization
Guido Schmutz
 

Similar to Make Streaming Analytics work for you: The Devil is in the Details (20)

Unlocking insights in streaming data
Unlocking insights in streaming dataUnlocking insights in streaming data
Unlocking insights in streaming data
 
HDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi IntroductionHDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi Introduction
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
 
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
 
What’s New: Splunk App for Stream and Splunk MINT
What’s New: Splunk App for Stream and Splunk MINTWhat’s New: Splunk App for Stream and Splunk MINT
What’s New: Splunk App for Stream and Splunk MINT
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
 
StreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics PlatformStreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics Platform
 
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
New Splunk Management Solutions Update: Splunk MINT and Splunk App for Stream
 
Druid: Sub-Second OLAP queries over Petabytes of Streaming Data
Druid: Sub-Second OLAP queries over Petabytes of Streaming DataDruid: Sub-Second OLAP queries over Petabytes of Streaming Data
Druid: Sub-Second OLAP queries over Petabytes of Streaming Data
 
Splunk MINT and Stream Breakout
Splunk MINT and Stream BreakoutSplunk MINT and Stream Breakout
Splunk MINT and Stream Breakout
 
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at Scale
 
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFIHarnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
 
Apache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming AnalyticsApache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming Analytics
 
SplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT BreakoutSplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT Breakout
 
Introduction to Streaming Analytics
Introduction to Streaming AnalyticsIntroduction to Streaming Analytics
Introduction to Streaming Analytics
 
Apache NiFi Toronto Meetup
Apache NiFi Toronto MeetupApache NiFi Toronto Meetup
Apache NiFi Toronto Meetup
 
Streaming Visualization
Streaming VisualizationStreaming Visualization
Streaming Visualization
 

More from DataWorks Summit/Hadoop Summit

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
DataWorks Summit/Hadoop Summit
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 

More from DataWorks Summit/Hadoop Summit (20)

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
 

Recently uploaded

What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
DianaGray10
 
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Ukraine
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
Sease
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
zjhamm304
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 

Recently uploaded (20)

What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
 
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
GlobalLogic Java Community Webinar #18 “How to Improve Web Application Perfor...
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 

Make Streaming Analytics work for you: The Devil is in the Details

  • 1. 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Make Streaming Analytics Work for you: The Devil is in the Details Kanishk Mahajan June 2016 Director Product Management, Hortonworks Ryan Medlin Director Software Engineering, Neustar
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda • Building a Stream Analytics Platform: Requirements • Building a Stream Analytics Platform: Capabilities • Use Case - • CyberSecurity - Threat Protection • Q&A
  • 3. Building a Stream Analytics Platform Requirements
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved 5 Attributes of a Streaming Platform  Ingest  Process  Analyze  Visualize  Respond
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Data Flow versus Stream Analytics  Data Flow –Ingest and route terabytes of data into a ”unified firehose” –Actively performance manage the latency and quality of these data flows - –across high variability of data formats, size of data and speed of data –Produces Streams - ”unbounded sequence of events” Real Time Stream Analytics –Sub second event processing with linear scalability to billions of events –Predictive Analytics at Scale – Real time data aggregation across edge nodes while processing 10s of millions of events and 100s of gigabytes per second –Guaranteed no data loss and events processed in order
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved 6 key focus areas for building a Streaming Platform  Common Abstraction Layer  Latency  Lambda Architecture –“Orchestrate” over static and real time data  Scale-out  Rapid Application Development  Data Visualization
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform Common Abstraction Layer –Select one or more streaming engines –Select one or more cloud providers –Select one or more event sources –…Future Proof
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform Latency –500 millisecond or less- Dashboards, Security Incidents, Asset Performance –20 milliseconds or less - Ad Networks, Preventive Maintenance –…
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform Lambda Architecture –Integrate static and real time data –Enrichment –Orchestration of Batch workflows –Predictive Classification and Scoring
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform Scale Out –Linear Scale out or scale down –Resource Management –Handle Transient workloads
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform Rapid Application Development –Aggregations –Filters –Multi Stream Correlations –Splits –Joins –Normalizations –Business Rules Editor
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Focus Areas for Streaming Platform Data Visualization –Time Series Visualizations –Metrics Dashboarding –Trends –Comparisons –Thresholds –Custom UI extensions
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Real Time Stream Analytics versus Streaming Engine Abstracts underlying Streaming Engine- Storm, Spark Streaming, Flink.. OOTB Support for multiple (cloud) event sources - Kafka, AWS Kinesis, Azure Event Hub Built-in Operators for Complex Event Processing Built-in Real Time Dashboarding- Metrics and Events Pluggable Workflow Management Business Rules Editor and Rapid Application Development Framework Cloud Deployment Scalable Architecture  Handles different latency requirements Hortonworks Inc Confidential 2016
  • 14. Building a Stream Analytics Platform Capabilities
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stream Analytics Platform - Capabilities • Continuous, Real Time Processing –Continuous Query and near real time processing of event data –Distributed, scalable platform for processing continuous unbounded streams of data –Abstraction across multiple open source stream engine technologies –Large Scale Event Processing –No coding approach for event processing logic • Event Source –Multiple event source support –Multiple data format support –Drag and Drop, no coding approach • Event Storage –Multiple event storage support –Event Retention time period support –Drag and Drop, no coding approach Page 15
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Streaming Platform Capabilities - Analytics Detect Trends Alert on probable Failures Descriptive Proactive Predictive Prescriptive • Rules Calculations on predefined settings • High Throughput, Low Latency Event Processing Framework • Train Models Offline • Provide Query interface and storage of Models • Integrate Real Time Data with Batch Data • Connect with Enterprise Applications Reporting and Dashboarding Visual Pattern Recognition Monitor System Alert before Failure Take Action on Data Avoid Failure and Drive Operational Efficiency
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stream Analytics Platform - Capabilities • Filtering –Event Selection based on pre-defined criteria • Transformations –Aggregations –Event Prioritization –Event Deduplication –Event Time Stamping –Custom User Defined Functions • Enrichment –Pull in reference data for additional context Page 17
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stream Analytics Platform - Capabilities • Correlations –Correlation of in-flight event data from multiple event streams • Temporal Calculations –Time Based Calculations on in-flight data in an event stream for e.g. a windowing computation with a time based window • Pattern Detection –Calculating trends on in-flight data in an event stream –Outlier detection : Pattern Matching with historical data Page 18
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stream Analytics Platform - Capabilities • Serialization/Deserialization –Marshalling/Unmarshalling event data into a series of bytes –Drag and Drop, no coding support • Rule Engine –Support for business rules (business directives that impact a decision) –Support for invoking external business rule engine • Visualization –Reporting and Dashboarding –Visual Pattern Recognition Page 19
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Stream Analytics - Capabilities Summary # Capability 1. Filtering 2. Transformations 3. Enrichment 4. Correlations 5. Temporal Calculations 6. Pattern Detection # Capability 1. Continuous Real Time Processing 2. Event Sources 3. Event Storage 4. Temporal Calculations 5. Rules Engine 6. Visualization # Capability 7. Filtering 8. Transformation 9. Event Storage 10. Enrichment 11. Correlations 12. Pattern Detection
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Use Cases CyberSecurity - Threat Protection
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved CyberSecurity - Threat Protection Use Case
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved CyberSecurity - Threat Protection - Steps Applying Streaming Analytics • Expose • Provide Visibility into your enterprise • Passively watch network traffic and construct events to describe the activity it sees. • “Ingest” • Transmit • Compress, Encrypt, De-identify event data • Send to the cloud for centralized log retention, real-time threat analysis and incident investigation
  • 24. 24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved CyberSecurity - Threat Protection - Steps Applying Streaming Analytics • Prioritize • Aggregate intelligence across all control points to identify and prioritize those systems that remain compromised and require immediate remediation. • Allows analysts to “zero in” on just those events of most importance. • Significantly reduce the number of incidents that security analysts need to investigate • Combine global telemetry from cyber intelligence networks with local customer context across endpoints, networks and email, to uncover attacks that would otherwise evade detection. • “Correlate” and “Enrich”
  • 25. 25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved CyberSecurity - Threat Protection - Steps Applying Streaming Analytics • Analyze • Create comprehensive detection rules, behavioral analytics and guided investigations to detect the latest threats. • Enable quick and nimble data exploration across billions of events to proactively hunt for hidden indicators of compromise (IOCs). • Provide agile investigation tooling/UI to pivot from one indicator to the next, reconstruct the attack storyline and plan a forceful response to disrupt the attack. • Allow Security analysts to visualize all related attack components, e.g., all files used in the attack, all email addresses, all malicious IP addresses, etc. • “Proactive”, “Proactive”, “Prescriptive” and “Descriptive” Analytics • “Real Time Analytics across large volumes of data at scale”
  • 26. 26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved CyberSecurity - Threat Protection - Steps Applying Streaming Analytics • Remediate • Generate alerts to expedite validating and scoping the incident. • Enrich alerts with supporting data. This data could include threat intelligence, point-in-time context regarding users impacted, actions taken and hosts involved help you validate and scope the incident. • Allow a Security Investigator to hunt for Indicators-of-Compromise across all your endpoints from a single console. • Leverage your existing installations of Endpoint Protection and Email Security for enforcement without having to install any new endpoint agents. • Export rich security intelligence into third-party security incident and event management systems (SIEMs). • “Notifications”, “Event Storage”, “Workflow”
  • 27. 27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Make Streaming Analytics Work for you: The Devil is in the Details Kanishk Mahajan June 2016 Director Product Management, Hortonworks Ryan Medlin Director Software Engineering, Neustar
  • 28. Copyright © 2015 Neustar, Inc. All Rights Reserved 28
  • 29. AREAS OF FOCUS AND INNOVATION FOR NEUSTAR  Security  User  Device  Anomaly Detection  Multi-Factor Authentication (PKI, etc)  Policy Creation/Decisioning/Enforcement  Local / Edge / Distributed functionality  Openness and Standardization  (OCF/OneM2M/MQTT/CoAP/GitHub)  Partnerships Copyright © 2015 Neustar, Inc. All Rights Reserved 29
  • 30. DEVICE AND IOT PLATFORM Devices Data Processing Services Local & Offline Functionality Device and User Authentication Device Directory & Identity User and Device Policy Device Certification & Public / Private Key Distribution/MFA OIC/OCF and IoTivity Standards Based Platform for Interoperability & device definition Absolute, Group/Collection, Local & cloud Verification / Fraud / Anomaly detection Location Services Future Peer-to-Peer communication & Discovery. decentralization Secure Discovery & Communication OCF based security & communication protocols Data pipeline, enrichment and analytics
  • 31. CORE DATA PLATFORM REQUIREMENTS  Provide Ingestion for all Machine Data Exhaust from all system components. (ex: Anomaly Detection, Monitoring/Alerting)  Deserializing avro messages using given schema(s) and by default store a message in hdfs.  Given a message attribute (for example device type and customer Id or device Id) execute a rule to determine next course of action which should comprise of one or more of these actions: – Notify a user via email or sms or 3rd party api – Notify or pass message to registry service/API – Store data in hdfs – Store data in a relational data store (hbase phoenix or other?) Copyright © 2015 Neustar, Inc. All Rights Reserved 31
  • 32. Copyright © 2015 Neustar, Inc. All Rights Reserved 32
  • 33. Copyright © 2015 Neustar, Inc. All Rights Reserved 33
  • 34. Copyright © 2015 Neustar, Inc. All Rights Reserved 34 Outgoing Messages
  • 35. Copyright © 2015 Neustar, Inc. All Rights Reserved 35 Turn a Light on Manually Inbound Message Flow
  • 36. Copyright © 2015 Neustar, Inc. All Rights Reserved 36 Incoming Messages
  • 37. INTERESTING PROBLEMS  We need to work in offline and temporary offline mode so data replication is the hardest problem to solve and is based on unique use cases.  Also need to run a subset of the Cloud Inbound real time service rules and processing. How to reuse this logic both in the cloud and locally on a Local Gateway is a design challenge with tradeoffs. Copyright © 2015 Neustar, Inc. All Rights Reserved 37
  • 38. 38 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Q & A

Editor's Notes

  1. Allow Network sensors to be centrally managed from the cloud with no additional management consoles.
  2. --- this shows the core real time use case for anomoly detection we need
  3. Data processing