SlideShare a Scribd company logo
1 of 37
Download to read offline
Stream Processing in Action
Webinar
Nirmal Fernando
Senior Lead Solutions Engineer, WSO2
Nov, 2018
Contents
• Introduction
• WSO2 Stream Processor Overview
• Industry Use Cases
• Demo
Diverse Industries -> Unique Challenges
http://www.solgenie.com/industries/
“The price of light is less than the cost of
darkness.”
- Arthur C. Nielsen,
Market Researcher & Founder of ACNielsen
Value of Insights Degrade Fast
http://www.history.com/news/ask-history/who-determined-the-speed-of-light
http://www.bluntmoms.com/mom-guilt-theres-one-solution/
Stream Processor Core
WSO2 ANALYTICS OFFERING
7
▪ Consumes events, and publish alerts and
summarizations to and from various
enterprise systems.
▪ Event Processor Core with Streaming
Complex Event Processing, Incremental
Time Series Aggregations, and Streaming
Machine Learning.
▪ Stream Processing Functionalities via
Extension Store
▪ High Available and Scalable Analytics
Fabric
▪ Prebuilt and custom analytics solutions
Events
JMS, Thrift, SMTP, HTTP, MQTT, Kafka
Analytics Fabric
Complex Event
Processing
Incremental Time
Series Aggregation
Machine
Learning
Extension Store
FinancialandBanking
Analytics
RetailAnalytics
LocationAnalytics
OperationalAnalytics
SmartEnergyAnalytics
Custom
Analytics
Solutions
...
Solutions
WSO2 STREAM PROCESSOR
WSO2 Stream Processor
An open source, cloud-native
analytics product optimized to create
real-time, actionable insights for agile
digital businesses.
9
WSO2 Stream Processor (WSO2 SP)
1. Data collection
2. Data cleansing
3. Data transformation
4. Data enrichment
5. Data summarization
6. Rule processing
7. Machine Learning & Artificial Intelligence
8. Data pipelining
9. Data Publishing
10. On demand processing
11. Data Presentation
Stream Processing Patterns
● Lightweight, lean, and high performance
● Best suited for
○ Streaming Data Integration,
○ Streaming Analytics
● Streaming SQL & graphical drag-and-drop editor
● Multiple deployment options
○ Process data at the edge (java, python)
○ Micro Stream Processing
○ High availability with 2 nodes
○ Highly scalable distributed deployments
● Support for streaming ML & Long running aggregations
● Monitoring tools and citizen integration options
WSO2 Stream Processor
• Source and Sinks
– HTTP, Kafka, TCP, Email, JMS, File, Rabbitmq,
MQTT, Web-Socket, Twitter, Amazon SQS
• Message Formats
– JSON, XML, Text, Binary, Key-value, CSV
• Data Stores
– RDBMS, Solr, MongoDB, HBase, Cassandra,
Elasticsearch, Hazelcast, Redis
Supported connectors
(Streaming) Machine Learning
▪ Running PMML Models for predictions
- Built via Apache Spark MLlib, Python, H2O.ai (for deep learning algos) or R
- Export as PMML
- Load precreated PMML Model into Siddhi to predict in Realtime
▪ Supporting native models for predictions
- Spark MLlib Models, Java based Tensorflow Models
▪ Online Learning and predictions
- Regression Analytics - Data Classification
- K-Means Clustering - Markov Models
- Anomaly Detections - … more on the way
Supported Extensions
• Geo graphical processing
• NLP
• Graph
• Reordering
• Timeseries
• Import Machine Learning models
– PMML, TensorFlow, etc
• Streaming Machine Learning
– Clustering, Classification, Regression
– Makove Models, Anomaly detection, etc.
• .. more
Image : http://www.weewatch.com/wp-content/uploads/2016/01/28fc45d.jpg
60+
https://store.wso2.com/store/assets/analyticsextension/list
High Availability with 2 Nodes
• 2 node minimum HA
– Process upto 100k
events/sec
– While most other stream processing
systems need around 5+ nodes
• Does not require Kafka
• Incremental state persistence
and recovery
• Multi data center support
Stream Processor
Stream Processor
Event Sources
Dashboard
Notification
Invocation
Data Source
Siddhi App
Siddhi App
Siddhi App
Siddhi App
Siddhi App
Siddhi App
Event
Store
• Exactly-once
processing
• Fault tolerance
• Highly scalable
• No back pressure
• Distributed via
annotations
• Native support for
Kubernetes
Scaling with Distributed Deployment
INDUSTRY USE CASES
Finance and Banking
Use Case 1 - Fraud Detection
• Detecting fraud via known patterns using generic
rules
• Detecting unknown types of fraud via machine
learning
• Detecting rare activity sequences using Markov
Modeling
• Reduce false alarms using fraud scoring
• Caught them in the act - what next?
Demo: https://goo.gl/xo6Wf5
Use Case 2 - Risk Management
• Finding real-time Value at Risk (VaR)
– Historical simulation
– Variance-covariance
– Monte Carlo simulation
• Identifying Front Running with Patterns
Use Case 3 - Stock Market Surveillance
Hey Jude, Mike is
going to buy large qty
of ABC at $21. You
better buy now!
Great! I
bought. ABC
is just $18.9
right now!
Trade 1
Followed by
Trade 2
Jude sells to Mike at
$21.
Broker: Bob
Client Client
MikeJude
Use Case 3 - Stock Market Surveillance
• Identifying Pump with Regression
• Identifying signs of Insider Dealing
• Model “Perfect Trader” in order to detect
fraudsters
Retail
Use Case 1 - Recommendations
• Recommendations based on the buying
products
• Recommendations based on the buying history
of the customer
• Seasonal recommendations
• Contextual, intelligent recommendations
Use Case 2 - Ad Optimization
• Display personalized advertisements on online
shopping stores
– by identifying person’s living location
– by identifying person’s buying history
– by identifying person’s interest
• Optimize displaying advertisements on shopping
stores
– what impact the Ad made? Is it worth the cost?
• Send personalized information when a
customer enters the store
• If a customer is spending a considerable
amount of time near a shop, suggest offers,
send an agent to him etc.
Use Case 3 - Proximity Marketing
• Detect that the customer leaves the store and send
him a gift coupon
• Number of customers & employees in each floor
(heat maps)
• Number of customers & average time spent per
product
• Shopping path, purchased items, average time spent
by each customer
• Overall statistics about given and used offers
Use Case 3 - Proximity Marketing
Location Analytics
Use Case - Fleet Management
• Real-time monitoring - where is your fleet now?,
Visualize your fleet
• Geo-fencing based alerts - get alerted if a driver
exceeds a defined speed limit within an interactively
given geo area
• Predicting travel times - historical data can be used
to build a machine learning model in order to
predict travel times in advance and alert the
subscribers
Operations Analytics
Use Cases
• Collect data from different stages of business
operations
• Tracking the progress of your business operations
• Detect Service Level Agreement violations at each
stage and generate alerts
• Visualize your business operations real-time
• Generate alerts and notify the subscribers
DEMO
Setup
Download and Tryout
WSO2 Stream Processor
https://wso2.com/analytics-and-stream-processing/
Documentation
Further Reading
● WSO2 Stream Processor: Making Real-time Stream Processing Available to the Masses
● Making Real-Time Applications Simpler with WSO2 Stream Processor
● How to use a Stream Processor as a Notification Manager?
● Synchronous Request-Response based Real-time Processing with WSO2 SP
● Distributed Stream Processing with WSO2 SP
THANK YOU
wso2.com

More Related Content

What's hot

Api centric enterprises
Api centric enterprisesApi centric enterprises
Api centric enterprises
WSO2
 
Best Practices for Productizing APIs with API Management and Automated Testing
Best Practices for Productizing APIs with API Management and Automated TestingBest Practices for Productizing APIs with API Management and Automated Testing
Best Practices for Productizing APIs with API Management and Automated Testing
WSO2
 

What's hot (20)

apidays LIVE New York 2021 - Managing the usage of Asynchronous APIs: What do...
apidays LIVE New York 2021 - Managing the usage of Asynchronous APIs: What do...apidays LIVE New York 2021 - Managing the usage of Asynchronous APIs: What do...
apidays LIVE New York 2021 - Managing the usage of Asynchronous APIs: What do...
 
Hybrid integration platform reference architecture
Hybrid integration platform reference architectureHybrid integration platform reference architecture
Hybrid integration platform reference architecture
 
Role of API Management in an API led Digital Economy
Role of API Management in an API led Digital EconomyRole of API Management in an API led Digital Economy
Role of API Management in an API led Digital Economy
 
[WSO2 Summit Americas 2020] Creating Smart Endpoints Using Integration Micros...
[WSO2 Summit Americas 2020] Creating Smart Endpoints Using Integration Micros...[WSO2 Summit Americas 2020] Creating Smart Endpoints Using Integration Micros...
[WSO2 Summit Americas 2020] Creating Smart Endpoints Using Integration Micros...
 
Architecting SaaS
Architecting SaaSArchitecting SaaS
Architecting SaaS
 
Api centric enterprises
Api centric enterprisesApi centric enterprises
Api centric enterprises
 
Best Practices for Productizing APIs with API Management and Automated Testing
Best Practices for Productizing APIs with API Management and Automated TestingBest Practices for Productizing APIs with API Management and Automated Testing
Best Practices for Productizing APIs with API Management and Automated Testing
 
Achieving Microservices Maturity
Achieving Microservices MaturityAchieving Microservices Maturity
Achieving Microservices Maturity
 
Flavours of APIs
Flavours of APIs Flavours of APIs
Flavours of APIs
 
How to Enable Monetization of Your API Ecosystem
How to Enable Monetization of Your API EcosystemHow to Enable Monetization of Your API Ecosystem
How to Enable Monetization of Your API Ecosystem
 
Leveraging Async APIs to deliver cross domain agile collaboration
Leveraging Async APIs to deliver cross domain agile collaboration Leveraging Async APIs to deliver cross domain agile collaboration
Leveraging Async APIs to deliver cross domain agile collaboration
 
API Services: Harness the Power of Enterprise Infrastructure
API Services: Harness the Power of Enterprise InfrastructureAPI Services: Harness the Power of Enterprise Infrastructure
API Services: Harness the Power of Enterprise Infrastructure
 
[WSO2 API Manager Community Call: Streaming API Support in WSO2 API Manager 4.0
[WSO2 API Manager Community Call: Streaming API Support in WSO2 API Manager 4.0[WSO2 API Manager Community Call: Streaming API Support in WSO2 API Manager 4.0
[WSO2 API Manager Community Call: Streaming API Support in WSO2 API Manager 4.0
 
AWS Api Gateway by Łukasz Marchewka Scalacc
AWS Api Gateway by Łukasz Marchewka ScalaccAWS Api Gateway by Łukasz Marchewka Scalacc
AWS Api Gateway by Łukasz Marchewka Scalacc
 
Apigee Insights: Data & Context-Driven Actions
Apigee Insights: Data & Context-Driven ActionsApigee Insights: Data & Context-Driven Actions
Apigee Insights: Data & Context-Driven Actions
 
5 Pillars of Building Enterprise0grade APIs
5 Pillars of Building Enterprise0grade APIs5 Pillars of Building Enterprise0grade APIs
5 Pillars of Building Enterprise0grade APIs
 
Lessons Learned from Building Enterprise APIs (Gustaf Nyman)
Lessons Learned from Building Enterprise APIs (Gustaf Nyman)Lessons Learned from Building Enterprise APIs (Gustaf Nyman)
Lessons Learned from Building Enterprise APIs (Gustaf Nyman)
 
Which APIs? which business models - A real-world guide for African banks.
Which APIs? which business models - A real-world guide for African banks.Which APIs? which business models - A real-world guide for African banks.
Which APIs? which business models - A real-world guide for African banks.
 
[WSO2 API Day Toronto 2019] Cloud-native Integration for the Enterprise
[WSO2 API Day Toronto 2019] Cloud-native Integration for the Enterprise[WSO2 API Day Toronto 2019] Cloud-native Integration for the Enterprise
[WSO2 API Day Toronto 2019] Cloud-native Integration for the Enterprise
 
API workshop by AWS and 3scale
API workshop by AWS and 3scaleAPI workshop by AWS and 3scale
API workshop by AWS and 3scale
 

Similar to Stream Processing in Action

Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...
Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...
Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...
Data Con LA
 
IEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slidesIEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slides
Nish Parikh
 

Similar to Stream Processing in Action (20)

WSO2Con USA 2017: Analytics Patterns for Your Digital Enterprise
WSO2Con USA 2017: Analytics Patterns for Your Digital EnterpriseWSO2Con USA 2017: Analytics Patterns for Your Digital Enterprise
WSO2Con USA 2017: Analytics Patterns for Your Digital Enterprise
 
Analytics Patterns for Your Digital Enterprise
Analytics Patterns for Your Digital EnterpriseAnalytics Patterns for Your Digital Enterprise
Analytics Patterns for Your Digital Enterprise
 
Solutions Using WSO2 Analytics
Solutions Using WSO2 AnalyticsSolutions Using WSO2 Analytics
Solutions Using WSO2 Analytics
 
Advanced Topics - Session 3 - Optimizing AWS Applications
Advanced Topics - Session 3 - Optimizing AWS ApplicationsAdvanced Topics - Session 3 - Optimizing AWS Applications
Advanced Topics - Session 3 - Optimizing AWS Applications
 
WSO2Con USA 2017: Scalable Real-time Complex Event Processing at Uber
WSO2Con USA 2017: Scalable Real-time Complex Event Processing at UberWSO2Con USA 2017: Scalable Real-time Complex Event Processing at Uber
WSO2Con USA 2017: Scalable Real-time Complex Event Processing at Uber
 
Webinar: Analytics with NoSQL: Why, for What, and When?
Webinar: Analytics with NoSQL: Why, for What, and When?Webinar: Analytics with NoSQL: Why, for What, and When?
Webinar: Analytics with NoSQL: Why, for What, and When?
 
Clickstream analytics with Markov Chains
Clickstream analytics with Markov ChainsClickstream analytics with Markov Chains
Clickstream analytics with Markov Chains
 
Algorithmic Trading
Algorithmic TradingAlgorithmic Trading
Algorithmic Trading
 
Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...
Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...
Data Science Out of The Box : Case Studies in the Telecommunication by Anand ...
 
IEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slidesIEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slides
 
Large scale Click-streaming and tranaction log mining
Large scale Click-streaming and tranaction log miningLarge scale Click-streaming and tranaction log mining
Large scale Click-streaming and tranaction log mining
 
Fraud prevention is better with TigerGraph inside
Fraud prevention is better with  TigerGraph insideFraud prevention is better with  TigerGraph inside
Fraud prevention is better with TigerGraph inside
 
Operational-Analytics
Operational-AnalyticsOperational-Analytics
Operational-Analytics
 
Optimizing Your AWS Apps & Usage to Reduce Costs - IP Expo
Optimizing Your AWS Apps & Usage to Reduce Costs - IP ExpoOptimizing Your AWS Apps & Usage to Reduce Costs - IP Expo
Optimizing Your AWS Apps & Usage to Reduce Costs - IP Expo
 
Events and microservices
Events and microservicesEvents and microservices
Events and microservices
 
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon KinesisDay 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
 
Bigdata based fraud detection
Bigdata based fraud detectionBigdata based fraud detection
Bigdata based fraud detection
 
Hadoop application architectures - Fraud detection tutorial
Hadoop application architectures - Fraud detection tutorialHadoop application architectures - Fraud detection tutorial
Hadoop application architectures - Fraud detection tutorial
 
SnappyData @ Seattle Spark Meetup
SnappyData @ Seattle Spark MeetupSnappyData @ Seattle Spark Meetup
SnappyData @ Seattle Spark Meetup
 
AWS re:Invent 2016: Beeswax: Building a Real-Time Streaming Data Platform on ...
AWS re:Invent 2016: Beeswax: Building a Real-Time Streaming Data Platform on ...AWS re:Invent 2016: Beeswax: Building a Real-Time Streaming Data Platform on ...
AWS re:Invent 2016: Beeswax: Building a Real-Time Streaming Data Platform on ...
 

More from WSO2

More from WSO2 (20)

Accelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with PlatformlessAccelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with Platformless
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
architecting-ai-in-the-enterprise-apis-and-applications.pdf
architecting-ai-in-the-enterprise-apis-and-applications.pdfarchitecting-ai-in-the-enterprise-apis-and-applications.pdf
architecting-ai-in-the-enterprise-apis-and-applications.pdf
 
Driving Innovation: Scania's API Revolution with WSO2
Driving Innovation: Scania's API Revolution with WSO2Driving Innovation: Scania's API Revolution with WSO2
Driving Innovation: Scania's API Revolution with WSO2
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
 
WSO2CON 2024 Slides - Unlocking Value with AI
WSO2CON 2024 Slides - Unlocking Value with AIWSO2CON 2024 Slides - Unlocking Value with AI
WSO2CON 2024 Slides - Unlocking Value with AI
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
WSO2CON 2024 - Elevating the Integration Game to the Cloud
WSO2CON 2024 - Elevating the Integration Game to the CloudWSO2CON 2024 - Elevating the Integration Game to the Cloud
WSO2CON 2024 - Elevating the Integration Game to the Cloud
 
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & InnovationWSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
 
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
 
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaS
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
 
WSO2CON 2024 - Software Engineering for Digital Businesses
WSO2CON 2024 - Software Engineering for Digital BusinessesWSO2CON 2024 - Software Engineering for Digital Businesses
WSO2CON 2024 - Software Engineering for Digital Businesses
 
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
 
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
 

Recently uploaded

Recently uploaded (20)

Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Strategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering TeamsStrategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering Teams
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Buy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptxBuy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptx
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
THE BEST IPTV in GERMANY for 2024: IPTVreel
THE BEST IPTV in  GERMANY for 2024: IPTVreelTHE BEST IPTV in  GERMANY for 2024: IPTVreel
THE BEST IPTV in GERMANY for 2024: IPTVreel
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 

Stream Processing in Action

  • 1. Stream Processing in Action Webinar Nirmal Fernando Senior Lead Solutions Engineer, WSO2 Nov, 2018
  • 2. Contents • Introduction • WSO2 Stream Processor Overview • Industry Use Cases • Demo
  • 3. Diverse Industries -> Unique Challenges http://www.solgenie.com/industries/
  • 4. “The price of light is less than the cost of darkness.” - Arthur C. Nielsen, Market Researcher & Founder of ACNielsen
  • 5. Value of Insights Degrade Fast http://www.history.com/news/ask-history/who-determined-the-speed-of-light
  • 7. Stream Processor Core WSO2 ANALYTICS OFFERING 7 ▪ Consumes events, and publish alerts and summarizations to and from various enterprise systems. ▪ Event Processor Core with Streaming Complex Event Processing, Incremental Time Series Aggregations, and Streaming Machine Learning. ▪ Stream Processing Functionalities via Extension Store ▪ High Available and Scalable Analytics Fabric ▪ Prebuilt and custom analytics solutions Events JMS, Thrift, SMTP, HTTP, MQTT, Kafka Analytics Fabric Complex Event Processing Incremental Time Series Aggregation Machine Learning Extension Store FinancialandBanking Analytics RetailAnalytics LocationAnalytics OperationalAnalytics SmartEnergyAnalytics Custom Analytics Solutions ... Solutions
  • 9. WSO2 Stream Processor An open source, cloud-native analytics product optimized to create real-time, actionable insights for agile digital businesses. 9
  • 11. 1. Data collection 2. Data cleansing 3. Data transformation 4. Data enrichment 5. Data summarization 6. Rule processing 7. Machine Learning & Artificial Intelligence 8. Data pipelining 9. Data Publishing 10. On demand processing 11. Data Presentation Stream Processing Patterns
  • 12. ● Lightweight, lean, and high performance ● Best suited for ○ Streaming Data Integration, ○ Streaming Analytics ● Streaming SQL & graphical drag-and-drop editor ● Multiple deployment options ○ Process data at the edge (java, python) ○ Micro Stream Processing ○ High availability with 2 nodes ○ Highly scalable distributed deployments ● Support for streaming ML & Long running aggregations ● Monitoring tools and citizen integration options WSO2 Stream Processor
  • 13. • Source and Sinks – HTTP, Kafka, TCP, Email, JMS, File, Rabbitmq, MQTT, Web-Socket, Twitter, Amazon SQS • Message Formats – JSON, XML, Text, Binary, Key-value, CSV • Data Stores – RDBMS, Solr, MongoDB, HBase, Cassandra, Elasticsearch, Hazelcast, Redis Supported connectors
  • 14. (Streaming) Machine Learning ▪ Running PMML Models for predictions - Built via Apache Spark MLlib, Python, H2O.ai (for deep learning algos) or R - Export as PMML - Load precreated PMML Model into Siddhi to predict in Realtime ▪ Supporting native models for predictions - Spark MLlib Models, Java based Tensorflow Models ▪ Online Learning and predictions - Regression Analytics - Data Classification - K-Means Clustering - Markov Models - Anomaly Detections - … more on the way
  • 15. Supported Extensions • Geo graphical processing • NLP • Graph • Reordering • Timeseries • Import Machine Learning models – PMML, TensorFlow, etc • Streaming Machine Learning – Clustering, Classification, Regression – Makove Models, Anomaly detection, etc. • .. more Image : http://www.weewatch.com/wp-content/uploads/2016/01/28fc45d.jpg 60+ https://store.wso2.com/store/assets/analyticsextension/list
  • 16. High Availability with 2 Nodes • 2 node minimum HA – Process upto 100k events/sec – While most other stream processing systems need around 5+ nodes • Does not require Kafka • Incremental state persistence and recovery • Multi data center support Stream Processor Stream Processor Event Sources Dashboard Notification Invocation Data Source Siddhi App Siddhi App Siddhi App Siddhi App Siddhi App Siddhi App Event Store
  • 17. • Exactly-once processing • Fault tolerance • Highly scalable • No back pressure • Distributed via annotations • Native support for Kubernetes Scaling with Distributed Deployment
  • 20. Use Case 1 - Fraud Detection • Detecting fraud via known patterns using generic rules • Detecting unknown types of fraud via machine learning • Detecting rare activity sequences using Markov Modeling • Reduce false alarms using fraud scoring • Caught them in the act - what next? Demo: https://goo.gl/xo6Wf5
  • 21. Use Case 2 - Risk Management • Finding real-time Value at Risk (VaR) – Historical simulation – Variance-covariance – Monte Carlo simulation
  • 22. • Identifying Front Running with Patterns Use Case 3 - Stock Market Surveillance Hey Jude, Mike is going to buy large qty of ABC at $21. You better buy now! Great! I bought. ABC is just $18.9 right now! Trade 1 Followed by Trade 2 Jude sells to Mike at $21. Broker: Bob Client Client MikeJude
  • 23. Use Case 3 - Stock Market Surveillance • Identifying Pump with Regression • Identifying signs of Insider Dealing • Model “Perfect Trader” in order to detect fraudsters
  • 25. Use Case 1 - Recommendations • Recommendations based on the buying products • Recommendations based on the buying history of the customer • Seasonal recommendations • Contextual, intelligent recommendations
  • 26. Use Case 2 - Ad Optimization • Display personalized advertisements on online shopping stores – by identifying person’s living location – by identifying person’s buying history – by identifying person’s interest • Optimize displaying advertisements on shopping stores – what impact the Ad made? Is it worth the cost?
  • 27. • Send personalized information when a customer enters the store • If a customer is spending a considerable amount of time near a shop, suggest offers, send an agent to him etc. Use Case 3 - Proximity Marketing
  • 28. • Detect that the customer leaves the store and send him a gift coupon • Number of customers & employees in each floor (heat maps) • Number of customers & average time spent per product • Shopping path, purchased items, average time spent by each customer • Overall statistics about given and used offers Use Case 3 - Proximity Marketing
  • 30. Use Case - Fleet Management • Real-time monitoring - where is your fleet now?, Visualize your fleet • Geo-fencing based alerts - get alerted if a driver exceeds a defined speed limit within an interactively given geo area • Predicting travel times - historical data can be used to build a machine learning model in order to predict travel times in advance and alert the subscribers
  • 32. Use Cases • Collect data from different stages of business operations • Tracking the progress of your business operations • Detect Service Level Agreement violations at each stage and generate alerts • Visualize your business operations real-time • Generate alerts and notify the subscribers
  • 33. DEMO
  • 34. Setup
  • 35. Download and Tryout WSO2 Stream Processor https://wso2.com/analytics-and-stream-processing/ Documentation
  • 36. Further Reading ● WSO2 Stream Processor: Making Real-time Stream Processing Available to the Masses ● Making Real-Time Applications Simpler with WSO2 Stream Processor ● How to use a Stream Processor as a Notification Manager? ● Synchronous Request-Response based Real-time Processing with WSO2 SP ● Distributed Stream Processing with WSO2 SP