SlideShare a Scribd company logo
Bas Geerdink & Martijn Visser, 13 September 2017
Streaming Analytics at
IT Chapter Lead at ING Analytics
Academic background: A.I. & Informatics
Working in IT since 2004
At ING since 2013
@bgeerdink https://nl.linkedin.com/in/geerdink
Bas Geerdink
Product Owner / Data Analyst at ING
Academic background: ICT & Law
Working with data since 2008
At ING since 2013
@martijnvisser82 https://nl.linkedin.com/in/martijnvisser1982
Martijn Visser
Strategy
ING is a top financial enterprise, operating since 1881
Customers
35 Million
Private, Corporate and
Institutional Customers
Countries
40+
In Europe, Asia,
Australia, North and
South America
Employees
51,000
Market leaders Benelux
Growth markets
Commercial Banking
Challengers
5
ING’s Think Forward strategy requires a data-driven approach
6
1. Earn the primary relationship
2. Develop analytics skills to understand our customers better
3. Increase the pace of innovation to serve changing customer needs
4. Think beyond traditional banking to develop new services and business models
Empowering people to stay a step
ahead in life and in business.
Simplify &
Streamline
Operational
Excellence
Performance
Culture
Lending
Capabilities
Purpose
Customer
Promise
Strategic
Priorities
Enablers
Creating a differentiating customer experience
Clear and Easy Anytime, Anywhere Empower Keep Getting Better
We’re making a shift from batch to real-time
7
Secure and reliable
Relevant
Personal
Omnichannel
Predictive Actionable
We’re building a Streaming Analytics platform with
high-end capabilities
8
• High performance & low latency to manage and process
users’ events.
• Scalability & availability to build new features, scenarios
and integration with other external systems.
• Fault tolerance mechanism to ensure high quality output
with strong information consistency.
• Online machine learning and business rules management
capabilities.
• Fraud detection:
• Detect possible fraudulent transactions
• Identify possible money laundering accounts
• Actionable Insights:
• Small financial insights and robotized advise
• Customer-defined alerts and push notifications
• Offer chat/call when customers don’t finish their customer journey
• Marketing and commercial offerings (‘Next Best Action’)
Main use cases for a Streaming Analytics Platform
9
• Wisdom of the crowd:
• making use of similar customers to make better predictions
• Online learning:
• making use of customer feedback in real-time to train models
• Beyond banking:
• enabling IoT
• robo advise
Future scenarios
10
Use case: Insights
Do people still need Banks, or just Banking?
Whoever controls the customer experience controls the customer value.
Alibaba, the most
valuable retailer,
has no inventory
Uber, the world’s
largest taxi
company owns no
fleet
AirBnB, the largest
accomodation
provider owns no
real estate
Facebook, the most
popular media
owner, creates no
content
We want our environment to be the front and center of customer financial
needs.
Problem: apps & site going everywhere
Current
One app to rule them all… ??
14
Omnichannel
Design for One Experience
• Actionable Insights rather than data –
but select problems that are easy for
machines but diffiicult for people e.g.
forecasting
• Real time alerts can be the real
differentiators e.g. when a card auth is
received, when a customer makes a
payment that he has made a few times
previously
• Hierarchy of alerts – absolutely tell me
when things are going wrong but
occasionallly celebrate with me too
• UI adapts to confidence in AI. E.g. The
message diiffers when we have high
confidence in forecast
• UI also needs to adapt to customer
behaviours e.g. easy login when a
customer logs in to the same device
• The key role of AI is to help me stay a
step ahead in life – so UI needs to make
it easy to find items to act upon
• Digital experience is personalised to what’s
important to me
• Make the benefit of using your system clear
to me immediately rather than making me
discover the benefit
• I expect consistent experience Everywhere
e.g. once I dismiss an action online, I don’t
want to be prompted for it on mobile again
• I am used to certain way of working e.g. I am
used to settings being in one place, and I am
used to certain interaction patterns. So reduce
my learning curve by reusing those
AI + UI + I
Formula for creating compelling digital experience
AI + UI + I
New Experience
18
Actionable Insights
19
Decision Engine: Batch Data Processing & APIs
Insert in cache
Send resultsApplication
Executes SQL
selections to
determine Insights
Data Lake
20
MVP: Batch implementation
Data warehouse
Determine
Insights
XFB File Transfer
Cassandra
[ODBC]
Realtime
Validator
InsightsAPI
Inbound response
Inbound request
1 API request response for
applicable Insights and rules
2 API request response
System Of Records
3 API request response
4 API request response for
Insights Content
Actionable Insights
21
If only we had realtime capabilities…
22
Streaming Data Platform
The Streaming Data Platform has to be probabilistic in nature
24
Deterministic Probabilistic
• Business rules / decision trees
• Fixed logic (if-then-else)
• Rigid
• Machine learning
• Fuzzy logic
• Flexible
High-level Streaming data platform architecture
25
Streaming Data Platform Architecture
RawToRelevance Model Execution Engine Post-Processor
“filter and detect
pattern”
“determine
relevancy”
“produce
notification/alert”
Application Flow:
Business Flow:
“Data generator”
Formats outgoing messages
Get
Customer
Profile
Apply
Selection
Criteria
Apply
Model
Detect
Pattern
Get
Intermed
Event
Filter
Send
Output
Create
Intermed
Event
Get
Business
Event
Create
Business
Event
Format
Event
“Data gatherer”
Load stream of raw events and
filter/combine/aggregate them into a
meaningful business event
“Event scorer”
Scoring of PMML models to determine
the personal and relevant follow-up
actions
Get Raw
Events
Streaming Data Platform Architecture – Example 1
Application Flow:
Business Flow:
-150 -100 -100
Transactions
9:14
AMS
10:10
AMS
10:12
BER
Prediction:
Possible fraudulaus
transaction
Alert:
Block transaction
Option 1:
Do nothing
Option 2:
Stop money flow
Option 3:
Block card
0.12
0.91
0.73
RawToRelevance Model Execution Engine Post-Processor
“filter and detect
pattern”
“determine
relevancy”
“produce
notification/alert”
Sliding window,
triggered by transaction
10:22
Pattern:
Magic carpet
Streaming Data Platform Architecture – Example 1
28
Not actual ING code
Streaming Data Platform Architecture – Example 2
Application Flow:
Business Flow:
-101 -13 -98
Transactions
13:58 14:24 15:02
Prediction:
Customer balance will
be zero at 16:11
Pattern:
Customer is shopping
Alert:
Ask customer to
transfer money
Option 1:
Do nothing
Option 2:
Send Whatsapp
Option 3:
Transfer money
0.34
0.13
0.57
-71
15:22
RawToRelevance Model Execution Engine Post-Processor
“filter and detect
pattern”
“determine
relevancy”
“produce
notification/alert”
Sliding window,
triggered by balance < 400
15:22
Streaming Data Platform Architecture – Example 3
Application Flow:
Business Flow:
P1 P2 P3
Page visits
18:01 18:03 18:06
Prediction:
Customer is unclear
what to do
Pattern:
Customer is
interested in a loan
Helpdesk calls
customer to give
support
Option 1:
Do nothing
Option 2:
Call customer
Option 3:
Send email
0.23
0.81
0.61
P4
RawToRelevance Model Execution Engine Post-Processor
“filter and detect
pattern”
“determine
relevancy”
“produce
notification/alert”
18:21
Sliding window,
triggered first page hit
Streaming Data Platform Architecture – Example 3
31
Not actual ING code
Streaming Data Platform Architecture
32
Application Flow :
Kafka Events:
Business Flow:
Business
Users
System configuration:
Machine
Learning
Environment
Configuration
GUI
(future scope) Data
Scientists Feedback loop
DSL
Offline training
RawToRelevance Model Execution Engine Post-Processor
“filter and detect
pattern”
“determine
relevancy”
“produce
notification/alert”
Get
Customer
Profile
Apply
Selection
Criteria
Apply
Model
Detect
Pattern
Get
Intermed
Event
Filter
Send
Output
Create
Intermed
Event
Get
Business
Event
Create
Business
Event
Format
Event
Get Raw
Events
Raw
Event
Business
Event
Alert / Notification
Event
Intermediate
Event
openscoring.io
Control
Stream
Online scoring
Error
Stream
Technology
Benefits for ING:
• true streaming: per-event processing, no micro-batching
• open source, active community
• high volumes, low latency
• connectors to Kafka
• enterprise ready
• Scala api
Features that we use:
• state management and fault tolerancy: savepointing, checkpointing  exactly-once
semanctics
• time windowing: event time, flexible (e.g. sum all transactions in the past minute)
• Complex Event Processing
ING chose Apache Flink for its Streaming Data Platform
34
Performance and scalability tests
35
PMML is the de facto standard for model representation
• XML-based interchange format for predictive models
• Support of hot-replacing an algorithm (configuration) at runtime by just providing an updated PMML
model and interpreting (scoring) it with openscoring.io
• PMML models can be created by a large number of tools, including: Spark, SAS, R, Knime, Python, H20.ai
• Large number of supported machine learning models
• Working together with the open source community on FlinkML
• Next steps: PFA, TensorFlow
PMML (Predictive Model Markup Language) bridges the gap
between data scientists and data engineers
36
Type Model support
Simple business rules • Rulesets, score cards
Complex business rules • Sequences, association rules
Predictive models • Naive Bayes
• k-Nearest Neighbors
• Regression: linear, logistic, regularized
• Trees: decision trees, random forest
• Support Vector Machines
Advanced models • Bayesian Network
• Neural Network (Deep Learning)
• Gaussian Process
Other • Time Series
• Cluster Models
Openscoring.io
PMML Example
37
Not actual ING code
• Cross functional squads
• Business, dev, ops, data scientist
• Reusing ING standard building blocks and processes
• Infrastructure, monitoring
• Continuous integration & delivery in DTAP environment
• Develop: Gitlab  Jenkins  Artifactory
• Using Flink’s savepointing mechanism
• No data loss, guaranteed
• Limited downtime
• Scalability, resilience and robustness
• Flink gives us automatic restart on failure, using checkpointing mechnism
• Rolling updates of infrastructure and middleware leads to no downtime
Flink fits the development & operations model with the ING way
of working
38
Design
Develop
TestDeploy
Monitor
Squad
In ING’s one way of working, ‘business’ and ‘IT’ go hand-in-hand
SquadSquad
Chapter
Guild
DS
DA
OpsOps
CJE
DS
DA
DS
CJE
DS
Ops
DS
CJE
Dev DevDevDevDev
Tribe
Conclusions
 ING is an “IT company with a banking licence”, whose competitive advantage is in its use
of data and analytics. To excel, we’ll handle bigger volumes of streaming data with higher
performance demands.
 ING build a streaming analytics platform as a generic, multi-purpose solution with fraud
detection and actionable insights as it first use cases.
 Enterprise-readyness and security of Flink is continuously improving; although major
improvement points are on the topics of data retention and lineage (GDPR requirements).
 With PMML we have a standard format that is very well suited for multidisciplinary
teams, where data scientists work on the same user stories as engineers.
 We set up a streaming analytics community within ING, and work together with industry
leaders (Lightbend, DataArtisans).
Why is Streaming Analytics with Flink the right choice for ING,
moving forward?
42
43
Questions
We’re hiring!
Follow us to stay a step ahead
ING.com
YouTube.com/ING
SlideShare.net/ING@ING_News LinkedIn.com/company/ING
Flickr.com/INGGroupFacebook.com/ING

More Related Content

Similar to Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - buidling a streaming data platform with Flink and Kafka

Moving To MicroServices
Moving To MicroServicesMoving To MicroServices
Moving To MicroServices
David Walker
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
Big Data User Group Karlsruhe/Stuttgart
 
IBM i & Data Science in the AI era.
IBM i & Data Science in the AI era.  IBM i & Data Science in the AI era.
IBM i & Data Science in the AI era.
Benoit Marolleau
 
EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5John Nikolaidis
 
Big Data LDN 2018: DATA SCIENCE AT ING
Big Data LDN 2018: DATA SCIENCE AT INGBig Data LDN 2018: DATA SCIENCE AT ING
Big Data LDN 2018: DATA SCIENCE AT ING
Matt Stubbs
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Capgemini
 
DFS21_Main Stage_Steve Butcher_Microsoft_211130
DFS21_Main Stage_Steve Butcher_Microsoft_211130DFS21_Main Stage_Steve Butcher_Microsoft_211130
DFS21_Main Stage_Steve Butcher_Microsoft_211130
FinTech Belgium
 
Digital Platfrom 4 Summary
Digital Platfrom 4 SummaryDigital Platfrom 4 Summary
Digital Platfrom 4 SummaryIan Thomas
 
I Love APIs Europe 2015: Technical Sessions
I Love APIs Europe 2015: Technical SessionsI Love APIs Europe 2015: Technical Sessions
I Love APIs Europe 2015: Technical Sessions
Apigee | Google Cloud
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
VoltDB
 
Kochi mulesoft meetup 02
Kochi mulesoft meetup 02Kochi mulesoft meetup 02
Kochi mulesoft meetup 02
sumitahuja94
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends?
Karan Sachdeva
 
Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story
Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story
Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story
confluent
 
APIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIs
APIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIsAPIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIs
APIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIs
Jeremy Brown
 
Three Dimensions of Data as a Service
Three Dimensions of Data as a ServiceThree Dimensions of Data as a Service
Three Dimensions of Data as a Service
Denodo
 
BSFI Technology Offerings by Value Innovation Labs
BSFI Technology Offerings by Value Innovation LabsBSFI Technology Offerings by Value Innovation Labs
BSFI Technology Offerings by Value Innovation Labs
Mount Talent Consulting
 
Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...
Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...
Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...
Solace
 
FSI - Digital Transformation.pdf
FSI - Digital Transformation.pdfFSI - Digital Transformation.pdf
FSI - Digital Transformation.pdf
YasmineBoudhina
 

Similar to Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - buidling a streaming data platform with Flink and Kafka (20)

Moving To MicroServices
Moving To MicroServicesMoving To MicroServices
Moving To MicroServices
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
IBM i & Data Science in the AI era.
IBM i & Data Science in the AI era.  IBM i & Data Science in the AI era.
IBM i & Data Science in the AI era.
 
EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5
 
Big Data LDN 2018: DATA SCIENCE AT ING
Big Data LDN 2018: DATA SCIENCE AT INGBig Data LDN 2018: DATA SCIENCE AT ING
Big Data LDN 2018: DATA SCIENCE AT ING
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
 
DFS21_Main Stage_Steve Butcher_Microsoft_211130
DFS21_Main Stage_Steve Butcher_Microsoft_211130DFS21_Main Stage_Steve Butcher_Microsoft_211130
DFS21_Main Stage_Steve Butcher_Microsoft_211130
 
Digital Platfrom 4 Summary
Digital Platfrom 4 SummaryDigital Platfrom 4 Summary
Digital Platfrom 4 Summary
 
I Love APIs Europe 2015: Technical Sessions
I Love APIs Europe 2015: Technical SessionsI Love APIs Europe 2015: Technical Sessions
I Love APIs Europe 2015: Technical Sessions
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
 
Kochi mulesoft meetup 02
Kochi mulesoft meetup 02Kochi mulesoft meetup 02
Kochi mulesoft meetup 02
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends?
 
Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story
Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story
Kafka Summit NYC 2017 - The Real-time Event Driven Bank: A Kafka Story
 
APIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIs
APIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIsAPIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIs
APIdays Open Banking & Fintech: Workshop - Financial Services Use Cases for APIs
 
Three Dimensions of Data as a Service
Three Dimensions of Data as a ServiceThree Dimensions of Data as a Service
Three Dimensions of Data as a Service
 
BSFI Technology Offerings by Value Innovation Labs
BSFI Technology Offerings by Value Innovation LabsBSFI Technology Offerings by Value Innovation Labs
BSFI Technology Offerings by Value Innovation Labs
 
Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...
Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...
Gartner CIO & IT Executive Summit -- Event Mesh: The Architecture Layer That ...
 
FSI - Digital Transformation.pdf
FSI - Digital Transformation.pdfFSI - Digital Transformation.pdf
FSI - Digital Transformation.pdf
 

More from Flink Forward

Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...
Flink Forward
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in Flink
Flink Forward
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
Flink Forward
 
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Flink Forward
 
Introducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorIntroducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes Operator
Flink Forward
 
Autoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive Mode
Flink Forward
 
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Flink Forward
 
One sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkOne sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async Sink
Flink Forward
 
Tuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxTuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptx
Flink Forward
 
Flink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at Pinterest
Flink Forward
 
Apache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraApache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native Era
Flink Forward
 
Where is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkWhere is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in Flink
Flink Forward
 
Using the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentUsing the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production Deployment
Flink Forward
 
The Current State of Table API in 2022
The Current State of Table API in 2022The Current State of Table API in 2022
The Current State of Table API in 2022
Flink Forward
 
Flink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink SQL on Pulsar made easy
Flink SQL on Pulsar made easy
Flink Forward
 
Dynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsDynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data Alerts
Flink Forward
 
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotExactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Flink Forward
 
Processing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesProcessing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial Services
Flink Forward
 
Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...
Flink Forward
 
Batch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergBatch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & Iceberg
Flink Forward
 

More from Flink Forward (20)

Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...Building a fully managed stream processing platform on Flink at scale for Lin...
Building a fully managed stream processing platform on Flink at scale for Lin...
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in Flink
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
 
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
 
Introducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorIntroducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes Operator
 
Autoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive Mode
 
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
 
One sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async SinkOne sink to rule them all: Introducing the new Async Sink
One sink to rule them all: Introducing the new Async Sink
 
Tuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptxTuning Apache Kafka Connectors for Flink.pptx
Tuning Apache Kafka Connectors for Flink.pptx
 
Flink powered stream processing platform at Pinterest
Flink powered stream processing platform at PinterestFlink powered stream processing platform at Pinterest
Flink powered stream processing platform at Pinterest
 
Apache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native EraApache Flink in the Cloud-Native Era
Apache Flink in the Cloud-Native Era
 
Where is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in FlinkWhere is my bottleneck? Performance troubleshooting in Flink
Where is my bottleneck? Performance troubleshooting in Flink
 
Using the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production DeploymentUsing the New Apache Flink Kubernetes Operator in a Production Deployment
Using the New Apache Flink Kubernetes Operator in a Production Deployment
 
The Current State of Table API in 2022
The Current State of Table API in 2022The Current State of Table API in 2022
The Current State of Table API in 2022
 
Flink SQL on Pulsar made easy
Flink SQL on Pulsar made easyFlink SQL on Pulsar made easy
Flink SQL on Pulsar made easy
 
Dynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data AlertsDynamic Rule-based Real-time Market Data Alerts
Dynamic Rule-based Real-time Market Data Alerts
 
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotExactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
 
Processing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial ServicesProcessing Semantically-Ordered Streams in Financial Services
Processing Semantically-Ordered Streams in Financial Services
 
Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...Tame the small files problem and optimize data layout for streaming ingestion...
Tame the small files problem and optimize data layout for streaming ingestion...
 
Batch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & IcebergBatch Processing at Scale with Flink & Iceberg
Batch Processing at Scale with Flink & Iceberg
 

Recently uploaded

tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 

Recently uploaded (20)

tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 

Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - buidling a streaming data platform with Flink and Kafka

  • 1. Bas Geerdink & Martijn Visser, 13 September 2017 Streaming Analytics at
  • 2. IT Chapter Lead at ING Analytics Academic background: A.I. & Informatics Working in IT since 2004 At ING since 2013 @bgeerdink https://nl.linkedin.com/in/geerdink Bas Geerdink
  • 3. Product Owner / Data Analyst at ING Academic background: ICT & Law Working with data since 2008 At ING since 2013 @martijnvisser82 https://nl.linkedin.com/in/martijnvisser1982 Martijn Visser
  • 5. ING is a top financial enterprise, operating since 1881 Customers 35 Million Private, Corporate and Institutional Customers Countries 40+ In Europe, Asia, Australia, North and South America Employees 51,000 Market leaders Benelux Growth markets Commercial Banking Challengers 5
  • 6. ING’s Think Forward strategy requires a data-driven approach 6 1. Earn the primary relationship 2. Develop analytics skills to understand our customers better 3. Increase the pace of innovation to serve changing customer needs 4. Think beyond traditional banking to develop new services and business models Empowering people to stay a step ahead in life and in business. Simplify & Streamline Operational Excellence Performance Culture Lending Capabilities Purpose Customer Promise Strategic Priorities Enablers Creating a differentiating customer experience Clear and Easy Anytime, Anywhere Empower Keep Getting Better
  • 7. We’re making a shift from batch to real-time 7 Secure and reliable Relevant Personal Omnichannel Predictive Actionable
  • 8. We’re building a Streaming Analytics platform with high-end capabilities 8 • High performance & low latency to manage and process users’ events. • Scalability & availability to build new features, scenarios and integration with other external systems. • Fault tolerance mechanism to ensure high quality output with strong information consistency. • Online machine learning and business rules management capabilities.
  • 9. • Fraud detection: • Detect possible fraudulent transactions • Identify possible money laundering accounts • Actionable Insights: • Small financial insights and robotized advise • Customer-defined alerts and push notifications • Offer chat/call when customers don’t finish their customer journey • Marketing and commercial offerings (‘Next Best Action’) Main use cases for a Streaming Analytics Platform 9
  • 10. • Wisdom of the crowd: • making use of similar customers to make better predictions • Online learning: • making use of customer feedback in real-time to train models • Beyond banking: • enabling IoT • robo advise Future scenarios 10
  • 12. Do people still need Banks, or just Banking? Whoever controls the customer experience controls the customer value. Alibaba, the most valuable retailer, has no inventory Uber, the world’s largest taxi company owns no fleet AirBnB, the largest accomodation provider owns no real estate Facebook, the most popular media owner, creates no content We want our environment to be the front and center of customer financial needs.
  • 13. Problem: apps & site going everywhere Current
  • 14. One app to rule them all… ?? 14
  • 16. • Actionable Insights rather than data – but select problems that are easy for machines but diffiicult for people e.g. forecasting • Real time alerts can be the real differentiators e.g. when a card auth is received, when a customer makes a payment that he has made a few times previously • Hierarchy of alerts – absolutely tell me when things are going wrong but occasionallly celebrate with me too • UI adapts to confidence in AI. E.g. The message diiffers when we have high confidence in forecast • UI also needs to adapt to customer behaviours e.g. easy login when a customer logs in to the same device • The key role of AI is to help me stay a step ahead in life – so UI needs to make it easy to find items to act upon • Digital experience is personalised to what’s important to me • Make the benefit of using your system clear to me immediately rather than making me discover the benefit • I expect consistent experience Everywhere e.g. once I dismiss an action online, I don’t want to be prompted for it on mobile again • I am used to certain way of working e.g. I am used to settings being in one place, and I am used to certain interaction patterns. So reduce my learning curve by reusing those AI + UI + I Formula for creating compelling digital experience
  • 17. AI + UI + I New Experience
  • 18. 18
  • 20. Decision Engine: Batch Data Processing & APIs Insert in cache Send resultsApplication Executes SQL selections to determine Insights Data Lake 20 MVP: Batch implementation Data warehouse Determine Insights XFB File Transfer Cassandra [ODBC] Realtime Validator InsightsAPI Inbound response Inbound request 1 API request response for applicable Insights and rules 2 API request response System Of Records 3 API request response 4 API request response for Insights Content Actionable Insights
  • 21. 21
  • 22. If only we had realtime capabilities… 22
  • 24. The Streaming Data Platform has to be probabilistic in nature 24 Deterministic Probabilistic • Business rules / decision trees • Fixed logic (if-then-else) • Rigid • Machine learning • Fuzzy logic • Flexible
  • 25. High-level Streaming data platform architecture 25
  • 26. Streaming Data Platform Architecture RawToRelevance Model Execution Engine Post-Processor “filter and detect pattern” “determine relevancy” “produce notification/alert” Application Flow: Business Flow: “Data generator” Formats outgoing messages Get Customer Profile Apply Selection Criteria Apply Model Detect Pattern Get Intermed Event Filter Send Output Create Intermed Event Get Business Event Create Business Event Format Event “Data gatherer” Load stream of raw events and filter/combine/aggregate them into a meaningful business event “Event scorer” Scoring of PMML models to determine the personal and relevant follow-up actions Get Raw Events
  • 27. Streaming Data Platform Architecture – Example 1 Application Flow: Business Flow: -150 -100 -100 Transactions 9:14 AMS 10:10 AMS 10:12 BER Prediction: Possible fraudulaus transaction Alert: Block transaction Option 1: Do nothing Option 2: Stop money flow Option 3: Block card 0.12 0.91 0.73 RawToRelevance Model Execution Engine Post-Processor “filter and detect pattern” “determine relevancy” “produce notification/alert” Sliding window, triggered by transaction 10:22 Pattern: Magic carpet
  • 28. Streaming Data Platform Architecture – Example 1 28 Not actual ING code
  • 29. Streaming Data Platform Architecture – Example 2 Application Flow: Business Flow: -101 -13 -98 Transactions 13:58 14:24 15:02 Prediction: Customer balance will be zero at 16:11 Pattern: Customer is shopping Alert: Ask customer to transfer money Option 1: Do nothing Option 2: Send Whatsapp Option 3: Transfer money 0.34 0.13 0.57 -71 15:22 RawToRelevance Model Execution Engine Post-Processor “filter and detect pattern” “determine relevancy” “produce notification/alert” Sliding window, triggered by balance < 400 15:22
  • 30. Streaming Data Platform Architecture – Example 3 Application Flow: Business Flow: P1 P2 P3 Page visits 18:01 18:03 18:06 Prediction: Customer is unclear what to do Pattern: Customer is interested in a loan Helpdesk calls customer to give support Option 1: Do nothing Option 2: Call customer Option 3: Send email 0.23 0.81 0.61 P4 RawToRelevance Model Execution Engine Post-Processor “filter and detect pattern” “determine relevancy” “produce notification/alert” 18:21 Sliding window, triggered first page hit
  • 31. Streaming Data Platform Architecture – Example 3 31 Not actual ING code
  • 32. Streaming Data Platform Architecture 32 Application Flow : Kafka Events: Business Flow: Business Users System configuration: Machine Learning Environment Configuration GUI (future scope) Data Scientists Feedback loop DSL Offline training RawToRelevance Model Execution Engine Post-Processor “filter and detect pattern” “determine relevancy” “produce notification/alert” Get Customer Profile Apply Selection Criteria Apply Model Detect Pattern Get Intermed Event Filter Send Output Create Intermed Event Get Business Event Create Business Event Format Event Get Raw Events Raw Event Business Event Alert / Notification Event Intermediate Event openscoring.io Control Stream Online scoring Error Stream
  • 34. Benefits for ING: • true streaming: per-event processing, no micro-batching • open source, active community • high volumes, low latency • connectors to Kafka • enterprise ready • Scala api Features that we use: • state management and fault tolerancy: savepointing, checkpointing  exactly-once semanctics • time windowing: event time, flexible (e.g. sum all transactions in the past minute) • Complex Event Processing ING chose Apache Flink for its Streaming Data Platform 34
  • 36. PMML is the de facto standard for model representation • XML-based interchange format for predictive models • Support of hot-replacing an algorithm (configuration) at runtime by just providing an updated PMML model and interpreting (scoring) it with openscoring.io • PMML models can be created by a large number of tools, including: Spark, SAS, R, Knime, Python, H20.ai • Large number of supported machine learning models • Working together with the open source community on FlinkML • Next steps: PFA, TensorFlow PMML (Predictive Model Markup Language) bridges the gap between data scientists and data engineers 36 Type Model support Simple business rules • Rulesets, score cards Complex business rules • Sequences, association rules Predictive models • Naive Bayes • k-Nearest Neighbors • Regression: linear, logistic, regularized • Trees: decision trees, random forest • Support Vector Machines Advanced models • Bayesian Network • Neural Network (Deep Learning) • Gaussian Process Other • Time Series • Cluster Models Openscoring.io
  • 38. • Cross functional squads • Business, dev, ops, data scientist • Reusing ING standard building blocks and processes • Infrastructure, monitoring • Continuous integration & delivery in DTAP environment • Develop: Gitlab  Jenkins  Artifactory • Using Flink’s savepointing mechanism • No data loss, guaranteed • Limited downtime • Scalability, resilience and robustness • Flink gives us automatic restart on failure, using checkpointing mechnism • Rolling updates of infrastructure and middleware leads to no downtime Flink fits the development & operations model with the ING way of working 38 Design Develop TestDeploy Monitor
  • 39. Squad In ING’s one way of working, ‘business’ and ‘IT’ go hand-in-hand SquadSquad Chapter Guild DS DA OpsOps CJE DS DA DS CJE DS Ops DS CJE Dev DevDevDevDev Tribe
  • 41.  ING is an “IT company with a banking licence”, whose competitive advantage is in its use of data and analytics. To excel, we’ll handle bigger volumes of streaming data with higher performance demands.  ING build a streaming analytics platform as a generic, multi-purpose solution with fraud detection and actionable insights as it first use cases.  Enterprise-readyness and security of Flink is continuously improving; although major improvement points are on the topics of data retention and lineage (GDPR requirements).  With PMML we have a standard format that is very well suited for multidisciplinary teams, where data scientists work on the same user stories as engineers.  We set up a streaming analytics community within ING, and work together with industry leaders (Lightbend, DataArtisans). Why is Streaming Analytics with Flink the right choice for ING, moving forward? 42
  • 43. We’re hiring! Follow us to stay a step ahead ING.com YouTube.com/ING SlideShare.net/ING@ING_News LinkedIn.com/company/ING Flickr.com/INGGroupFacebook.com/ING