SlideShare a Scribd company logo
Depositing value from transactional data
Advanced Analytics Platform for Fraud
Nadeem Gulzar,
Senior Development Manager of Advanced Analytics and Architecture, IMD, CBIT
11
• Senior Development Manager
• Head of Advanced Analytics and Architecture
• IT at Danske Bank
− Approx. 2000 people, more than 2000 systems
• Education:
− M.Sc. from Copenhagen University in Computer Science and
Mathematics, with a bachelor in psychology
• Employed in Danske Bank Group since: 2003
• Experience Summary:
− 15+ years of IT experience, primarily within software
development and management
Nadeem Gulzar
22
Agenda
Big Data team
Big Data journey in Danske Bank
Introduction to fraud
Advanced Analytics Platform for Fraud
Next steps
Q&A
33
Agenda
Big Data team
Big Data journey in Danske Bank
Introduction to fraud
Advanced Analytics Platform for Fraud
Next steps
Q&A
44
Big Data team
Platform
Engineers
Data
Engineers
Data
Scientists
Business
The Data Scientist Venn Diagram (v.2)
The
Perfect
Data
Scientist
The Good
Consultant
Drew
Conway’s
Data
Scientist
R Core
Team
The Data
Nerd
The Hacker
The
Accountant
The IT Guy
The Stats ProfHot Air
Comp Sci
Prof
The
Number
Cruncher
The
Salesperson
Head
of IT
Ana-lyst
Communication
Statistics Programming
Business
66
Agenda
Big Data team
Big Data journey in Danske Bank
Introduction to fraud
Advanced Analytics Platform for Fraud
Next steps
Q&A
77
Core team
 Architects
 Data scientists
How did we start this journey?
 Data engineers
 Platform engineers
Coverage
 Hackathon
 Proof-of-concept
 Roadshows
 Feedback loop
 Partner up
 Business value
 Reference architecture
 Use cases
 Pilot project
 Infrastructure + Data science
laboratory
 Operating model and
organization structure
 Legal and compliance use
cases
 Security
 Data governance
 Automated data science
1–2 month 3–4 month 1 year
Plan
Build
Learn
Adoption lifecycle: Hadoop
 Business
88
Big data Eco system
Big Data
Eco system
Technology
Exploring
process
DataCompliance
Competencies
Benchmark
Governance
99
Agenda
Big Data team
Big Data journey in Danske Bank
Introduction to fraud
Advanced Analytics Platform for Fraud
Next steps
Q&A
1010
• Initial system built many many years
ago
• Was built on a rule based engine with
minimal statistical capabilities
• Very good detection rate, but
generates a lot of false positives
• Very difficult to get an overview of all
rules
• Difficult to do an impact analysis of
changes
• Minimal ”control” in the hands of the
Global Graud Investigation Unit
Introduction to Fraud handling in Danske Bank
1111
”Old” fraud engine: expert-driven approach
Global
Payment
face
eBanking
MobilePa
y
Business
Online
Global
Payment
Interface
Central Fraud Engine
Fraudulent ?
(next slide)
Yes/maybe
Create verification
Manual evaluation
Fraudulent ?
Yes !
Return Payment
to customer
Fraud payment Update
frauddata
NoProcess payment
to beneficiary
No
1212
”Old” fraud engine: expert-driven approach
• How it works:
• Understanding the domain
• Manually generating IF-THEN-ELSE rules based on the domain expertise
• Manually maintaining the rule engine: validation of existing rules, addition of new rules
and/or exceptions
Pros:
• Easy to understand
• Easy to explain
• Easy to develop
Cons:
• Limited abilities to detect complex
relations between the attributes
• Subjective inference of the rules
• Rather long rule detection and
inference time
• Requires manual maintenance
1313
Agenda
Big Data team
Big Data journey in Danske Bank
Introduction to fraud
Advanced Analytics Platform for Fraud
Next steps
Q&A
1414
Advanced Analytics Platform for Fraud - goals
• Develop a scalable and expandable
platform which follows the Danske Bank
blueprint of digitalization
• 100 % data-driven approach to find patterns
in the data and complement the existing
fraud engine
• Use Hadoop to handle the large data
volumes for training models on transaction
data
• Implement a real-time solution that can
score live transactions in under 5.000 ms
• Reduce amount of false-positives by at least
20-40 %
1515
Advanced Platform for Fraud: data-driven approach
Global
Payment
face
eBanking
MobilePa
y
Business
Online
Global
Payment
Interface
Central Fraud Engine
Fraudulent ?
(next slide)
Yes/maybe
Create verification
Manual evaluation
Fraudulent ?
Yes !
Return Payment
to customer
Fraud payment Update
frauddata
NoProcess payment
to beneficiary
No
Basic
customer
data
Advanced Analytics
Platform for Fraud
Historic
transaction
data
Weblog
data
Customer
product
data
Aggregated
customer
data
1616
Advanced Platform for Fraud: data-driven approach
• How it works:
• Understanding the domain
• Gathering and preparing the data
• Automatically generating the rules and recognizing fraudulent patterns by training
models on historical data
• Automatically maintaining the engine by retraining the model
Pros:
• Automatic/data-driven/objective
inference of the rules
• Ability to detect patterns in a high
dimensional data input
• Fast detection of new/changing
fraudulent patterns
Cons:
• Might be unintuitive and hardly
interpretable
• Data preparation and feature
aggregation is time consuming
1717
Expert vs. data-driven approach
Data understanding Pattern definition Rule application Case handling
Data understanding Pattern definition Pattern recognition Case handling
Expert – driven approach
Data - driven approach
1818*Depends a lot on what fraud-appetite GFU has.
Advanced Analytics Platform for Fraud - outcome
Goals
• Scalable and expandable platform
following the Danske Bank blueprint
• 100 % data-driven approach
• Large data-set management on
Hadoop
• Implement a real-time solution that
can score live transactions in under
5.000 ms
• Reduce amount of false-positives
by at least 20-40 %
Accomplished
• Danske Bank blueprint is the spine
of our architecture
• Transaction- and customer data
• Hadoop used for ingesting- and
enrichment of data
• AAPF can deliver a transaction
score in under 300 ms (avg)
• Depending on GFU risk-appetite,
initial results indicate >90 %
decrease in false-positives*
1919
Big Data team – lessons learned
Success: From PowerPoint to production in 12 weeks (8 sprints)
Team effort: Thorough collaboration across IMD, GFU and Think Big
Synergy: Successfully spearheaded innovation in all involved systems
Inspiration: Incorporation Danske Bank advanced analytics blueprint
sets a generic scene for combatting new challenges in advanced
analytics
Agile influence: Without following the agile approach we would not
have been able to deliver in this timeframe (daily standups, Atlassian,
sprints,…)
2020
Agenda
Big Data team
Big Data journey in Danske Bank
Introduction to fraud
Advanced Analytics Platform for Fraud
Next steps
Q&A
2121
Next steps
• Reduce scoring time
• Data quality
• Data enrichment
• Spark for ‘heavy lifting’
Improve
current setup
• All transactions
• All countries
• Credit Cards
• MobilePay
New use cases
• Weblog data
• Behavioural analysis
• Deep Learning
• Predictive Maintenance
Adv. Analytics
2017
2222
Questions

More Related Content

What's hot

Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Precisely
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
DataWorks Summit/Hadoop Summit
 
Data-In-Motion Unleashed
Data-In-Motion UnleashedData-In-Motion Unleashed
Data-In-Motion Unleashed
DataWorks Summit
 
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
DataWorks Summit/Hadoop Summit
 
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
DataWorks Summit
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
Filling the Data Lake
Filling the Data LakeFilling the Data Lake
Filling the Data Lake
DataWorks Summit/Hadoop Summit
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
Hortonworks
 
Rob Bearden Keynote Hadoop Summit San Jose
Rob Bearden Keynote Hadoop Summit San JoseRob Bearden Keynote Hadoop Summit San Jose
Rob Bearden Keynote Hadoop Summit San Jose
DataWorks Summit/Hadoop Summit
 
Hadoop Journey at Walgreens
Hadoop Journey at WalgreensHadoop Journey at Walgreens
Hadoop Journey at Walgreens
DataWorks Summit
 
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
DataWorks Summit/Hadoop Summit
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
DataWorks Summit
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
DataWorks Summit
 
Big Data at your Desk with KNIME
Big Data at your Desk with KNIMEBig Data at your Desk with KNIME
Big Data at your Desk with KNIME
DataWorks Summit/Hadoop Summit
 
Beyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AIBeyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AI
DataWorks Summit
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
DataWorks Summit
 
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern ArchitectureData in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
Mats Johansson
 
Enterprise large scale graph analytics and computing base on distribute graph...
Enterprise large scale graph analytics and computing base on distribute graph...Enterprise large scale graph analytics and computing base on distribute graph...
Enterprise large scale graph analytics and computing base on distribute graph...
DataWorks Summit
 
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaExtend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
DataWorks Summit/Hadoop Summit
 

What's hot (20)

Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
 
Data-In-Motion Unleashed
Data-In-Motion UnleashedData-In-Motion Unleashed
Data-In-Motion Unleashed
 
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
Worldpay - Delivering Multi-Tenancy Applications in A Secure Operational Plat...
 
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
 
Filling the Data Lake
Filling the Data LakeFilling the Data Lake
Filling the Data Lake
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 
Rob Bearden Keynote Hadoop Summit San Jose
Rob Bearden Keynote Hadoop Summit San JoseRob Bearden Keynote Hadoop Summit San Jose
Rob Bearden Keynote Hadoop Summit San Jose
 
Hadoop Journey at Walgreens
Hadoop Journey at WalgreensHadoop Journey at Walgreens
Hadoop Journey at Walgreens
 
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Big Data at your Desk with KNIME
Big Data at your Desk with KNIMEBig Data at your Desk with KNIME
Big Data at your Desk with KNIME
 
Beyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AIBeyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AI
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
 
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern ArchitectureData in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
 
Enterprise large scale graph analytics and computing base on distribute graph...
Enterprise large scale graph analytics and computing base on distribute graph...Enterprise large scale graph analytics and computing base on distribute graph...
Enterprise large scale graph analytics and computing base on distribute graph...
 
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaExtend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
 

Similar to Depositing Value from Transactional Data at Danske Bank

EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5John Nikolaidis
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various types
loginworks software
 
Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligence
Ron Bodkin
 
Next Generation Fraud Solutions using Neo4j
Next Generation Fraud Solutions using Neo4jNext Generation Fraud Solutions using Neo4j
Next Generation Fraud Solutions using Neo4j
Neo4j
 
Strategyzing big data in telco industry
Strategyzing big data in telco industryStrategyzing big data in telco industry
Strategyzing big data in telco industry
Parviz Iskhakov
 
Dataiku tatvic webinar presentation
Dataiku tatvic webinar presentationDataiku tatvic webinar presentation
Dataiku tatvic webinar presentation
Tatvic Analytics
 
Skylads - Big Data for Telcos
Skylads - Big Data for TelcosSkylads - Big Data for Telcos
Skylads - Big Data for Telcos
Xavier Litt
 
Ensighten for waw houston 20150326
Ensighten for waw houston 20150326Ensighten for waw houston 20150326
Ensighten for waw houston 20150326
Kashif Khurshid
 
Artificial Intelligence high ROI case studies from around the world: approach...
Artificial Intelligence high ROI case studies from around the world: approach...Artificial Intelligence high ROI case studies from around the world: approach...
Artificial Intelligence high ROI case studies from around the world: approach...
Data Driven Innovation
 
Deliver solutions cv_vebtech
Deliver solutions cv_vebtechDeliver solutions cv_vebtech
Deliver solutions cv_vebtech
SvetlanaUsikava
 
Essential Prerequisites for Maximizing Success from Big Data
Essential Prerequisites for Maximizing Success from Big DataEssential Prerequisites for Maximizing Success from Big Data
Essential Prerequisites for Maximizing Success from Big Data
Society of Petroleum Engineers
 
GraphDay Stockholm 2015 - Graphs in Telecommunications
GraphDay Stockholm 2015 - Graphs in TelecommunicationsGraphDay Stockholm 2015 - Graphs in Telecommunications
GraphDay Stockholm 2015 - Graphs in Telecommunications
Neo4j
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
StampedeCon
 
Solid Day - IoT en el contexto empresarial
Solid Day - IoT en el contexto empresarialSolid Day - IoT en el contexto empresarial
Solid Day - IoT en el contexto empresarial
Software Guru
 
ITMAGINATION - competences, facts, technologies, clients
ITMAGINATION - competences, facts, technologies, clientsITMAGINATION - competences, facts, technologies, clients
ITMAGINATION - competences, facts, technologies, clients
ITMAGINATION
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Looker
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial Intelligence
Data Science Milan
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014
 
Webinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph DatabasesWebinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph Databases
DataStax
 

Similar to Depositing Value from Transactional Data at Danske Bank (20)

EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5EVAM_Streaming Analytics_v1.5
EVAM_Streaming Analytics_v1.5
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various types
 
Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligence
 
Next Generation Fraud Solutions using Neo4j
Next Generation Fraud Solutions using Neo4jNext Generation Fraud Solutions using Neo4j
Next Generation Fraud Solutions using Neo4j
 
Strategyzing big data in telco industry
Strategyzing big data in telco industryStrategyzing big data in telco industry
Strategyzing big data in telco industry
 
Dataiku tatvic webinar presentation
Dataiku tatvic webinar presentationDataiku tatvic webinar presentation
Dataiku tatvic webinar presentation
 
Skylads - Big Data for Telcos
Skylads - Big Data for TelcosSkylads - Big Data for Telcos
Skylads - Big Data for Telcos
 
Ensighten for waw houston 20150326
Ensighten for waw houston 20150326Ensighten for waw houston 20150326
Ensighten for waw houston 20150326
 
Artificial Intelligence high ROI case studies from around the world: approach...
Artificial Intelligence high ROI case studies from around the world: approach...Artificial Intelligence high ROI case studies from around the world: approach...
Artificial Intelligence high ROI case studies from around the world: approach...
 
Deliver solutions cv_vebtech
Deliver solutions cv_vebtechDeliver solutions cv_vebtech
Deliver solutions cv_vebtech
 
Essential Prerequisites for Maximizing Success from Big Data
Essential Prerequisites for Maximizing Success from Big DataEssential Prerequisites for Maximizing Success from Big Data
Essential Prerequisites for Maximizing Success from Big Data
 
GraphDay Stockholm 2015 - Graphs in Telecommunications
GraphDay Stockholm 2015 - Graphs in TelecommunicationsGraphDay Stockholm 2015 - Graphs in Telecommunications
GraphDay Stockholm 2015 - Graphs in Telecommunications
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
 
Solid Day - IoT en el contexto empresarial
Solid Day - IoT en el contexto empresarialSolid Day - IoT en el contexto empresarial
Solid Day - IoT en el contexto empresarial
 
ITMAGINATION - competences, facts, technologies, clients
ITMAGINATION - competences, facts, technologies, clientsITMAGINATION - competences, facts, technologies, clients
ITMAGINATION - competences, facts, technologies, clients
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
 
Data mining
Data miningData mining
Data mining
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial Intelligence
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
 
Webinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph DatabasesWebinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph Databases
 

More from DataWorks Summit/Hadoop Summit

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
DataWorks Summit/Hadoop Summit
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
DataWorks Summit/Hadoop Summit
 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
DataWorks Summit/Hadoop Summit
 

More from DataWorks Summit/Hadoop Summit (20)

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
 

Recently uploaded

DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 

Recently uploaded (20)

DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 

Depositing Value from Transactional Data at Danske Bank

  • 1. Depositing value from transactional data Advanced Analytics Platform for Fraud Nadeem Gulzar, Senior Development Manager of Advanced Analytics and Architecture, IMD, CBIT
  • 2. 11 • Senior Development Manager • Head of Advanced Analytics and Architecture • IT at Danske Bank − Approx. 2000 people, more than 2000 systems • Education: − M.Sc. from Copenhagen University in Computer Science and Mathematics, with a bachelor in psychology • Employed in Danske Bank Group since: 2003 • Experience Summary: − 15+ years of IT experience, primarily within software development and management Nadeem Gulzar
  • 3. 22 Agenda Big Data team Big Data journey in Danske Bank Introduction to fraud Advanced Analytics Platform for Fraud Next steps Q&A
  • 4. 33 Agenda Big Data team Big Data journey in Danske Bank Introduction to fraud Advanced Analytics Platform for Fraud Next steps Q&A
  • 6. The Data Scientist Venn Diagram (v.2) The Perfect Data Scientist The Good Consultant Drew Conway’s Data Scientist R Core Team The Data Nerd The Hacker The Accountant The IT Guy The Stats ProfHot Air Comp Sci Prof The Number Cruncher The Salesperson Head of IT Ana-lyst Communication Statistics Programming Business
  • 7. 66 Agenda Big Data team Big Data journey in Danske Bank Introduction to fraud Advanced Analytics Platform for Fraud Next steps Q&A
  • 8. 77 Core team  Architects  Data scientists How did we start this journey?  Data engineers  Platform engineers Coverage  Hackathon  Proof-of-concept  Roadshows  Feedback loop  Partner up  Business value  Reference architecture  Use cases  Pilot project  Infrastructure + Data science laboratory  Operating model and organization structure  Legal and compliance use cases  Security  Data governance  Automated data science 1–2 month 3–4 month 1 year Plan Build Learn Adoption lifecycle: Hadoop  Business
  • 9. 88 Big data Eco system Big Data Eco system Technology Exploring process DataCompliance Competencies Benchmark Governance
  • 10. 99 Agenda Big Data team Big Data journey in Danske Bank Introduction to fraud Advanced Analytics Platform for Fraud Next steps Q&A
  • 11. 1010 • Initial system built many many years ago • Was built on a rule based engine with minimal statistical capabilities • Very good detection rate, but generates a lot of false positives • Very difficult to get an overview of all rules • Difficult to do an impact analysis of changes • Minimal ”control” in the hands of the Global Graud Investigation Unit Introduction to Fraud handling in Danske Bank
  • 12. 1111 ”Old” fraud engine: expert-driven approach Global Payment face eBanking MobilePa y Business Online Global Payment Interface Central Fraud Engine Fraudulent ? (next slide) Yes/maybe Create verification Manual evaluation Fraudulent ? Yes ! Return Payment to customer Fraud payment Update frauddata NoProcess payment to beneficiary No
  • 13. 1212 ”Old” fraud engine: expert-driven approach • How it works: • Understanding the domain • Manually generating IF-THEN-ELSE rules based on the domain expertise • Manually maintaining the rule engine: validation of existing rules, addition of new rules and/or exceptions Pros: • Easy to understand • Easy to explain • Easy to develop Cons: • Limited abilities to detect complex relations between the attributes • Subjective inference of the rules • Rather long rule detection and inference time • Requires manual maintenance
  • 14. 1313 Agenda Big Data team Big Data journey in Danske Bank Introduction to fraud Advanced Analytics Platform for Fraud Next steps Q&A
  • 15. 1414 Advanced Analytics Platform for Fraud - goals • Develop a scalable and expandable platform which follows the Danske Bank blueprint of digitalization • 100 % data-driven approach to find patterns in the data and complement the existing fraud engine • Use Hadoop to handle the large data volumes for training models on transaction data • Implement a real-time solution that can score live transactions in under 5.000 ms • Reduce amount of false-positives by at least 20-40 %
  • 16. 1515 Advanced Platform for Fraud: data-driven approach Global Payment face eBanking MobilePa y Business Online Global Payment Interface Central Fraud Engine Fraudulent ? (next slide) Yes/maybe Create verification Manual evaluation Fraudulent ? Yes ! Return Payment to customer Fraud payment Update frauddata NoProcess payment to beneficiary No Basic customer data Advanced Analytics Platform for Fraud Historic transaction data Weblog data Customer product data Aggregated customer data
  • 17. 1616 Advanced Platform for Fraud: data-driven approach • How it works: • Understanding the domain • Gathering and preparing the data • Automatically generating the rules and recognizing fraudulent patterns by training models on historical data • Automatically maintaining the engine by retraining the model Pros: • Automatic/data-driven/objective inference of the rules • Ability to detect patterns in a high dimensional data input • Fast detection of new/changing fraudulent patterns Cons: • Might be unintuitive and hardly interpretable • Data preparation and feature aggregation is time consuming
  • 18. 1717 Expert vs. data-driven approach Data understanding Pattern definition Rule application Case handling Data understanding Pattern definition Pattern recognition Case handling Expert – driven approach Data - driven approach
  • 19. 1818*Depends a lot on what fraud-appetite GFU has. Advanced Analytics Platform for Fraud - outcome Goals • Scalable and expandable platform following the Danske Bank blueprint • 100 % data-driven approach • Large data-set management on Hadoop • Implement a real-time solution that can score live transactions in under 5.000 ms • Reduce amount of false-positives by at least 20-40 % Accomplished • Danske Bank blueprint is the spine of our architecture • Transaction- and customer data • Hadoop used for ingesting- and enrichment of data • AAPF can deliver a transaction score in under 300 ms (avg) • Depending on GFU risk-appetite, initial results indicate >90 % decrease in false-positives*
  • 20. 1919 Big Data team – lessons learned Success: From PowerPoint to production in 12 weeks (8 sprints) Team effort: Thorough collaboration across IMD, GFU and Think Big Synergy: Successfully spearheaded innovation in all involved systems Inspiration: Incorporation Danske Bank advanced analytics blueprint sets a generic scene for combatting new challenges in advanced analytics Agile influence: Without following the agile approach we would not have been able to deliver in this timeframe (daily standups, Atlassian, sprints,…)
  • 21. 2020 Agenda Big Data team Big Data journey in Danske Bank Introduction to fraud Advanced Analytics Platform for Fraud Next steps Q&A
  • 22. 2121 Next steps • Reduce scoring time • Data quality • Data enrichment • Spark for ‘heavy lifting’ Improve current setup • All transactions • All countries • Credit Cards • MobilePay New use cases • Weblog data • Behavioural analysis • Deep Learning • Predictive Maintenance Adv. Analytics 2017

Editor's Notes

  1. Explain different roles and functions
  2. Overview previously used – shows how AAPF contributes to the central fraud engine (‘augments’ it)
  3. We are only modifying the recognition part of the flow, augmenting it with a data-driven approach
  4. Atlassian: JIRA Confluence Stash
  5. Future steps