Copyright © 2019 Oracle and/or its affiliates. All rights reserved. |
Fraud Detection in Financial Services using
Graph Analysis and Machine Learning
Swiss Conference on Data Science 2019
Hans Viehmann
Product Manager EMEA
ORACLE Corporation
June 14, 2019
@SpatialHannes
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for
information purposes only, and may not be incorporated into any contract. It is not a
commitment to deliver any material, code, or functionality, and should not be relied upon
in making purchasing decisions. The development, release, and timing of any features or
functionality described for Oracle’s products remains at the sole discretion of Oracle.
[Figure: Twitter follower graph, source https://twitter.jeffprod.com. Legend: following, no follow back; follower, no follow back; follow each other]
Graph – an important growth area for data & analytics
Gartner identifies Top 10 Data and Analytics
Technology Trends for 2019 / No. 5
• Graph processing and Graph DBMS grow 100% annually through 2022.
• Adoption of graphs is limited by lack of skills in the market.
• Graphs are more practical to answer complex questions on
complex data – SQL is not always efficient.
Graph Analysis for Business Insight
• Identify Influencers
• Discover Graph Patterns in Big Data
• Generate Recommendations
Banco de Galicia
• Customer profitability analysis
– Part of larger Hadoop/Big Data project
• Analysis of banking transactions
– Focus on corporate customers
• Identification of undesired behavioural patterns, e.g.
– Customers using other banks to make large numbers of transactions
– Many of which flow back to Banco Galicia
• Increase fees, terminate contracts, or move activities to Banco Galicia
Paysafe
• Providing online payment solutions
– Real-time payments, e-Wallets
– 1bn revenue/yr
– 500,000 payments/day
• Strong demand for fraud detection
– Only feasible with graph data
– In real-time, upon money movement
– During account creation
– In investigation, visualizing payment flows
• Analysis of payment flows
– Identifying suspicious patterns
Using graph algorithms for initial assessment
Followed by interactive analysis with visualization and PGQL
Moving towards graph analysis with machine learning
• Rule Engine: takes the decision to process or fail a payment
• Graph Query, example: Is there a fraudster within a distance of 3 payments?
• Graph Query, example: Is there a customer linked by password within a distance of 3 payments?
• Machine Learning, example: Pass the fraud probability as a fact to the rule engine
• Components: Graph Database, Machine Learning
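The first graph query above ("is there a fraudster within a distance of 3 payments?") can be sketched as a bounded breadth-first search. This is only an illustration in plain Python, not the PGQL query or rule engine used in the talk. The function name `fraudster_within`, the account names, and the choice to ignore payment direction are all assumptions made for the example.

```python
from collections import deque

def fraudster_within(payments, start, fraudsters, max_hops=3):
    """Return True if a known fraudster is reachable from `start`
    within `max_hops` payments (payment direction is ignored here)."""
    # Build an undirected adjacency map from (payer, payee) pairs.
    neighbours = {}
    for payer, payee in payments:
        neighbours.setdefault(payer, set()).add(payee)
        neighbours.setdefault(payee, set()).add(payer)

    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        account, hops = queue.popleft()
        if account != start and account in fraudsters:
            return True
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for nxt in neighbours.get(account, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, hops + 1))
    return False

# Hypothetical payment chain: alice -> bob -> carol -> dave -> eve
payments = [("alice", "bob"), ("bob", "carol"), ("carol", "dave"), ("dave", "eve")]
print(fraudster_within(payments, "alice", {"dave"}))
print(fraudster_within(payments, "alice", {"eve"}))
```

The boolean result could then be handed to a rule engine as one fact among others.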
Anomaly Detection (without Machine Learning)
• Example: Finding anomalies in healthcare billing data
– Medical providers and their operations
• Providers of the same specialty are close to each other in the graph
– Closely connected by common services
• A provider vertex exceptionally close to vertices of a different specialty should be an anomaly
• Using closeness as a metric
– E.g. hop-distance, ...
[Figure: graph of Doctors (900,000) and HCPCS (6,000), with 9,000,000 edges]
Using Personalized PageRank to find outliers and anomalies
Variant of the PageRank algorithm that requires a set of starting vertices
– Random walks (with restart) from the starting vertices
– Computes a new probability of visiting each vertex in the graph, biased by the vertices in the starting set
– Personalized PageRank score ➔ a natural relative distance (or closeness) with respect to the vertices in the starting set
– Algorithm generates regular PageRank values when the starting set contains all vertices in the graph
[Figure: graph with the starting set of vertices highlighted]
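The random walk with restart described above can be sketched as a power iteration in which every restart jumps back into the starting set. This is a minimal plain-Python illustration, not the implementation used in the talk. The function name, the damping factor of 0.85, the fixed iteration count and the treatment of edges as undirected are assumptions for the example, and the starting set is assumed to contain only vertices that occur in the edge list.

```python
def personalized_pagerank(edges, start_set, damping=0.85, iterations=100):
    """Power-iteration PageRank on an undirected graph.
    Restarts always land on a vertex of `start_set` (uniformly)."""
    neighbours = {}
    for u, v in edges:
        neighbours.setdefault(u, set()).add(v)
        neighbours.setdefault(v, set()).add(u)

    # Restart distribution: uniform over the starting set, zero elsewhere.
    restart = {v: (1.0 / len(start_set) if v in start_set else 0.0)
               for v in neighbours}
    score = dict(restart)
    for _ in range(iterations):
        nxt = {v: (1.0 - damping) * restart[v] for v in neighbours}
        for u, adjacent in neighbours.items():
            share = damping * score[u] / len(adjacent)
            for v in adjacent:
                nxt[v] += share
        score = nxt
    return score

# Hypothetical path graph a - b - c - d
edges = [("a", "b"), ("b", "c"), ("c", "d")]
ppr = personalized_pagerank(edges, {"a"})               # biased towards "a"
pr = personalized_pagerank(edges, {"a", "b", "c", "d"})  # regular PageRank
print(ppr)
print(pr)
```

Passing all vertices as the starting set gives the uniform restart of regular PageRank, which matches the last bullet above.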
Anomaly Detection Procedure
1. Compute PageRank (PR)
2. Select a specialty S (e.g. Optometrist)
3. Find the set of doctors of that specialty (starting set)
4. Compute Personalized PageRank from the starting set (PPR)
5. Mark procedures that have PPR – PR > ɸ as specialty actions
6. For every doctor X of another specialty W who has a high PPR value, mark doctor X as anomalous
– If doctor X indeed performed any of the specialty actions
– And if W does NOT have more than δ% of such doctors
[Figure: graph of Doctors (900,000) and HCPCS (6,000), with 9,000,000 edges, highlighting the same-specialty doctors (starting set), the specialty actions, and an anomalous doctor of another specialty]
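The six steps above can be sketched end to end on a toy billing graph. This is an illustration under stated assumptions, not the code used in the talk. The helper names, the tiny data set and the threshold values are invented for the example, and the slide does not say how "high PPR value" is defined, so the `min_ppr` cut-off is a hypothetical parameter.

```python
def pagerank(edges, start_set, damping=0.85, iterations=100):
    """PageRank with restarts into `start_set` (edges treated as undirected)."""
    neighbours = {}
    for u, v in edges:
        neighbours.setdefault(u, set()).add(v)
        neighbours.setdefault(v, set()).add(u)
    restart = {v: (1.0 / len(start_set) if v in start_set else 0.0)
               for v in neighbours}
    score = dict(restart)
    for _ in range(iterations):
        nxt = {v: (1.0 - damping) * restart[v] for v in neighbours}
        for u, adjacent in neighbours.items():
            share = damping * score[u] / len(adjacent)
            for v in adjacent:
                nxt[v] += share
        score = nxt
    return score

def find_anomalies(billing, specialty_of, target, phi, min_ppr, delta):
    """billing: (doctor, procedure) pairs; specialty_of: doctor -> specialty."""
    vertices = {v for edge in billing for v in edge}
    doctors = vertices & set(specialty_of)
    procedures = vertices - doctors
    performed = {}
    for doctor, procedure in billing:
        performed.setdefault(doctor, set()).add(procedure)

    pr = pagerank(billing, vertices)                            # step 1
    start = {d for d in doctors if specialty_of[d] == target}   # steps 2-3
    ppr = pagerank(billing, start)                              # step 4
    actions = {p for p in procedures if ppr[p] - pr[p] > phi}   # step 5

    # Step 6: doctors of other specialties with a high PPR value who
    # performed a specialty action, grouped by their own specialty W.
    suspects = {}
    for d in doctors - start:
        if ppr[d] > min_ppr and performed.get(d, set()) & actions:
            suspects.setdefault(specialty_of[d], set()).add(d)

    anomalies = set()
    for w, group in suspects.items():
        size = sum(1 for d in doctors if specialty_of[d] == w)
        if 100.0 * len(group) / size <= delta:  # W has at most delta% such doctors
            anomalies |= group
    return anomalies

# Hypothetical data: three optometrists, four cardiologists, and one
# cardiologist (c4) who also bills an eye exam.
billing = [(d, p) for d in ("o1", "o2", "o3") for p in ("eye_exam", "lens_fit")]
billing += [(d, p) for d in ("c1", "c2", "c3", "c4") for p in ("ecg", "stress_test")]
billing.append(("c4", "eye_exam"))
specialty_of = {"o1": "optometry", "o2": "optometry", "o3": "optometry",
                "c1": "cardiology", "c2": "cardiology",
                "c3": "cardiology", "c4": "cardiology"}
print(find_anomalies(billing, specialty_of, "optometry",
                     phi=0.02, min_ppr=0.03, delta=50))
```

The δ check keeps a whole specialty from being flagged when many of its doctors legitimately perform the same procedures.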
Combining Graph Analytics and Machine Learning
Graph Analytics
• Compute graph metric(s)
• Explore graph or compute new metrics using ML result
Machine Learning
• Build predictive model using graph metric
• Build model(s) and score or classify data
Data flow between the two
• Graph Analytics ➔ Machine Learning: add to structured data
• Machine Learning ➔ Graph Analytics: add to graph
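The "add to structured data" direction can be illustrated with a very small example: compute a graph metric per entity and append it as an extra column to the tabular records a model would be trained on. The sketch below uses vertex degree as the metric only because it is the simplest one. The function name, the `id` and `degree` field names and the sample records are assumptions for the example.

```python
def add_degree_feature(rows, edges):
    """Return copies of `rows` with a `degree` column taken from the graph."""
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    # Entities that never appear in the graph get degree 0.
    return [{**row, "degree": degree.get(row["id"], 0)} for row in rows]

# Hypothetical customer table and payment edges
customers = [{"id": "alice", "age": 34}, {"id": "bob", "age": 51}, {"id": "carol", "age": 27}]
payments = [("alice", "bob"), ("alice", "dave"), ("bob", "dave")]
for row in add_degree_feature(customers, payments):
    print(row)
```

A richer metric such as a Personalized PageRank score could be appended in the same way.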
Encoding similarity for use in machine learning
• Graph captures fine-grained relationships between data entities
– As before, closeness can be defined and measured on the graph
– Providing a numeric representation of your data that retains the distance information
[Figure: Raw Data ➔ Graph Representation ➔ Numeric Representation (n-dimensional vector) ➔ ML Model]
x, y: data entities (represented as vertices in the graph)
v(x), v(y): n-dimensional vector representations of x and y
x, y close in graph ➔ ∥ v(x) - v(y) ∥ small in n-dimensional vector space
Encoding similarity for use in machine learning
• Different approaches available
– e.g. exploiting techniques from modern NLP (natural language processing)
• Used Word2Vec in our example
– an ML technique that learns closeness between words from a large number of sentences
• Perform many random walks on the graph
• Apply the W2V technique to the random walk traces, treating vertices as words
[Figure source: KDD '14]
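The random-walk step can be sketched as follows: each walk is a list of vertex names that plays the role of a "sentence". This is a minimal plain-Python illustration, not the code used in the talk. The function name, the walk parameters and the sample enrolment edges are assumptions for the example. The resulting walks would then be passed to a Word2Vec implementation of your choice to obtain the vertex vectors, and that step is not shown here.

```python
import random

def random_walks(edges, walks_per_vertex=10, walk_length=5, seed=0):
    """Generate uniform random walks over an undirected graph.
    Each walk is a list of vertices, i.e. one 'sentence' of vertex 'words'."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    neighbours = {}
    for u, v in edges:
        neighbours.setdefault(u, []).append(v)
        neighbours.setdefault(v, []).append(u)

    walks = []
    for start in sorted(neighbours):
        for _ in range(walks_per_vertex):
            walk = [start]
            while len(walk) < walk_length:
                walk.append(rng.choice(neighbours[walk[-1]]))
            walks.append(walk)
    return walks

# Hypothetical student/course edges
edges = [("s1", "10.003"), ("s1", "10.004"), ("s2", "10.004"), ("s2", "11.103")]
for walk in random_walks(edges, walks_per_vertex=1, walk_length=4):
    print(" ".join(walk))
```

Vertices that co-occur in many walks end up with nearby vectors, which is the closeness property described on the previous slide.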
DeepWalk – Translate graphs to a vector space
Practical example – Student classification
• Can you predict a student’s major or department just by looking at the classmates in the courses that (s)he is taking?
• Very similar to the customer segmentation problem
– Student => Customer
– Course taking => Item or service purchase
– Department => Segment label
[Figure: bipartite graph linking students (departments CS, ME) to courses (10.003, 10.004, 10.005, 11.103, 11.213, 12.118)]
Evaluation – Comparison
1. CNN trained on “standard” features (e.g., student age, courses taken, …)
2. Use PPR and predict the department of the highest-scoring vertex
3. Train a CNN on vertex embeddings extracted with DeepWalk
4. Train a CNN on “standard” features combined with graph embeddings
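Approach 2 can be sketched as follows: run Personalized PageRank from the student in question and return the department of the highest-scoring student whose department is already known. This is an illustration only, not the code used in the talk. Reading "highest-scoring vertex" as "highest-scoring labelled student" is an assumption, as are the function names and the toy enrolment data.

```python
def pagerank(edges, start_set, damping=0.85, iterations=100):
    """PageRank with restarts into `start_set` (edges treated as undirected)."""
    neighbours = {}
    for u, v in edges:
        neighbours.setdefault(u, set()).add(v)
        neighbours.setdefault(v, set()).add(u)
    restart = {v: (1.0 / len(start_set) if v in start_set else 0.0)
               for v in neighbours}
    score = dict(restart)
    for _ in range(iterations):
        nxt = {v: (1.0 - damping) * restart[v] for v in neighbours}
        for u, adjacent in neighbours.items():
            share = damping * score[u] / len(adjacent)
            for v in adjacent:
                nxt[v] += share
        score = nxt
    return score

def predict_department(enrolments, department_of, student):
    """Predict via PPR from `student`: take the department of the
    closest (highest-scoring) student with a known department."""
    scores = pagerank(enrolments, {student})
    labelled = sorted(s for s in department_of if s != student and s in scores)
    best = max(labelled, key=lambda s: scores[s])
    return department_of[best]

# Hypothetical enrolments: (student, course)
enrolments = [
    ("s1", "10.003"), ("s1", "10.004"),
    ("s2", "10.003"), ("s2", "10.004"), ("s2", "11.103"),
    ("s3", "11.103"), ("s3", "11.213"),
    ("s4", "11.103"), ("s4", "11.213"),
    ("new", "10.003"), ("new", "10.004"),
]
department_of = {"s1": "CS", "s2": "CS", "s3": "ME", "s4": "ME"}
print(predict_department(enrolments, department_of, "new"))
```

Approaches 3 and 4 replace this direct lookup with a classifier trained on vertex embeddings.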
Results
• (Result #1) Graph-based prediction gives a better result than naïve application of ML (e.g. CNN) on basic student features (e.g. age, gender, background, …)
• (Result #2) DeepWalk preserves information from the graph representation
• (Result #3) DeepWalk allows graph data to be combined with other features
[Chart comparing: CNN on Original Features; PPR (Graph Algorithm); CNN on Extracted Graph Features (from DeepWalk); CNN on Original + Graph Features]
Summary
Graph capabilities in Oracle Products and Cloud Services
• Graph databases are powerful tools, complementing relational databases
– Especially strong for analysis of graph topology and connectedness
• Graph analytics offer new insight
– Especially relationships, dependencies and behavioural patterns
• Oracle Property Graph technology offers
– Comprehensive analytics through various APIs, integration with relational database
– Scalable, parallel in-memory processing
– Secure and scalable graph storage using Hadoop platform or Oracle Database
• Available both on-premises and in the Cloud today
Appendix: Oracle Graph Analytics Architecture
• Scalable and Persistent Storage
• Graph Storage Management
• Graph Analytics: In-memory Analytic Engine
• Blueprints & SolrCloud / Lucene
• REST Web Service: Python, Perl, PHP, Ruby, Javascript, …
• Java APIs
• Java APIs / JDBC / SQL / PLSQL
• Visualization
• R Integration (OAAgraph)
• Spark integration
Appendix: Graph Cloud Service
Fully managed graph cloud service
• “One-click” deployment: no installation, zero configuration
– Automated failure detection and recovery
• Automated graph modeler
– Easily convert your relational data into property graphs
• Pre-built Algorithms, Flows and SQL-like graph query language
– Java, Groovy
– REST APIs
• Rich User Interface
– Low code / zero code features
– Notebook support and powerful data visualization features
Appendix: More information
• Oracle Property Graph Technologies OTN product page:
https://www.oracle.com/database/technologies/spatialandgraph/property-graph-features.html
– White papers, software downloads, documentation and videos
• Oracle Labs Tutorials
https://docs.oracle.com/cd/E56133_01/latest/tutorials/index.html
• Blog post series on setting up Graph Analysis on Oracle Cloud
https://blogs.oracle.com/oraclespatial/how-to-enable-oracle-database-cloud-service-with-property-graph-capabilities
– Free cloud credits available on http://cloud.oracle.com
• Blog – examples, tips & tricks: blogs.oracle.com/bigdataspatialgraph
• @OracleBigData, @SpatialHannes, @JeanIhm Oracle Spatial and Graph Group
Copyright © 2019 Oracle and/or its affiliates. All rights reserved. | 28
Copyright © 2019 Oracle and/or its affiliates. All rights reserved. |

More Related Content

Similar to Fraud Detection in Financial Services using Graph Analysis and Machine Learning

Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...
Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...
Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...Fwdays
 
An Introduction to Graph: Database, Analytics, and Cloud Services
An Introduction to Graph:  Database, Analytics, and Cloud ServicesAn Introduction to Graph:  Database, Analytics, and Cloud Services
An Introduction to Graph: Database, Analytics, and Cloud ServicesJean Ihm
 
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"jstrobl
 
What's New In Primavera Analytics 16.1
What's New In Primavera Analytics 16.1What's New In Primavera Analytics 16.1
What's New In Primavera Analytics 16.1p6academy
 
Data meets AI - ATP Roadshow India
Data meets AI - ATP Roadshow IndiaData meets AI - ATP Roadshow India
Data meets AI - ATP Roadshow IndiaSandesh Rao
 
Data Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source ToolsData Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source ToolsEsther Vasiete
 
asset_16467169720871528299376665087430.pdf
asset_16467169720871528299376665087430.pdfasset_16467169720871528299376665087430.pdf
asset_16467169720871528299376665087430.pdfAhmedSeid38
 
Domain Specific Languages for Parallel Graph AnalytiX (PGX)
Domain Specific Languages for Parallel Graph AnalytiX (PGX)Domain Specific Languages for Parallel Graph AnalytiX (PGX)
Domain Specific Languages for Parallel Graph AnalytiX (PGX)Eelco Visser
 
Spark-Zeppelin-ML on HWX
Spark-Zeppelin-ML on HWXSpark-Zeppelin-ML on HWX
Spark-Zeppelin-ML on HWXKirk Haslbeck
 
Data meets AI - AICUG - Santa Clara
Data meets AI  - AICUG - Santa ClaraData meets AI  - AICUG - Santa Clara
Data meets AI - AICUG - Santa ClaraSandesh Rao
 
Product Engineer Certified Lean Six Sigma Black Belt by IASSC
Product Engineer Certified Lean Six Sigma Black Belt by IASSCProduct Engineer Certified Lean Six Sigma Black Belt by IASSC
Product Engineer Certified Lean Six Sigma Black Belt by IASSCHAKKACHE Mohamed
 
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock ExchangeIRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock ExchangeIRJET Journal
 
How To Visualize Graphs
How To Visualize GraphsHow To Visualize Graphs
How To Visualize GraphsJean Ihm
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bankChungsik Yun
 
End-to-end Machine Learning Pipelines with HP Vertica and Distributed R
End-to-end Machine Learning Pipelines with HP Vertica and Distributed REnd-to-end Machine Learning Pipelines with HP Vertica and Distributed R
End-to-end Machine Learning Pipelines with HP Vertica and Distributed RJorge Martinez de Salinas
 
Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...Sogeti Nederland B.V.
 
Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016
Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016
Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016Ian Hanou
 
Predicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopPredicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopSkillspeed
 

Similar to Fraud Detection in Financial Services using Graph Analysis and Machine Learning (20)

Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...
Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...
Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...
 
An Introduction to Graph: Database, Analytics, and Cloud Services
An Introduction to Graph:  Database, Analytics, and Cloud ServicesAn Introduction to Graph:  Database, Analytics, and Cloud Services
An Introduction to Graph: Database, Analytics, and Cloud Services
 
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"AGIT 2015  - Hans Viehmann: "Big Data and Smart Cities"
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
 
Primavera Analytics 16.1 is Released - Everything You Need To Know
Primavera Analytics 16.1 is Released - Everything You Need To KnowPrimavera Analytics 16.1 is Released - Everything You Need To Know
Primavera Analytics 16.1 is Released - Everything You Need To Know
 
What's New In Primavera Analytics 16.1
What's New In Primavera Analytics 16.1What's New In Primavera Analytics 16.1
What's New In Primavera Analytics 16.1
 
Data meets AI - ATP Roadshow India
Data meets AI - ATP Roadshow IndiaData meets AI - ATP Roadshow India
Data meets AI - ATP Roadshow India
 
Data Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source ToolsData Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source Tools
 
asset_16467169720871528299376665087430.pdf
asset_16467169720871528299376665087430.pdfasset_16467169720871528299376665087430.pdf
asset_16467169720871528299376665087430.pdf
 
Domain Specific Languages for Parallel Graph AnalytiX (PGX)
Domain Specific Languages for Parallel Graph AnalytiX (PGX)Domain Specific Languages for Parallel Graph AnalytiX (PGX)
Domain Specific Languages for Parallel Graph AnalytiX (PGX)
 
Internship Presentation.pdf
Internship Presentation.pdfInternship Presentation.pdf
Internship Presentation.pdf
 
Spark-Zeppelin-ML on HWX
Spark-Zeppelin-ML on HWXSpark-Zeppelin-ML on HWX
Spark-Zeppelin-ML on HWX
 
Data meets AI - AICUG - Santa Clara
Data meets AI  - AICUG - Santa ClaraData meets AI  - AICUG - Santa Clara
Data meets AI - AICUG - Santa Clara
 
Product Engineer Certified Lean Six Sigma Black Belt by IASSC
Product Engineer Certified Lean Six Sigma Black Belt by IASSCProduct Engineer Certified Lean Six Sigma Black Belt by IASSC
Product Engineer Certified Lean Six Sigma Black Belt by IASSC
 
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock ExchangeIRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
 
How To Visualize Graphs
How To Visualize GraphsHow To Visualize Graphs
How To Visualize Graphs
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bank
 
End-to-end Machine Learning Pipelines with HP Vertica and Distributed R
End-to-end Machine Learning Pipelines with HP Vertica and Distributed REnd-to-end Machine Learning Pipelines with HP Vertica and Distributed R
End-to-end Machine Learning Pipelines with HP Vertica and Distributed R
 
Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...Analyze your application portfolio to know where the quality and risk issues ...
Analyze your application portfolio to know where the quality and risk issues ...
 
Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016
Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016
Tree Plotter - Tree Inventory Software - Updates & Training Webinar - Q4 2016
 
Predicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopPredicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via Hadoop
 

More from Thomas Teske

Artwork by Thomas Teske 2018
Artwork by Thomas Teske 2018Artwork by Thomas Teske 2018
Artwork by Thomas Teske 2018Thomas Teske
 
SOUG Day - autonomous what is next
SOUG Day - autonomous what is nextSOUG Day - autonomous what is next
SOUG Day - autonomous what is nextThomas Teske
 
My artwork created in 2017
My artwork created in 2017My artwork created in 2017
My artwork created in 2017Thomas Teske
 
My artwork created in 2016
My artwork created in 2016My artwork created in 2016
My artwork created in 2016Thomas Teske
 
My artwork created in 2015
My artwork created in 2015My artwork created in 2015
My artwork created in 2015Thomas Teske
 
My artwork created in 2014
My artwork created in 2014My artwork created in 2014
My artwork created in 2014Thomas Teske
 
My artwork created in 2013
My artwork created in 2013My artwork created in 2013
My artwork created in 2013Thomas Teske
 
Oracle 122 partitioning_in_action_slide_share
Oracle 122 partitioning_in_action_slide_shareOracle 122 partitioning_in_action_slide_share
Oracle 122 partitioning_in_action_slide_shareThomas Teske
 

More from Thomas Teske (8)

Artwork by Thomas Teske 2018
Artwork by Thomas Teske 2018Artwork by Thomas Teske 2018
Artwork by Thomas Teske 2018
 
SOUG Day - autonomous what is next
SOUG Day - autonomous what is nextSOUG Day - autonomous what is next
SOUG Day - autonomous what is next
 
My artwork created in 2017
My artwork created in 2017My artwork created in 2017
My artwork created in 2017
 
My artwork created in 2016
My artwork created in 2016My artwork created in 2016
My artwork created in 2016
 
My artwork created in 2015
My artwork created in 2015My artwork created in 2015
My artwork created in 2015
 
My artwork created in 2014
My artwork created in 2014My artwork created in 2014
My artwork created in 2014
 
My artwork created in 2013
My artwork created in 2013My artwork created in 2013
My artwork created in 2013
 
Oracle 122 partitioning_in_action_slide_share
Oracle 122 partitioning_in_action_slide_shareOracle 122 partitioning_in_action_slide_share
Oracle 122 partitioning_in_action_slide_share
 

Recently uploaded

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad EscortsCall girls in Ahmedabad High profile
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 

Recently uploaded (20)

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
(ISHITA) Call Girls Service Hyderabad Call Now 8617697112 Hyderabad Escorts
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 

Fraud Detection in Financial Services using Graph Analysis and Machine Learning

  • 1. Copyright © 2019 Oracle and/or its affiliates. All rights reserved. |
  • 2. Copyright © 2019 Oracle and/or its affiliates. All rights reserved. | Fraud Detection in Financial Services using Graph Analysis and Machine Learning Swiss Conference on Data Science 2019 Hans Viehmann Product Manager EMEA ORACLE Corporation June 14, 2019 @SpatialHannes
  • 3. Copyright © 2019 Oracle and/or its affiliates. All rights reserved. | Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.
  • 4. [Figure: graph of follow relationships, source https://twitter.jeffprod.com. Legend: following, no follow back; follower, no follow back; follow each other.]
  • 5. Graph – an important growth area for data & analytics
      • Gartner identifies Top 10 Data and Analytics Technology Trends for 2019 / No. 5
        – Graph processing and graph DBMS grow 100% annually through 2022.
        – Adoption of graphs is limited by lack of skills in the market.
        – Graphs are more practical to answer complex questions on complex data – SQL is not always efficient.
  • 6. Graph Analysis for Business Insight: identify influencers; discover graph patterns in Big Data; generate recommendations.
  • 7. Banco de Galicia
      • Customer profitability analysis
        – Part of larger Hadoop/Big Data project
      • Analysis of banking transactions
        – Focus on corporate customers
      • Identification of undesired behavioural patterns, e.g.
        – Customers using other banks to make large numbers of transactions
        – Many of which flow back to Banco Galicia
      • Increase fees, terminate contracts, or move activities to Banco Galicia
  • 8. Paysafe
      • Providing online payment solutions
        – Real-time payments, e-Wallets
        – 1bn revenue/yr
        – 500,000 payments/day
      • Strong demand for fraud detection
        – Only feasible with graph data
        – In real-time, upon money movement
        – During account creation
        – In investigation, visualizing payment flows
      • Analysis of payment flows
        – Identifying suspicious patterns
  • 9. Using graph algorithms for initial assessment, followed by interactive analysis with visualization and PGQL.
  • 10. Moving towards graph analysis with machine learning
      • Rule engine: decides whether to process or fail a payment
      • Graph database, graph query examples
        – Is there a fraudster within a distance of 3 payments?
        – Is there a customer linked by a shared password within a distance of 3 payments?
      • Machine learning, example
        – Pass the fraud probability as a fact to the rule engine
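
The first graph query on slide 10 ("is there a fraudster within 3 payments distance?") can be illustrated with a bounded breadth-first search. This is only a plain-Python stand-in for a graph query, not the PGQL used in the talk; the account names, the `payments` adjacency dict and the `fraudster_within` helper are all hypothetical.

```python
from collections import deque


def fraudster_within(graph, start, fraudsters, max_hops=3):
    """Return True if a known fraudster other than `start` is reachable
    from `start` over at most `max_hops` payment edges."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if node in fraudsters and node != start:
            return True
        if dist == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbour in graph.get(node, ()):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return False


# Hypothetical toy data: account -> accounts it exchanged payments with.
payments = {
    "alice": ["bob"],
    "bob": ["alice", "carol"],
    "carol": ["bob", "dave"],
    "dave": ["carol", "mallory"],
    "mallory": ["dave"],
}
known_fraudsters = {"mallory"}

print(fraudster_within(payments, "bob", known_fraudsters))    # True  (3 hops)
print(fraudster_within(payments, "alice", known_fraudsters))  # False (4 hops)
```

The boolean result is the kind of fact that could be handed to the rule engine alongside a model's fraud probability.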
  • 11. Anomaly Detection (without Machine Learning)
      • Example: finding anomalies in healthcare billing data
        – Medical providers and their operations
      • Providers of the same specialty are close to each other in the graph
        – Closely connected by common services
      • A provider vertex exceptionally close to vertices of a different specialty should be an anomaly
      • Using closeness as a metric
        – E.g. hop distance, ...
      [Figure: graph of 900,000 doctors and 6,000 HCPCS codes with 9,000,000 edges; doctor X highlighted]
  • 12. Using Personalized PageRank to find outliers and anomalies
      • Variant of the PageRank algorithm that requires a set of starting vertices
        – Random walks (with restart) from the starting vertices
        – Computes a new probability of visiting each vertex in the graph, biased by the vertices in the starting set
        – Personalized PageRank score ➔ a natural relative distance (or closeness) with respect to the vertices of the starting set
        – Algorithm generates regular PageRank values when the starting set contains all vertices in the graph
      [Figure: graph with highlighted starting set of vertices]
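
The random walk with restart on slide 12 can be approximated by power iteration. The following is a minimal sketch, not the implementation used in the talk: the damping factor of 0.85, the iteration count, the handling of vertices without out-edges and the toy path graph are all assumptions.

```python
def personalized_pagerank(graph, start_set, damping=0.85, iterations=100):
    """Power iteration for PageRank whose restarts go to `start_set`.

    graph: dict mapping every vertex to a list of its out-neighbours.
    start_set: non-empty collection of vertices the walk restarts from.
    """
    vertices = list(graph)
    restart = {v: (1.0 / len(start_set) if v in start_set else 0.0)
               for v in vertices}
    rank = dict(restart)
    for _ in range(iterations):
        # With probability (1 - damping) the walk jumps back to the start set.
        nxt = {v: (1.0 - damping) * restart[v] for v in vertices}
        for v in vertices:
            out = graph[v]
            if out:
                share = damping * rank[v] / len(out)
                for w in out:
                    nxt[w] += share
            else:
                # Assumption: a vertex without out-edges restarts the walk.
                for w in vertices:
                    nxt[w] += damping * rank[v] * restart[w]
        rank = nxt
    return rank


# Hypothetical toy graph: an undirected path a - b - c - d - e.
path = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"],
        "d": ["c", "e"], "e": ["d"]}

ppr_scores = personalized_pagerank(path, {"a"})       # biased towards "a"
pr_scores = personalized_pagerank(path, set(path))    # regular PageRank

for v in path:
    print(v, round(ppr_scores[v], 4), round(pr_scores[v], 4))
```

Passing all vertices as the starting set makes the restart distribution uniform, which is the regular PageRank case mentioned on the slide.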
  • 13. Anomaly Detection Procedure
      1. Compute PageRank (PR)
      2. Select a specialty S (e.g. Optometrist)
      3. Find the set of doctors of that specialty (starting set)
      4. Compute Personalized PageRank from the starting set (PPR)
      5. Mark procedures that have PPR − PR > φ as specialty actions
      6. For every doctor X of another specialty W who has a high PPR value, mark doctor X as anomalous
        – If doctor X indeed performed any of the specialty actions
        – And if W does NOT have more than δ% of such doctors
      [Figure: graph of 900,000 doctors and 6,000 HCPCS codes with 9,000,000 edges; highlighted: same specialty (starting set), specialty actions, anomalous doctor X (other specialty)]
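
The six steps on slide 13 can be sketched end to end on a toy billing graph. Everything concrete here is an assumption made for illustration: the billing data, the threshold values, the `high_ppr` cut-off (the slide only says "high PPR value"), and the reading of "such doctors" as doctors of W that pass both the PPR and the specialty-action checks.

```python
from collections import defaultdict


def ppr(adj, start, damping=0.85, iters=100):
    """Personalized PageRank by power iteration on an undirected graph
    in which every vertex has at least one neighbour."""
    restart = {v: (1.0 / len(start) if v in start else 0.0) for v in adj}
    rank = dict(restart)
    for _ in range(iters):
        nxt = {v: (1.0 - damping) * restart[v] for v in adj}
        for v, nbrs in adj.items():
            for w in nbrs:
                nxt[w] += damping * rank[v] / len(nbrs)
        rank = nxt
    return rank


def find_anomalies(billing, specialty_of, target, phi, high_ppr, delta):
    # Build the doctor <-> procedure graph from the billing records.
    adj = defaultdict(set)
    for doctor, procedures in billing.items():
        for proc in procedures:
            adj[doctor].add(proc)
            adj[proc].add(doctor)
    adj = dict(adj)

    pr = ppr(adj, set(adj))                                      # step 1
    start = {d for d in billing if specialty_of[d] == target}    # steps 2-3
    scores = ppr(adj, start)                                     # step 4
    procedures = set(adj) - set(billing)
    actions = {p for p in procedures if scores[p] - pr[p] > phi}  # step 5

    # Step 6: group candidate doctors by their own specialty W.
    candidates, size = defaultdict(list), defaultdict(int)
    for d in billing:
        w = specialty_of[d]
        if w == target:
            continue
        size[w] += 1
        if scores.get(d, 0.0) > high_ppr and actions & set(billing[d]):
            candidates[w].append(d)
    anomalies = []
    for w, docs in candidates.items():
        if 100.0 * len(docs) / size[w] <= delta:  # W rarely does this
            anomalies.extend(docs)
    return sorted(anomalies)


# Hypothetical data: c4 is a cardiologist who also bills eye procedures.
billing = {
    "o1": ["eye_exam", "lens_fit"], "o2": ["eye_exam", "lens_fit"],
    "o3": ["eye_exam", "lens_fit"],
    "c1": ["ecg", "stress_test"], "c2": ["ecg", "stress_test"],
    "c3": ["ecg", "stress_test"],
    "c4": ["ecg", "stress_test", "eye_exam", "lens_fit"],
}
specialty_of = {d: ("optometry" if d.startswith("o") else "cardiology")
                for d in billing}

print(find_anomalies(billing, specialty_of, "optometry",
                     phi=0.02, high_ppr=0.05, delta=30))
```

The δ check is what keeps a whole specialty that legitimately shares procedures with S from being flagged: lowering `delta` below 25 in this toy data suppresses the single cardiologist.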
  • 14. Combining Graph Analytics and Machine Learning
      • Graph Analytics
        – Compute graph metric(s)
        – Explore graph or compute new metrics using ML result
      • Machine Learning
        – Build predictive model using graph metric
        – Build model(s) and score or classify data
      • Arrows between the two boxes: "Add to structured data" and "Add to graph"
  • 15. Encoding similarity for use in machine learning
      • Graph captures fine-grained relationships between data entities
        – As before, closeness can be defined and measured on the graph
        – Providing a numeric representation of your data that retains the distance information
      • Pipeline: raw data → graph representation → numeric representation (n-dimensional vector) → ML model
      • x, y: data entities (represented as vertices in the graph)
      • v(x), v(y): n-dimensional vector representations of x and y
      • x, y close in graph ➔ ∥v(x) − v(y)∥ small in n-dimensional vector space
  • 16. Encoding similarity for use in machine learning
      • Different approaches available
        – e.g. exploiting techniques from modern NLP (natural language processing)
      • Used Word2Vec in our example
        – An ML technique that learns closeness between words from a large number of sentences
      • Perform many random walks on the graph
      • Apply the W2V technique on random walk traces, treating vertices as words (reference on slide: KDD ’14)
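
The random-walk step on slide 16 can be sketched as follows. The code only generates the walk traces (the "sentences"); walk length, walks per vertex, the seed and the toy graph are assumptions, and the Word2Vec training itself is left out so that the example needs nothing beyond the standard library.

```python
import random


def random_walks(adj, walks_per_vertex=10, walk_length=8, seed=0):
    """Generate uniform random walks; each walk is a list of vertices
    that can be treated like a sentence of words."""
    rng = random.Random(seed)
    walks = []
    for _ in range(walks_per_vertex):
        vertices = list(adj)
        rng.shuffle(vertices)  # visit start vertices in random order
        for start in vertices:
            walk = [start]
            while len(walk) < walk_length:
                neighbours = adj[walk[-1]]
                if not neighbours:
                    break  # dead end: stop this walk early
                walk.append(rng.choice(neighbours))
            walks.append(walk)
    return walks


# Hypothetical toy graph: two triangles joined by the edge c - d.
toy = {
    "a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"],
    "d": ["c", "e", "f"], "e": ["d", "f"], "f": ["d", "e"],
}

walks = random_walks(toy)
print(len(walks), "walks, first one:", walks[0])
```

The resulting list of token lists is the input shape that Word2Vec implementations such as gensim's `Word2Vec` expect as sentences; vertices that co-occur often in walks should then end up close together in the learned vector space.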
  • 17. DeepWalk – Translate graphs to a vector space
  • 18. Practical example – Student classification
      • Can you predict a student’s major or department just by looking at the classmates in the courses that (s)he is taking?
      • Very similar to the customer segmentation problem
        – Student => Customer
        – Course taking => Item or service purchase
        – Department => Segment label
      [Figure: students (departments CS, ME) linked to courses 10.003, 10.004, 10.005, 11.103, 11.213, 12.118]
  • 19. Evaluation – Comparison
      1. CNN trained on “standard” features (e.g., student age, courses taken, …)
      2. Use PPR and predict the department of the highest-scoring vertex
      3. Train a CNN on vertex embeddings extracted with DeepWalk
      4. Add “standard” features beside graph embeddings
  • 20. Results
      • (Result #1) Graph-based prediction gives better results than naïve application of ML (e.g. CNN) on basic student features (e.g. age, gender, background, …)
      • (Result #2) DeepWalk preserves information from the graph representation
      • (Result #3) DeepWalk allows graph data to be combined with other features
      [Chart comparing: CNN on original features; PPR (graph algorithm); CNN on extracted graph features (from DeepWalk); CNN on original + graph features]
  • 21. Summary: Graph capabilities in Oracle Products and Cloud Services
      • Graph databases are powerful tools, complementing relational databases
        – Especially strong for analysis of graph topology and connectedness
      • Graph analytics offer new insight
        – Especially relationships, dependencies and behavioural patterns
      • Oracle Property Graph technology offers
        – Comprehensive analytics through various APIs, integration with relational database
        – Scalable, parallel in-memory processing
        – Secure and scalable graph storage using Hadoop platform or Oracle Database
      • Available today both on premises and in the Cloud
  • 22. Appendix: Oracle Graph Analytics Architecture
      [Architecture diagram. Layers: Scalable and Persistent Storage; Graph Storage Management; Graph Analytics (In-memory Analytic Engine). Interfaces and integrations: Blueprints & SolrCloud / Lucene; REST Web Service (Python, Perl, PHP, Ruby, JavaScript, …); Java APIs; Java APIs/JDBC/SQL/PLSQL; Visualization; R integration (OAAgraph); Spark integration]
  • 23. Appendix: Graph Cloud Service – fully managed graph cloud service
      • “One-click” deployment: no installation, zero configuration
        – Automated failure detection and recovery
      • Automated graph modeler
        – Easily convert your relational data into property graphs
      • Pre-built algorithms, flows and SQL-like graph query language
        – Java, Groovy
        – REST APIs
      • Rich user interface
        – Low code / zero code features
        – Notebook support and powerful data visualization features
  • 24. Appendix: More information
      • Oracle Property Graph Technologies OTN product page: https://www.oracle.com/database/technologies/spatialandgraph/property-graph-features.html
        – White papers, software downloads, documentation and videos
      • Oracle Labs tutorials: https://docs.oracle.com/cd/E56133_01/latest/tutorials/index.html
      • Blog post series on setting up Graph Analysis on Oracle Cloud: https://blogs.oracle.com/oraclespatial/how-to-enable-oracle-database-cloud-service-with-property-graph-capabilities
        – Free cloud credits available on http://cloud.oracle.com
      • Blog – examples, tips & tricks: blogs.oracle.com/bigdataspatialgraph
      • @OracleBigData, @SpatialHannes, @JeanIhm
      • Oracle Spatial and Graph Group