SlideShare a Scribd company logo
1 of 22
Fraudsters are smart, “Frank” is Smarter

               - Fareed and Vivek




28/01/13                            1
Outline
Why detect fraud – Is there a problem?
Why an intelligent system?
How we built one




                                          2 of 22
Show me some numbers
What was the value of all electronic
 transactions globally for year 2012?
$17 trillion (with a T)
This includes all credit, debit and pre-paid
 cards used in both online and offline (card
 present) scenarios for purchases and cash
 withdrawals
More Numbers
How much of $17T was lost due to FRAUD?
$8 billion in 2012, > $10 billion by 2015
Fraud rate of 0.05% – Not too bad right?
Wrong !!
Getting specific
Reminder - 0.05% ratio is for all transactions
 including face to face transactions
The fraud rate is a much more scary 3.5% for
 Online transactions aka CNP
Global e-Commerce is expected to exceed $1T in
 2013 –> $3.5B will be lost due to fraud
Add to this, the erosion due to loss of future
 business from impacted customers
Big Customer Impact ! Big deal for us !!
The Big Fight...
Fraud to transaction ratio has been constant over
 the past 10 years
This ratio should not lull us into a false sense of
 security – bigger numbers are at stake and
 increasing as volumes grow
The crooks LOVE e-Commerce (think 3.5%)
How do we then figure out if a transaction is
 genuine or a victim of fraud
Intelligently of course! - ENTER FRANK !!
Why Frank?
Fraud Detection System
Two parts
   Signals/Features
   Algorithm




                                 8 of 22
Rule based system
Rules on various signals
   Num of transaction from a card in last one day
   Transaction amount
   and many more
Thresholds are hand crafted
Fraud Score = sum of individual scores



                                                     9 of 22
Need for Smarter system

    Too much data for manual analysis

    Businesses are evolving

    Fraudsters are evolving

    Extending to really high dimension – pushing
    beyond limits of rule based system




                                           10 of 22
Designing Frank
Labeled data missing
Observation
   Very few fraud records
   When you see one, you can identify one
Social behavior




                                             11 of 22
Visualization



 Total
Amount




         Number of transactions in a day by a user



                                                     12 of 22
Visualization




                13 of 22
Clustering – Centroid based




                              14 of 22
Clustering - Distance based




                              15 of 22
Clustering - Distribution based




                                  16 of 22
Clustering – Density based




                             17 of 22
Density based clustering
Epsilon
Min-pts
                                      p
Statistical distance
                                 p1
   Scale invariant          q
   Correlation taken into
    account




                                          18 of 22
Clustering for detecting fraud
Cluster the data using density based clustering
For new point find distance to all the existing
 clusters
If there exists min-pts with epsilon dist in a
 cluster, new point belongs to this cluster
If doesn't belong to any cluster -> fraud


                                             19 of 22
Computing fraud probability
We find nearest cluster
Convert the distance to probability
   using chi-square distribution
Probability of fraud between 0 and 1




                                        20 of 22
Execution
Distributed clustering
Real-time model updating
< 20ms to compute fraud probability
Suspend the payment authorization in real
 time




                                       21 of 22
We Frank, You Shop.




                      22 of 22

More Related Content

Similar to Slash n: Technical Session 7 - Fraudsters are smart, Frank is smarter - Vivek Mehta, Fareed Jawad

1609 Fraud Data Science
1609 Fraud Data Science1609 Fraud Data Science
1609 Fraud Data Science
Alejandro Correa Bahnsen, PhD
 
credit card fraud analysis using predictive modeling python project abstract
credit card fraud analysis using predictive modeling python project abstractcredit card fraud analysis using predictive modeling python project abstract
credit card fraud analysis using predictive modeling python project abstract
Venkat Projects
 
Fraud Detection with Cost-Sensitive Predictive Analytics
Fraud Detection with Cost-Sensitive Predictive AnalyticsFraud Detection with Cost-Sensitive Predictive Analytics
Fraud Detection with Cost-Sensitive Predictive Analytics
Alejandro Correa Bahnsen, PhD
 

Similar to Slash n: Technical Session 7 - Fraudsters are smart, Frank is smarter - Vivek Mehta, Fareed Jawad (20)

1609 Fraud Data Science
1609 Fraud Data Science1609 Fraud Data Science
1609 Fraud Data Science
 
Online Transaction Fraud Detection using Hidden Markov Model & Behavior Analysis
Online Transaction Fraud Detection using Hidden Markov Model & Behavior AnalysisOnline Transaction Fraud Detection using Hidden Markov Model & Behavior Analysis
Online Transaction Fraud Detection using Hidden Markov Model & Behavior Analysis
 
credit card fraud analysis using predictive modeling python project abstract
credit card fraud analysis using predictive modeling python project abstractcredit card fraud analysis using predictive modeling python project abstract
credit card fraud analysis using predictive modeling python project abstract
 
Fraud Analytics
Fraud AnalyticsFraud Analytics
Fraud Analytics
 
Fraud Detection with Cost-Sensitive Predictive Analytics
Fraud Detection with Cost-Sensitive Predictive AnalyticsFraud Detection with Cost-Sensitive Predictive Analytics
Fraud Detection with Cost-Sensitive Predictive Analytics
 
Challenge Accepted
Challenge AcceptedChallenge Accepted
Challenge Accepted
 
Liubomyr Bregman "Financial Crime Detection using Advanced Analytics"
Liubomyr Bregman  "Financial Crime Detection using Advanced Analytics"Liubomyr Bregman  "Financial Crime Detection using Advanced Analytics"
Liubomyr Bregman "Financial Crime Detection using Advanced Analytics"
 
Ai and machine learning help detect, predict and prevent fraud - IBM Watson ...
Ai and machine learning help detect, predict and prevent fraud -  IBM Watson ...Ai and machine learning help detect, predict and prevent fraud -  IBM Watson ...
Ai and machine learning help detect, predict and prevent fraud - IBM Watson ...
 
An Identification and Detection of Fraudulence in Credit Card Fraud Transacti...
An Identification and Detection of Fraudulence in Credit Card Fraud Transacti...An Identification and Detection of Fraudulence in Credit Card Fraud Transacti...
An Identification and Detection of Fraudulence in Credit Card Fraud Transacti...
 
A Practical Guide to Post-EMV Card Not Present Fraud
A Practical Guide to Post-EMV Card Not Present FraudA Practical Guide to Post-EMV Card Not Present Fraud
A Practical Guide to Post-EMV Card Not Present Fraud
 
Ali AlMeshal - The need for a secure & trusted payment - ArmIGF 2015
Ali AlMeshal - The need for a secure & trusted payment - ArmIGF 2015Ali AlMeshal - The need for a secure & trusted payment - ArmIGF 2015
Ali AlMeshal - The need for a secure & trusted payment - ArmIGF 2015
 
Krupin kirill (fraud) research proposal
Krupin kirill (fraud) research proposalKrupin kirill (fraud) research proposal
Krupin kirill (fraud) research proposal
 
IRJET- Credit Card Fraud Detection Analysis
IRJET- Credit Card Fraud Detection AnalysisIRJET- Credit Card Fraud Detection Analysis
IRJET- Credit Card Fraud Detection Analysis
 
RIPE 83: How much 'bad traffic' should I be seeing from each economy?
RIPE 83: How much 'bad traffic' should I be seeing from each economy?RIPE 83: How much 'bad traffic' should I be seeing from each economy?
RIPE 83: How much 'bad traffic' should I be seeing from each economy?
 
Credit Card Fraud Detection
Credit Card Fraud DetectionCredit Card Fraud Detection
Credit Card Fraud Detection
 
A Novel Framework for Credit Card.
A Novel Framework for Credit Card.A Novel Framework for Credit Card.
A Novel Framework for Credit Card.
 
Vol 17 No 2 - July-December 2017
Vol 17 No 2 - July-December 2017Vol 17 No 2 - July-December 2017
Vol 17 No 2 - July-December 2017
 
Modern Data Science
Modern Data ScienceModern Data Science
Modern Data Science
 
Fraud Prevention Strategies to Fight First-Party Fraud and Synthetic Identity...
Fraud Prevention Strategies to Fight First-Party Fraud and Synthetic Identity...Fraud Prevention Strategies to Fight First-Party Fraud and Synthetic Identity...
Fraud Prevention Strategies to Fight First-Party Fraud and Synthetic Identity...
 
Data Science: De la Matemática a la Práctica
Data Science: De la Matemática a la PrácticaData Science: De la Matemática a la Práctica
Data Science: De la Matemática a la Práctica
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

Slash n: Technical Session 7 - Fraudsters are smart, Frank is smarter - Vivek Mehta, Fareed Jawad

  • 1. Fraudsters are smart, “Frank” is Smarter - Fareed and Vivek 28/01/13 1
  • 2. Outline Why detect fraud – Is there a problem? Why an intelligent system? How we built one 2 of 22
  • 3. Show me some numbers What was the value of all electronic transactions globally for year 2012? $17 trillion (with a T) This includes all credit, debit and pre-paid cards used in both online and offline (card present) scenarios for purchases and cash withdrawals
  • 4. More Numbers How much of $17T was lost due to FRAUD? $8 billion in 2012, > $10 billion by 2015 Fraud rate of 0.05% – Not too bad right? Wrong !!
  • 5. Getting specific Reminder - 0.05% ratio is for all transactions including face to face transactions The fraud rate is a much more scary 3.5% for Online transactions aka CNP Global e-Commerce is expected to exceed $1T in 2013 –> $3.5B will be lost due to fraud Add to this, the erosion due to loss of future business from impacted customers Big Customer Impact ! Big deal for us !!
  • 6. The Big Fight... Fraud to transaction ratio has been constant over the past 10 years This ratio should not lull us into a false sense of security – bigger numbers are at stake and increasing as volumes grow The crooks LOVE e-Commerce (think 3.5%) How do we then figure out if a transaction is genuine or a victim of fraud Intelligently of course! - ENTER FRANK !!
  • 8. Fraud Detection System Two parts  Signals/Features  Algorithm 8 of 22
  • 9. Rule based system Rules on various signals  Num of transaction from a card in last one day  Transaction amount  and many more Thresholds are hand crafted Fraud Score = sum of individual scores 9 of 22
  • 10. Need for Smarter system  Too much data for manual analysis  Businesses are evolving  Fraudsters are evolving  Extending to really high dimension – pushing beyond limits of rule based system 10 of 22
  • 11. Designing Frank Labeled data missing Observation  Very few fraud records  When you see one, you can identify one Social behavior 11 of 22
  • 12. Visualization Total Amount Number of transactions in a day by a user 12 of 22
  • 13. Visualization 13 of 22
  • 14. Clustering – Centroid based 14 of 22
  • 15. Clustering - Distance based 15 of 22
  • 16. Clustering - Distribution based 16 of 22
  • 17. Clustering – Density based 17 of 22
  • 18. Density based clustering Epsilon Min-pts p Statistical distance p1  Scale invariant q  Correlation taken into account 18 of 22
  • 19. Clustering for detecting fraud Cluster the data using density based clustering For new point find distance to all the existing clusters If there exists min-pts with epsilon dist in a cluster, new point belongs to this cluster If doesn't belong to any cluster -> fraud 19 of 22
  • 20. Computing fraud probability We find nearest cluster Convert the distance to probability  using chi-square distribution Probability of fraud between 0 and 1 20 of 22
  • 21. Execution Distributed clustering Real-time model updating < 20ms to compute fraud probability Suspend the payment authorization in real time 21 of 22
  • 22. We Frank, You Shop. 22 of 22

Editor's Notes

  1. Frank Abagnale Jr. First fraud before age of 16 Bank fraud, airline pilot, doctor and so on. 7-8 identities, fraud in 16 countries and got away from police custody twice, once from airplane. All this before age of 21. He was jailed for 5 years and helped FBI and then started his own company to help fraud prevention.