SlideShare a Scribd company logo
1 of 8
Advanced Analytics: Fraud Detection
System
Conceptual Design Of
A Fraud Detection System




                           2
Fraud Detection System- What is It?

•   Fraud Detection System improves the productivity of claims (analyst) department
    to detect fraud
     – Higher detection with lower human effort
•   Some challenges for an insurance agency
     –   Fraud cases are not labeled and often unknown i.e. not self revealing
     –   Patterns of fraud change frequently. Old fraud patterns might not continue.
     –   Cases occur with relative rarity
     –   Fewer cases of fraud across small data set (base rate and sample size problems)
•   Given the challenges
     – Off-the-shelf products might not work effectively. They might yield a one time
       performance gain and level off afterwards
     – Need a hand crafted solution that matures over time to fit a specific insurance agency’s
       business lines
•   Long Term view of Fraud Detection System
     – Instead of one-time quick performance gain, Deep Blue proposes a long term view for
       fraud detection in which we continuously label the new cases with the help of Analysts
       and improve the coverage of fraud cases.

                                                                                                  3
Our Methodology For Chartis Context
•   Bootstrap the knowledge base of fraud detection
     – Work with existing fraud analyst team/experts to construct criterion for fraud cases
     – Anomaly detection by deep analysis of available data and features generates a large
       number of hypothesis to locate potential fraud cases
          • This is done by detecting anomalies across various hierarchies (providers, claimants,
            geographies, etc.) and across features within hierarchies
     – This leads to a simple system which flags cases for labeling


•   Deploy Machine Learning to analyze labeled cases and construct robust fraud
    prediction models
     – Adapt the algorithms to changing patterns in the fraud by periodic rebuilding
     – Continuously force the fraud prediction models to explore other features (attributes) as
       potential lead indicators of fraud. Expand types of fraud that are uncovered.


•   Make continuous effort to improve the quality of fraud detection case data
     – In bootstrapped system, cases flagged for review may not have a prediction (i.e.
       fraud/not and “case of interest” or not) due to lack of labeled data
     – Active learning uses subsequently labeled cases to enable prediction of (1) Fraud; and
       (2) Cases of Interest
                                                                                                    4
Fraud Detection System (FDS) – Boot Strapping



                                              Fraud Analysts   Labeled Cases
                                                 (Human)                       Labeled Case database




     Anomaly              Expert System
     Detection             - Boot Strapping
- Distribution Analysis    Knowledge base
   - Feature Analysis




                                                                                                       5
FDS – Expansion and Adaption


                 Expert System
                Decommissioned                                                 Labeled Cases
                                                        Fraud Analysts
                                                           (Human)                                 Labeled Case database



                                           Cases to                      Fraud And/Or
                                           Evaluate                      Case Of Interest


     Anomaly                                                      Fraud Detection Engine
     Detection                         Active Learner                -ML Predictive Models
- Distribution Analysis    Ongoing                                    - Rankings & Voting
   - Feature Analysis                                                  - Adapting Models                       Case Stream
                          Monitoring




                                                        Not Classifiable

                                                                            Not Fraud and
                                                                            Not Case Of Interest




                                                                                                                             6
Key Strengths Of Proposed Design

•   Incremental design which produces incremental benefits at each step
•   Extremely adaptable to changing patterns of fraud
•   Modular in design and highly reusable across business lines
     – A minor customization is necessary to adapt to specific business lines
•   Low risk investment approach
     – To improve data collection and knowledge repository around fraud detection
     – To develop analytical infrastructure that creates fraud detection capabilities inside
       Chartis
•   Ability to apply the best of breed techniques and latest research advancements in
    fraud detection
     – Packaged products often lag cutting edge modeling advancements by a few years




                                                                                               7
Sample Application - Auto Insurance Claim Fraud


                                                                                                                                   Labeled Claims
                                                                                                                      1) Bumper, injury, NY, …      F, COI*
                                                              Fraud Analysts                Labeled Cases
                                                                                                                      2) Side collision, dent, OH,..NF, NCOI
                                                                 (Human)                                              3) flooding, radiator, AZ,.. NF, COI


                                                  Cases to                          Fraud And/Or
                                                  Evaluate                          Case Of Interest
                                                                                                                                        Input for
                                                                             Fraud Detection Engine                                     Model Building
        Anomaly                              Active Learner                  - Some Techniques: Logistic
        Detection                                                            Regression, Neural Networks,
                                             -Active clustering              Decision Trees , Random Forest
   - Distribution Analysis       Ongoing
      - Feature Analysis                       - Fuzzy claims                - If trees are used, a potential rule:                 Case Stream
                                Monitoring                                   If ( zip = 10063 && type = bumper
                                                                             && time_of_incident < 6 AM ) =>
                                                                             COI

       Input Data:
 (1) Accident Characteristics                                     Not Classifiable
(2) Claimant Characteristics
(3) Insured Characteristics                                                              Not Fraud and
(4) Injury Characteristics                                                               Not Case Of Interest
(5) Treatment




                                                                  * F- Fraud, NF- No Fraud, COI- Case of Interest, NCOI- No Case of Interest

                                                                                                                                                       8

More Related Content

Viewers also liked

Can Stem Cells Aid in Finding a Cure for Huntington's Disease?
Can Stem Cells Aid in Finding a Cure for Huntington's Disease?Can Stem Cells Aid in Finding a Cure for Huntington's Disease?
Can Stem Cells Aid in Finding a Cure for Huntington's Disease?Anton van Dellen
 
SKYLER WILLIAMS REVISED RESUME' PDF
SKYLER WILLIAMS REVISED RESUME' PDFSKYLER WILLIAMS REVISED RESUME' PDF
SKYLER WILLIAMS REVISED RESUME' PDFSkyler Williams
 
Il treno-della-vita
Il treno-della-vitaIl treno-della-vita
Il treno-della-vitaTina Nipco
 
Il treno-della-vita
Il treno-della-vitaIl treno-della-vita
Il treno-della-vitaTina Nipco
 
Il treno-della-vita
Il treno-della-vitaIl treno-della-vita
Il treno-della-vitaTina Nipco
 

Viewers also liked (10)

Israel
IsraelIsrael
Israel
 
Can Stem Cells Aid in Finding a Cure for Huntington's Disease?
Can Stem Cells Aid in Finding a Cure for Huntington's Disease?Can Stem Cells Aid in Finding a Cure for Huntington's Disease?
Can Stem Cells Aid in Finding a Cure for Huntington's Disease?
 
SKYLER WILLIAMS REVISED RESUME' PDF
SKYLER WILLIAMS REVISED RESUME' PDFSKYLER WILLIAMS REVISED RESUME' PDF
SKYLER WILLIAMS REVISED RESUME' PDF
 
Bab3
Bab3Bab3
Bab3
 
Bab3
Bab3Bab3
Bab3
 
Bab3
Bab3Bab3
Bab3
 
Il treno-della-vita
Il treno-della-vitaIl treno-della-vita
Il treno-della-vita
 
Bab 4
Bab 4Bab 4
Bab 4
 
Il treno-della-vita
Il treno-della-vitaIl treno-della-vita
Il treno-della-vita
 
Il treno-della-vita
Il treno-della-vitaIl treno-della-vita
Il treno-della-vita
 

Similar to Fraud white paper

February 2010 8 Things You Cant Afford To Ignore About eDiscovery
February 2010 8 Things You Cant Afford To Ignore About eDiscoveryFebruary 2010 8 Things You Cant Afford To Ignore About eDiscovery
February 2010 8 Things You Cant Afford To Ignore About eDiscoveryJohn Wang
 
SAS Fraud Framework for Insurance
SAS Fraud Framework for InsuranceSAS Fraud Framework for Insurance
SAS Fraud Framework for Insurancestuartdrose
 
RSA 2006 - Visual Security Event Analysis
RSA 2006 - Visual Security Event AnalysisRSA 2006 - Visual Security Event Analysis
RSA 2006 - Visual Security Event AnalysisRaffael Marty
 
Fighting Fraud With Digital Forensics
Fighting Fraud With Digital ForensicsFighting Fraud With Digital Forensics
Fighting Fraud With Digital ForensicsTOMMY SEAH
 
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxAnomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxImpetus Technologies
 
Analisi dati per software anti frode. Roberto Marmo, Università di Pavia
Analisi dati per software anti frode. Roberto Marmo, Università di PaviaAnalisi dati per software anti frode. Roberto Marmo, Università di Pavia
Analisi dati per software anti frode. Roberto Marmo, Università di PaviaData Driven Innovation
 
IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...
IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...
IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...IRJET Journal
 
Fraud detection Presentation
Fraud detection PresentationFraud detection Presentation
Fraud detection PresentationHernan Huwyler
 
Fraud Detection presentation
Fraud Detection presentationFraud Detection presentation
Fraud Detection presentationHernan Huwyler
 
Six Steps to SIEM Success
Six Steps to SIEM SuccessSix Steps to SIEM Success
Six Steps to SIEM SuccessAlienVault
 
Data Security Metricsa Value Based Approach
Data Security Metricsa Value Based ApproachData Security Metricsa Value Based Approach
Data Security Metricsa Value Based ApproachFlaskdata.io
 

Similar to Fraud white paper (20)

February 2010 8 Things You Cant Afford To Ignore About eDiscovery
February 2010 8 Things You Cant Afford To Ignore About eDiscoveryFebruary 2010 8 Things You Cant Afford To Ignore About eDiscovery
February 2010 8 Things You Cant Afford To Ignore About eDiscovery
 
SAS Fraud Framework for Insurance
SAS Fraud Framework for InsuranceSAS Fraud Framework for Insurance
SAS Fraud Framework for Insurance
 
Fraud Detection Presentation Forum
Fraud Detection Presentation ForumFraud Detection Presentation Forum
Fraud Detection Presentation Forum
 
Analisis de Riesgos O-ISM3
Analisis de Riesgos O-ISM3Analisis de Riesgos O-ISM3
Analisis de Riesgos O-ISM3
 
Fraud Management Solutions
Fraud Management SolutionsFraud Management Solutions
Fraud Management Solutions
 
Credit Card Fraud Detection_ Mansi_Choudhary.pptx
Credit Card Fraud Detection_ Mansi_Choudhary.pptxCredit Card Fraud Detection_ Mansi_Choudhary.pptx
Credit Card Fraud Detection_ Mansi_Choudhary.pptx
 
RSA 2006 - Visual Security Event Analysis
RSA 2006 - Visual Security Event AnalysisRSA 2006 - Visual Security Event Analysis
RSA 2006 - Visual Security Event Analysis
 
50 51
50 5150 51
50 51
 
Fighting Fraud With Digital Forensics
Fighting Fraud With Digital ForensicsFighting Fraud With Digital Forensics
Fighting Fraud With Digital Forensics
 
Fraud Monitoring Solution
Fraud Monitoring SolutionFraud Monitoring Solution
Fraud Monitoring Solution
 
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxAnomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
 
Analisi dati per software anti frode. Roberto Marmo, Università di Pavia
Analisi dati per software anti frode. Roberto Marmo, Università di PaviaAnalisi dati per software anti frode. Roberto Marmo, Università di Pavia
Analisi dati per software anti frode. Roberto Marmo, Università di Pavia
 
IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...
IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...
IRJET- A Comparative Study to Detect Fraud Financial Statement using Data Min...
 
Fraud detection Presentation
Fraud detection PresentationFraud detection Presentation
Fraud detection Presentation
 
Fraud Detection presentation
Fraud Detection presentationFraud Detection presentation
Fraud Detection presentation
 
Combating Revenue Leakage 2013 03 18 webinar 2013
 Combating Revenue Leakage 2013 03 18 webinar 2013    Combating Revenue Leakage 2013 03 18 webinar 2013
Combating Revenue Leakage 2013 03 18 webinar 2013
 
Anomaly Detection
Anomaly DetectionAnomaly Detection
Anomaly Detection
 
Six Steps to SIEM Success
Six Steps to SIEM SuccessSix Steps to SIEM Success
Six Steps to SIEM Success
 
Data Security Metricsa Value Based Approach
Data Security Metricsa Value Based ApproachData Security Metricsa Value Based Approach
Data Security Metricsa Value Based Approach
 
Window of Compromise
Window of CompromiseWindow of Compromise
Window of Compromise
 

Fraud white paper

  • 1. Advanced Analytics: Fraud Detection System
  • 2. Conceptual Design Of A Fraud Detection System 2
  • 3. Fraud Detection System- What is It? • Fraud Detection System improves the productivity of claims (analyst) department to detect fraud – Higher detection with lower human effort • Some challenges for an insurance agency – Fraud cases are not labeled and often unknown i.e. not self revealing – Patterns of fraud change frequently. Old fraud patterns might not continue. – Cases occur with relative rarity – Fewer cases of fraud across small data set (base rate and sample size problems) • Given the challenges – Off-the-shelf products might not work effectively. They might yield a one time performance gain and level off afterwards – Need a hand crafted solution that matures over time to fit a specific insurance agency’s business lines • Long Term view of Fraud Detection System – Instead of one-time quick performance gain, Deep Blue proposes a long term view for fraud detection in which we continuously label the new cases with the help of Analysts and improve the coverage of fraud cases. 3
  • 4. Our Methodology For Chartis Context • Bootstrap the knowledge base of fraud detection – Work with existing fraud analyst team/experts to construct criterion for fraud cases – Anomaly detection by deep analysis of available data and features generates a large number of hypothesis to locate potential fraud cases • This is done by detecting anomalies across various hierarchies (providers, claimants, geographies, etc.) and across features within hierarchies – This leads to a simple system which flags cases for labeling • Deploy Machine Learning to analyze labeled cases and construct robust fraud prediction models – Adapt the algorithms to changing patterns in the fraud by periodic rebuilding – Continuously force the fraud prediction models to explore other features (attributes) as potential lead indicators of fraud. Expand types of fraud that are uncovered. • Make continuous effort to improve the quality of fraud detection case data – In bootstrapped system, cases flagged for review may not have a prediction (i.e. fraud/not and “case of interest” or not) due to lack of labeled data – Active learning uses subsequently labeled cases to enable prediction of (1) Fraud; and (2) Cases of Interest 4
  • 5. Fraud Detection System (FDS) – Boot Strapping Fraud Analysts Labeled Cases (Human) Labeled Case database Anomaly Expert System Detection - Boot Strapping - Distribution Analysis Knowledge base - Feature Analysis 5
  • 6. FDS – Expansion and Adaption Expert System Decommissioned Labeled Cases Fraud Analysts (Human) Labeled Case database Cases to Fraud And/Or Evaluate Case Of Interest Anomaly Fraud Detection Engine Detection Active Learner -ML Predictive Models - Distribution Analysis Ongoing - Rankings & Voting - Feature Analysis - Adapting Models Case Stream Monitoring Not Classifiable Not Fraud and Not Case Of Interest 6
  • 7. Key Strengths Of Proposed Design • Incremental design which produces incremental benefits at each step • Extremely adaptable to changing patterns of fraud • Modular in design and highly reusable across business lines – A minor customization is necessary to adapt to specific business lines • Low risk investment approach – To improve data collection and knowledge repository around fraud detection – To develop analytical infrastructure that creates fraud detection capabilities inside Chartis • Ability to apply the best of breed techniques and latest research advancements in fraud detection – Packaged products often lag cutting edge modeling advancements by a few years 7
  • 8. Sample Application - Auto Insurance Claim Fraud Labeled Claims 1) Bumper, injury, NY, … F, COI* Fraud Analysts Labeled Cases 2) Side collision, dent, OH,..NF, NCOI (Human) 3) flooding, radiator, AZ,.. NF, COI Cases to Fraud And/Or Evaluate Case Of Interest Input for Fraud Detection Engine Model Building Anomaly Active Learner - Some Techniques: Logistic Detection Regression, Neural Networks, -Active clustering Decision Trees , Random Forest - Distribution Analysis Ongoing - Feature Analysis - Fuzzy claims - If trees are used, a potential rule: Case Stream Monitoring If ( zip = 10063 && type = bumper && time_of_incident < 6 AM ) => COI Input Data: (1) Accident Characteristics Not Classifiable (2) Claimant Characteristics (3) Insured Characteristics Not Fraud and (4) Injury Characteristics Not Case Of Interest (5) Treatment * F- Fraud, NF- No Fraud, COI- Case of Interest, NCOI- No Case of Interest 8