SlideShare a Scribd company logo
1 of 4
Download to read offline
Statistical Approach to Customer Risk Rating
By Mayank Johri, Amin Ahmadi, Michael Spieler
Introduction
The customer risk rating (CRR) model’s objective is to assess the AML risk of the Bank’s customers based on each
customer’s profile. The risk attributes considered in model development are based on FFIEC guidelines as well as
industry expertise. Each population group (Individual, Entity, and Alternative Investment), is a separate model which
has a combination of population group specific characteristics and characteristics common to all customer (“attributes”).
To score these attributes, samples of customer profiles are drawn from each population group. Using only these sample
customer profiles, BSA/AML SMEs risk rate these customer files. The results of these reviews are then used to calibrate
the model. Once calibrated, the model assigns each customer a continuous risk score that is converted to a discrete risk
rating (High, Medium, and Low). After the application of the model (“CRR Run”), each customer is assigned a risk
score. Reference sources for the risk model include the existing model, regulatory guidance and AML subject matter
expertise.
Phases
The following outlines the eight phase process for building CRR model:
Phase 0 | Define Approach. The customer risk scoring model is based on Customer Due Diligence Standards by each
population group: Individuals, Entity, and Alternative Investment. Each of the three risk models will be calibrated
separately. The risk attributes included in the risk models assess and measure customer AML risk in alignment with the
views of risk held by BSA/AML Management.
Phase 1 | Define Risk Categories. The model consists of the following categories: Geography, Customer (e.g.
Political Profile, Negative News, and Person of Interest), Product/Account, and Transaction. An additional
“Governance” category is included for Entity and Alternative Investment. These categories will contain risk attributes
indicative of money laundering risk and are scored individually. There is an example of the geography category in Table
1.1.
Table 1.1
Individuals Entity Alternative Investments
Country of Citizenship
Country of Residency
Non-US Address
Country of Formation
Country of Operation
Country of Incorporation
Country of Primary
Business Operations
Country of Registration
Country of Investments
Geography
Due Diligence Standards
Phase 2 |Data Analysis. Determine the relevant risk attributes for each risk categories, by first conducting working
group sessions to conceptually agree on the risk attributes, then using existing customer data for the following:
 Assessing data completeness (% of non-null/blank values)
 Assessing variation within a given variable
Risk attributes that have low data completeness or low variation are removed. Low data completeness means a large
portion of the customers don’t have usable values, making it a meaningless attribute for the majority of customers. Low
variation means a large portion of the customers have the same value, complicating the model with additional attributes
that don’t help differentiate any one customer from the rest.
Phase 3 |Correlation Test. To give a preliminary understanding of the levels pairwise correlation between attributes,
test using various correlation coefficients based on the data types (e.g. binary, ordinal and continuous). Appropriate
pairwise correlation coefficients are as follows:
i. Binary - Binary: phi coefficient (mean square contingency coefficient)
ii. Binary - Continuous: ANOVA to evaluate how likely the correlation between two
variables is by chance
iii. Ordinal - Continuous: Kendall’s(τ)
iv. Ordinal – Ordinal: Spearman’s (rs)
A correlation study would entail quantifying the relationships among all the attributes (as opposed to pairwise) to
evaluate the relationship between one attribute relative to all of the others. This study can be directly used to decide
which attributes need to stay in the model, and whether their contribution is at the additive level or also needs to be
accounted at two-way (or three-way) interactive level (s). Highly pairwise correlated (> 90%) attributes suggest that one
might be eliminated without losing the accuracy of the model, but not if high multicollinearity, which can only be
quantified at the model level, also exists. Below is the interpretation of the correlation study.
 Highly pairwise correlated (> 90%) attributes: leave only one in the model. In order to
decide which one to remove one can calculate “coefficient of partial determination” and
eliminate the one with lower value
 Attributes with no or low (<5%) correlation: leave both in the model; there is strong
evidence that including the interaction term of these two attributes would not significantly
contribute to the accuracy of the model
 Attributes with some level of correlation: leave both in the model; interaction term should
be investigated at the model level analysis
Phase 3 |Sampling. For each population group, draw samples to cover all combinations of risks for SME risk rating.
The outcome of the SME risk rating is the labeled data (label being the ranking of the customer) which is used to “learn”
the regression model. The samples may be generated as follows:
 Organize data into a list with each row representing a specific combination and the last
column showing the frequency of the case
 Select the target sample size: n
 For each row, develop the binomial marginal distribution
 Determine expected value of each cell based on the selected sample size
 For rare event cases, augment to have size of at least N= 10k/p where k is the number of
covariates and p is the probability of the rare event case
Phase 4 |SME Rating. Reviewing one sampled (from phase 3) customer at a time, an AML SME looks at that
customer’s values for each risk attribute and ascertains a risk rating or high, medium, or low, based regulatory guidance
and the BSA/AML department’s risk tolerance.
Phase 5 |Model Calibration. Conduct statistical analysis using SME’s risk rating to calibrate the model. Model
calibration has the following steps:
The risk rating of samples, as labeled by SME, is fitted by ordinal logistic regression. The developed model is used to risk
rate the entire population. Ordinal logistic model assigns each customer, based on the customer’s attributes, to a
distribution interval (i.e., risk tier). Fig. 1.2 (typical result of ordinal logistic regression) shows a schematic representation
of the model result.
Figure 1.2
Phase 6 |Model Performance Evaluation. Given the structure of CRR data, a logistic regression is used to relate
selected attributes to customer risk ranks (as determined by an SME). An optimized model is developed based on the
performance of the selected attributes (i.e., how much does the attribute contribute toward the accuracy of the model?)
and how well the optimized model performs vis-à-vis the benchmark model.
 A Wald test is used to decide if an attribute needs to be included in the optimized model by
quantifying how far the estimated parameter is from zero. The Wald test can also be used for
multiple parameters simultaneously
 A log-likelihood ratio test is used to compare “reduced” (candidate optimized) models with
the benchmark model. The benchmark model, which includes all the attributes and their
second or/and higher level interaction terms, is also the most restrictive model. This means
that the benchmark model has the best performance within the current data, but has
diminished generalization capability. In general one tries to select the most simplified model
which still has acceptable performance
 McFadden pseudo-R2 is used to gauge how much the logistic regression model explains a
variation in the response. This is considered a similar measure to the ubiquitous R2 used in
linear regression. The value of pseudo- R2 ranges between 0 and 1; values close to 0 indicates
the model has no predictive capability
 Uncertainty of the final model is quantified to evaluate how well the model would apply to
future customers. The uncertainty may be characterized at two levels:
i. Uncertainty of the selected attributes represented by the variance of the regression
coefficients
ii. Residual uncertainty represented by the standard error of the model
Assess if the differences between model prediction and SME ratings are within acceptable model error rate. Confirm the
model output’s alignment with working group’s views and expectations. If it is not, go back to phase 5 and calibrate the
model again.
Phase 7 |Model Implementation. Once the evaluation is complete and the result is satisfactory, the model is
implemented and run on a regular schedule in an automated fashion. The outcome of the model can potentially drive
various processes, including periodic due diligence reviews and stratified transaction monitoring thresholds.

More Related Content

What's hot

Aon FI Risk Advisory - CCAR Variable Selection
Aon FI Risk Advisory - CCAR Variable SelectionAon FI Risk Advisory - CCAR Variable Selection
Aon FI Risk Advisory - CCAR Variable SelectionEvan Sekeris
 
Step by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectStep by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectRamkumar Ravichandran
 
Variable Selection for CCAR
Variable Selection for CCARVariable Selection for CCAR
Variable Selection for CCAREvan Sekeris
 
Making Analytics Actionable for Financial Institutions (Part II of III)
Making Analytics Actionable for Financial Institutions (Part II of III)Making Analytics Actionable for Financial Institutions (Part II of III)
Making Analytics Actionable for Financial Institutions (Part II of III)Cognizant
 
Presentation Title
Presentation TitlePresentation Title
Presentation Titlebutest
 
Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...
Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...
Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...Thomas Lee
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
What Is a Model, Anyhow?
What Is a Model, Anyhow?What Is a Model, Anyhow?
What Is a Model, Anyhow?Bill Cassill
 
Automation of IT Ticket Automation using NLP and Deep Learning
Automation of IT Ticket Automation using NLP and Deep LearningAutomation of IT Ticket Automation using NLP and Deep Learning
Automation of IT Ticket Automation using NLP and Deep LearningPranov Mishra
 
IFRS17 Risk Adjustment modeling
IFRS17 Risk Adjustment modelingIFRS17 Risk Adjustment modeling
IFRS17 Risk Adjustment modelingSyedDanishAli25
 
How You Can Demystify Model Risk Management
How You Can Demystify Model Risk ManagementHow You Can Demystify Model Risk Management
How You Can Demystify Model Risk ManagementColleen Beck-Domanico
 
Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.Genest Benoit
 
Risk Assessment Model and its Integration into an Established Test Process
Risk Assessment Model and its Integration into an Established Test ProcessRisk Assessment Model and its Integration into an Established Test Process
Risk Assessment Model and its Integration into an Established Test Processijtsrd
 
Cap markets news sep2002
Cap markets news sep2002Cap markets news sep2002
Cap markets news sep2002Gloria Ikosi
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industryskewdlogix
 
Demand forecasting
Demand forecastingDemand forecasting
Demand forecastingdkamalim92
 

What's hot (18)

Aon FI Risk Advisory - CCAR Variable Selection
Aon FI Risk Advisory - CCAR Variable SelectionAon FI Risk Advisory - CCAR Variable Selection
Aon FI Risk Advisory - CCAR Variable Selection
 
Step by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectStep by Step guide to executing an analytics project
Step by Step guide to executing an analytics project
 
Variable Selection for CCAR
Variable Selection for CCARVariable Selection for CCAR
Variable Selection for CCAR
 
Making Analytics Actionable for Financial Institutions (Part II of III)
Making Analytics Actionable for Financial Institutions (Part II of III)Making Analytics Actionable for Financial Institutions (Part II of III)
Making Analytics Actionable for Financial Institutions (Part II of III)
 
Presentation Title
Presentation TitlePresentation Title
Presentation Title
 
Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...
Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...
Advantages of Regression Models Over Expert Judgement for Characterizing Cybe...
 
Classes of Model
Classes of ModelClasses of Model
Classes of Model
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - Smarten
 
What Is a Model, Anyhow?
What Is a Model, Anyhow?What Is a Model, Anyhow?
What Is a Model, Anyhow?
 
Automation of IT Ticket Automation using NLP and Deep Learning
Automation of IT Ticket Automation using NLP and Deep LearningAutomation of IT Ticket Automation using NLP and Deep Learning
Automation of IT Ticket Automation using NLP and Deep Learning
 
IFRS17 Risk Adjustment modeling
IFRS17 Risk Adjustment modelingIFRS17 Risk Adjustment modeling
IFRS17 Risk Adjustment modeling
 
Predictive analytics
Predictive analyticsPredictive analytics
Predictive analytics
 
How You Can Demystify Model Risk Management
How You Can Demystify Model Risk ManagementHow You Can Demystify Model Risk Management
How You Can Demystify Model Risk Management
 
Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.
 
Risk Assessment Model and its Integration into an Established Test Process
Risk Assessment Model and its Integration into an Established Test ProcessRisk Assessment Model and its Integration into an Established Test Process
Risk Assessment Model and its Integration into an Established Test Process
 
Cap markets news sep2002
Cap markets news sep2002Cap markets news sep2002
Cap markets news sep2002
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industry
 
Demand forecasting
Demand forecastingDemand forecasting
Demand forecasting
 

Similar to Statistical Approach to CRR

Pillar III presentation 2 27-15 - redacted version
Pillar III presentation 2 27-15 - redacted versionPillar III presentation 2 27-15 - redacted version
Pillar III presentation 2 27-15 - redacted versionBenjamin Huston
 
Model Risk Management: Using an infinitely scalable stress testing platform f...
Model Risk Management: Using an infinitely scalable stress testing platform f...Model Risk Management: Using an infinitely scalable stress testing platform f...
Model Risk Management: Using an infinitely scalable stress testing platform f...QuantUniversity
 
Review Parameters Model Building & Interpretation and Model Tunin.docx
Review Parameters Model Building & Interpretation and Model Tunin.docxReview Parameters Model Building & Interpretation and Model Tunin.docx
Review Parameters Model Building & Interpretation and Model Tunin.docxcarlstromcurtis
 
Zou_Resume_2015_Dec_Q
Zou_Resume_2015_Dec_QZou_Resume_2015_Dec_Q
Zou_Resume_2015_Dec_QXiaorong Zou
 
Credit Card Marketing Classification Trees Fr.docx
 Credit Card Marketing Classification Trees Fr.docx Credit Card Marketing Classification Trees Fr.docx
Credit Card Marketing Classification Trees Fr.docxShiraPrater50
 
Proficiency comparison ofladtree
Proficiency comparison ofladtreeProficiency comparison ofladtree
Proficiency comparison ofladtreeijcsa
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptxVishalLabde
 
Session 07_Risk Assessment Program for YSP_Risk Evaluation
Session 07_Risk Assessment Program for YSP_Risk EvaluationSession 07_Risk Assessment Program for YSP_Risk Evaluation
Session 07_Risk Assessment Program for YSP_Risk EvaluationMuizz Anibire
 
Guide for building GLMS
Guide for building GLMSGuide for building GLMS
Guide for building GLMSAli T. Lotia
 
Campaign response modeling
Campaign response modelingCampaign response modeling
Campaign response modelingEsteban Ribero
 
A Novel Performance Measure For Machine Learning Classification
A Novel Performance Measure For Machine Learning ClassificationA Novel Performance Measure For Machine Learning Classification
A Novel Performance Measure For Machine Learning ClassificationKarin Faust
 
FitchLearning QuantUniversity Model Risk Presentation
FitchLearning QuantUniversity Model Risk PresentationFitchLearning QuantUniversity Model Risk Presentation
FitchLearning QuantUniversity Model Risk PresentationQuantUniversity
 
Advanced Pricing in General Insurance
Advanced Pricing in General InsuranceAdvanced Pricing in General Insurance
Advanced Pricing in General InsuranceSyed Danish Ali
 
A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...
A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...
A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...QuantUniversity
 
Model Risk Management | How to measure and quantify model risk?
Model Risk Management | How to measure and quantify model risk?Model Risk Management | How to measure and quantify model risk?
Model Risk Management | How to measure and quantify model risk?Genest Benoit
 
Independent models validation and automation
Independent models validation and automationIndependent models validation and automation
Independent models validation and automationSohail_farooq
 
Validating Qualitative Models
Validating Qualitative ModelsValidating Qualitative Models
Validating Qualitative ModelsJacob Kosoff
 

Similar to Statistical Approach to CRR (20)

Pillar III presentation 2 27-15 - redacted version
Pillar III presentation 2 27-15 - redacted versionPillar III presentation 2 27-15 - redacted version
Pillar III presentation 2 27-15 - redacted version
 
Model Risk Management: Using an infinitely scalable stress testing platform f...
Model Risk Management: Using an infinitely scalable stress testing platform f...Model Risk Management: Using an infinitely scalable stress testing platform f...
Model Risk Management: Using an infinitely scalable stress testing platform f...
 
Credit scoring i financial sector
Credit scoring i financial  sector Credit scoring i financial  sector
Credit scoring i financial sector
 
Review Parameters Model Building & Interpretation and Model Tunin.docx
Review Parameters Model Building & Interpretation and Model Tunin.docxReview Parameters Model Building & Interpretation and Model Tunin.docx
Review Parameters Model Building & Interpretation and Model Tunin.docx
 
Zou_Resume_2015_Dec_Q
Zou_Resume_2015_Dec_QZou_Resume_2015_Dec_Q
Zou_Resume_2015_Dec_Q
 
Credit scorecard
Credit scorecardCredit scorecard
Credit scorecard
 
Credit Card Marketing Classification Trees Fr.docx
 Credit Card Marketing Classification Trees Fr.docx Credit Card Marketing Classification Trees Fr.docx
Credit Card Marketing Classification Trees Fr.docx
 
Proficiency comparison ofladtree
Proficiency comparison ofladtreeProficiency comparison ofladtree
Proficiency comparison ofladtree
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
Session 07_Risk Assessment Program for YSP_Risk Evaluation
Session 07_Risk Assessment Program for YSP_Risk EvaluationSession 07_Risk Assessment Program for YSP_Risk Evaluation
Session 07_Risk Assessment Program for YSP_Risk Evaluation
 
Guide for building GLMS
Guide for building GLMSGuide for building GLMS
Guide for building GLMS
 
Campaign response modeling
Campaign response modelingCampaign response modeling
Campaign response modeling
 
A Novel Performance Measure For Machine Learning Classification
A Novel Performance Measure For Machine Learning ClassificationA Novel Performance Measure For Machine Learning Classification
A Novel Performance Measure For Machine Learning Classification
 
FitchLearning QuantUniversity Model Risk Presentation
FitchLearning QuantUniversity Model Risk PresentationFitchLearning QuantUniversity Model Risk Presentation
FitchLearning QuantUniversity Model Risk Presentation
 
Using rcm
Using rcmUsing rcm
Using rcm
 
Advanced Pricing in General Insurance
Advanced Pricing in General InsuranceAdvanced Pricing in General Insurance
Advanced Pricing in General Insurance
 
A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...
A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...
A Framework Driven Approach to Model Risk Management (www.dataanalyticsfinanc...
 
Model Risk Management | How to measure and quantify model risk?
Model Risk Management | How to measure and quantify model risk?Model Risk Management | How to measure and quantify model risk?
Model Risk Management | How to measure and quantify model risk?
 
Independent models validation and automation
Independent models validation and automationIndependent models validation and automation
Independent models validation and automation
 
Validating Qualitative Models
Validating Qualitative ModelsValidating Qualitative Models
Validating Qualitative Models
 

Recently uploaded

9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 

Recently uploaded (20)

Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 

Statistical Approach to CRR

  • 1. Statistical Approach to Customer Risk Rating By Mayank Johri, Amin Ahmadi, Michael Spieler Introduction The customer risk rating (CRR) model’s objective is to assess the AML risk of the Bank’s customers based on each customer’s profile. The risk attributes considered in model development are based on FFIEC guidelines as well as industry expertise. Each population group (Individual, Entity, and Alternative Investment), is a separate model which has a combination of population group specific characteristics and characteristics common to all customer (“attributes”). To score these attributes, samples of customer profiles are drawn from each population group. Using only these sample customer profiles, BSA/AML SMEs risk rate these customer files. The results of these reviews are then used to calibrate the model. Once calibrated, the model assigns each customer a continuous risk score that is converted to a discrete risk rating (High, Medium, and Low). After the application of the model (“CRR Run”), each customer is assigned a risk score. Reference sources for the risk model include the existing model, regulatory guidance and AML subject matter expertise. Phases The following outlines the eight phase process for building CRR model: Phase 0 | Define Approach. The customer risk scoring model is based on Customer Due Diligence Standards by each population group: Individuals, Entity, and Alternative Investment. Each of the three risk models will be calibrated separately. The risk attributes included in the risk models assess and measure customer AML risk in alignment with the views of risk held by BSA/AML Management. Phase 1 | Define Risk Categories. The model consists of the following categories: Geography, Customer (e.g. Political Profile, Negative News, and Person of Interest), Product/Account, and Transaction. An additional “Governance” category is included for Entity and Alternative Investment. These categories will contain risk attributes indicative of money laundering risk and are scored individually. There is an example of the geography category in Table 1.1. Table 1.1 Individuals Entity Alternative Investments Country of Citizenship Country of Residency Non-US Address Country of Formation Country of Operation Country of Incorporation Country of Primary Business Operations Country of Registration Country of Investments Geography Due Diligence Standards
  • 2. Phase 2 |Data Analysis. Determine the relevant risk attributes for each risk categories, by first conducting working group sessions to conceptually agree on the risk attributes, then using existing customer data for the following:  Assessing data completeness (% of non-null/blank values)  Assessing variation within a given variable Risk attributes that have low data completeness or low variation are removed. Low data completeness means a large portion of the customers don’t have usable values, making it a meaningless attribute for the majority of customers. Low variation means a large portion of the customers have the same value, complicating the model with additional attributes that don’t help differentiate any one customer from the rest. Phase 3 |Correlation Test. To give a preliminary understanding of the levels pairwise correlation between attributes, test using various correlation coefficients based on the data types (e.g. binary, ordinal and continuous). Appropriate pairwise correlation coefficients are as follows: i. Binary - Binary: phi coefficient (mean square contingency coefficient) ii. Binary - Continuous: ANOVA to evaluate how likely the correlation between two variables is by chance iii. Ordinal - Continuous: Kendall’s(τ) iv. Ordinal – Ordinal: Spearman’s (rs) A correlation study would entail quantifying the relationships among all the attributes (as opposed to pairwise) to evaluate the relationship between one attribute relative to all of the others. This study can be directly used to decide which attributes need to stay in the model, and whether their contribution is at the additive level or also needs to be accounted at two-way (or three-way) interactive level (s). Highly pairwise correlated (> 90%) attributes suggest that one might be eliminated without losing the accuracy of the model, but not if high multicollinearity, which can only be quantified at the model level, also exists. Below is the interpretation of the correlation study.  Highly pairwise correlated (> 90%) attributes: leave only one in the model. In order to decide which one to remove one can calculate “coefficient of partial determination” and eliminate the one with lower value  Attributes with no or low (<5%) correlation: leave both in the model; there is strong evidence that including the interaction term of these two attributes would not significantly contribute to the accuracy of the model  Attributes with some level of correlation: leave both in the model; interaction term should be investigated at the model level analysis Phase 3 |Sampling. For each population group, draw samples to cover all combinations of risks for SME risk rating. The outcome of the SME risk rating is the labeled data (label being the ranking of the customer) which is used to “learn” the regression model. The samples may be generated as follows:  Organize data into a list with each row representing a specific combination and the last column showing the frequency of the case  Select the target sample size: n  For each row, develop the binomial marginal distribution  Determine expected value of each cell based on the selected sample size
  • 3.  For rare event cases, augment to have size of at least N= 10k/p where k is the number of covariates and p is the probability of the rare event case Phase 4 |SME Rating. Reviewing one sampled (from phase 3) customer at a time, an AML SME looks at that customer’s values for each risk attribute and ascertains a risk rating or high, medium, or low, based regulatory guidance and the BSA/AML department’s risk tolerance. Phase 5 |Model Calibration. Conduct statistical analysis using SME’s risk rating to calibrate the model. Model calibration has the following steps: The risk rating of samples, as labeled by SME, is fitted by ordinal logistic regression. The developed model is used to risk rate the entire population. Ordinal logistic model assigns each customer, based on the customer’s attributes, to a distribution interval (i.e., risk tier). Fig. 1.2 (typical result of ordinal logistic regression) shows a schematic representation of the model result. Figure 1.2 Phase 6 |Model Performance Evaluation. Given the structure of CRR data, a logistic regression is used to relate selected attributes to customer risk ranks (as determined by an SME). An optimized model is developed based on the performance of the selected attributes (i.e., how much does the attribute contribute toward the accuracy of the model?) and how well the optimized model performs vis-à-vis the benchmark model.  A Wald test is used to decide if an attribute needs to be included in the optimized model by quantifying how far the estimated parameter is from zero. The Wald test can also be used for multiple parameters simultaneously  A log-likelihood ratio test is used to compare “reduced” (candidate optimized) models with the benchmark model. The benchmark model, which includes all the attributes and their second or/and higher level interaction terms, is also the most restrictive model. This means that the benchmark model has the best performance within the current data, but has diminished generalization capability. In general one tries to select the most simplified model which still has acceptable performance  McFadden pseudo-R2 is used to gauge how much the logistic regression model explains a variation in the response. This is considered a similar measure to the ubiquitous R2 used in linear regression. The value of pseudo- R2 ranges between 0 and 1; values close to 0 indicates the model has no predictive capability
  • 4.  Uncertainty of the final model is quantified to evaluate how well the model would apply to future customers. The uncertainty may be characterized at two levels: i. Uncertainty of the selected attributes represented by the variance of the regression coefficients ii. Residual uncertainty represented by the standard error of the model Assess if the differences between model prediction and SME ratings are within acceptable model error rate. Confirm the model output’s alignment with working group’s views and expectations. If it is not, go back to phase 5 and calibrate the model again. Phase 7 |Model Implementation. Once the evaluation is complete and the result is satisfactory, the model is implemented and run on a regular schedule in an automated fashion. The outcome of the model can potentially drive various processes, including periodic due diligence reviews and stratified transaction monitoring thresholds.