SlideShare a Scribd company logo
Benford’s Law… Is it magic? Gaetan “Guy” Lion  July 2010
What is the probability that the population number of any country starts with any of the first digit: 1,2,3,4,5,6,7,8, or 9? ,[object Object]
…  The Correct Answer
Countries populations follow Benford’s Law Chi Square P value the two distributions are the same: 0.8
Benford’s Law ,[object Object],[object Object],[object Object]
When does this law work? The data crosses at least one scale (or order of magnitude) as shown below:  You preferably need a sample > 100.
Demographic data follows  Benford Law very closely The U.S. has over 3,000 counties.  All shown demographic measures follow Benford’s Law pretty closely.  This very large sample renders the Chi Square Goodness of fit test very (if not excessively) rigorous.
NYSE Stocks volume This captures the first digit frequency of volume of over 2,000 NYSE stocks on June 21 st . The fit is excellent both visually and statistically.
PG&E SmartMeter test This captures 91 observations between April and July 2010 of analog vs SmartMeter kWh consumption readings.  Both the visual and statistical fit are pretty good.
Tennis pros ATP points The number of ATP points of the first 1,600 professional tennis players follow closely Benford’s Law.  Because of the large sample the associated P value is small.
Even when it is not supposed to  work…  It kind of does. I investigated Bernie Madoff’s monthly returns vs its closest competitor (GATEX).  Although those data sets were not fit to use Benford’s Law the visual fit was surprisingly good.
Is Benford Law magic?   Bacteria > No, a simple rule is that there are more small things than large things in the universe…
…  a simple explanation… The general principle is that there are more smaller observations vs larger ones.  There are probably nearly twice as many 1s as there are 2s and three times as many 1s as there are 3s, etc…  Using such a principle throughout gives us a frequency that is close to Benford’s Law.  We would need a sample > 1,000 to reach statistical significance at the 0.05 level that those two distributions are different.
Extending Benford’s Law beyond first digit ,[object Object]
Benford vs Simple rule for first two digits When dealing with first two digits (10 – 99), Benford’s Law and the Simple Rule have indistinguishable distributions.  You would need samples > 700,000 to reach statistical significance at the 0.05 level that the two distributions are different.
Time series growing by 2% per period A time series growing by 2% per period over 116 periods replicates almost exactly Benford’s Law frequency distribution.  This makes sense.  The difference between 1 and 2 is a 100% increase vs between 2 and 3 is only a a 50% increase, etc…  This entails there will be a lot more 1s than other digits.
Math properties of Benford’s Law ,[object Object],[object Object]
The Ones Scaling Test Looking at tax return numbers that followed BL closely, someone used the Ones Scaling Test to see if the number of “1s” would remain the same if multiplied by a constant.  In this case, they multiplied the set of numbers by 1.01 and did that 696 times.  This corresponds to multiplying the numbers progressively up to a factor of 1,000 as 1.01^696 = 1,000.  As shown, across all iterations the number of 1s remained very stable around the BL predicated level of 30.1%.  Source: “The Scientist and Engineer’s Guide to Digital Signal Processing.  Steve Smith, PhD.
[object Object],[object Object]
A few Benford’s Law applications… ,[object Object],[object Object],[object Object],[object Object]
Iran Election Mahmoud Ahmadinejad's vote totals have more '2s' and fewer '1s' than expected.  Roukema speculates Iranian officials replaced 1s by 2s.  So, for instance, in some town where he received 1,954 votes, they would report his having received 2,954 votes.  Source: Nate Silver.  fivethirtyeight.com
Franken Vote count “…This hugely violates Benford's Law -- there are not nearly enough totals beginning in 1 and too many beginning in numbers like 5, 6 and 7. The odds of these anomalies having occurred by chance are greater than a quadrillion to one against…  the reason this pattern emerges is because precinct sizes in Minnesota are not truly random .  There is a large number of precincts in Minnesota that are designed to serve between 1,000 and 2,000 voters; since Franken won about 42 percent of the votes statewide, this leads to a relatively high number of instances where his vote totals are in the high single digits (672, 704, 588, etc.)” Source: Nate Silver. fivethirtyeight.com Senator
Inspector Clouseau demonstrates how to run a fraud investigation
Detecting fraud (an example).  Step 1   A company issued 483 checks in 2009 Q4 that was audited and everything checked out.  It also issued 522 checks in 2010 Q1.  A fraud investigator notes that 09 Q4 pattern fit Benford Law very closely (P value 0.84).  He notes that the fit deteriorated in 010 Q1 9 (P value 0.06).
Step 2. Focus on the difference As shown, the company has issued many more checks starting with the ‘6’ digit than expected (60 vs 35 for BL).
Step 3. Focus on the 6s first two digits We have 28 checks out of 522 starting with the two digits 66 vs 3.4 expected per Benford’s Law.  This calls for further investigation.
Step 4.  Focus on the 66s to three digits   Carrying this analysis to the first three digits, we see an unusual # of checks starting with ‘666’ and ‘668.’  Later, we find that the checks starting with ‘666’ were legitimate ones that four employees wrote to pay for a monthly service that cost $5.95 per month plus tax or $6.66 with tax.  Meanwhile, 9 of the 10 checks starting with ‘668’ were fraudulent ones.
Replicating Clouseau’s success ,[object Object],[object Object],[object Object]
The Key ,[object Object],[object Object]

More Related Content

What's hot

Fraud Prevention, Detection and Investigation in the Payday Advance Industry
Fraud Prevention, Detection and Investigation in the Payday Advance IndustryFraud Prevention, Detection and Investigation in the Payday Advance Industry
Fraud Prevention, Detection and Investigation in the Payday Advance Industry
DecosimoCPAs
 
forensic accounting india
forensic accounting indiaforensic accounting india
forensic accounting india
Mayank Garg
 
Fraud Awareness For Managers
Fraud Awareness For ManagersFraud Awareness For Managers
Fraud Awareness For Managers
rickycfe
 
Chap19 time series-analysis_and_forecasting
Chap19 time series-analysis_and_forecastingChap19 time series-analysis_and_forecasting
Chap19 time series-analysis_and_forecasting
Vishal Kukreja
 
Chapter 5:Recognizing the symptoms of fraud
Chapter 5:Recognizing the symptoms of fraudChapter 5:Recognizing the symptoms of fraud
Chapter 5:Recognizing the symptoms of fraud
VidaB
 

What's hot (20)

Fraud Detection presentation
Fraud Detection presentationFraud Detection presentation
Fraud Detection presentation
 
Fraud Prevention, Detection and Investigation in the Payday Advance Industry
Fraud Prevention, Detection and Investigation in the Payday Advance IndustryFraud Prevention, Detection and Investigation in the Payday Advance Industry
Fraud Prevention, Detection and Investigation in the Payday Advance Industry
 
forensic accounting india
forensic accounting indiaforensic accounting india
forensic accounting india
 
Forensic Accounting
Forensic AccountingForensic Accounting
Forensic Accounting
 
Accounting and financial management
Accounting and financial managementAccounting and financial management
Accounting and financial management
 
Fraud Awareness For Managers
Fraud Awareness For ManagersFraud Awareness For Managers
Fraud Awareness For Managers
 
Forensic accounting
Forensic accountingForensic accounting
Forensic accounting
 
Internal check internal audit internal control
Internal check internal audit internal controlInternal check internal audit internal control
Internal check internal audit internal control
 
Types of Fraud.pptx
Types of Fraud.pptxTypes of Fraud.pptx
Types of Fraud.pptx
 
Chap19 time series-analysis_and_forecasting
Chap19 time series-analysis_and_forecastingChap19 time series-analysis_and_forecasting
Chap19 time series-analysis_and_forecasting
 
ACFE Presentation on Analytics for Fraud Detection and Mitigation
ACFE Presentation on Analytics for Fraud Detection and MitigationACFE Presentation on Analytics for Fraud Detection and Mitigation
ACFE Presentation on Analytics for Fraud Detection and Mitigation
 
Accounting
AccountingAccounting
Accounting
 
Chapter 5:Recognizing the symptoms of fraud
Chapter 5:Recognizing the symptoms of fraudChapter 5:Recognizing the symptoms of fraud
Chapter 5:Recognizing the symptoms of fraud
 
Forensic accounting
Forensic accountingForensic accounting
Forensic accounting
 
Internal control
Internal controlInternal control
Internal control
 
Criminal Justice 7: What to Omit
Criminal Justice 7: What to OmitCriminal Justice 7: What to Omit
Criminal Justice 7: What to Omit
 
Fraud Risk Awareness
Fraud Risk AwarenessFraud Risk Awareness
Fraud Risk Awareness
 
Who Commits Fraud
Who Commits Fraud   Who Commits Fraud
Who Commits Fraud
 
chapter-eight.ppt
chapter-eight.pptchapter-eight.ppt
chapter-eight.ppt
 
Computer Assisted Audit Tools and Techniques - the Force multiplier in the ba...
Computer Assisted Audit Tools and Techniques - the Force multiplier in the ba...Computer Assisted Audit Tools and Techniques - the Force multiplier in the ba...
Computer Assisted Audit Tools and Techniques - the Force multiplier in the ba...
 

Similar to Benford's law

Final Initial Project Development With Discussion
Final Initial Project Development With DiscussionFinal Initial Project Development With Discussion
Final Initial Project Development With Discussion
easternman99
 
Data quality applications_of_benford's_law_(finalv2)
Data quality applications_of_benford's_law_(finalv2)Data quality applications_of_benford's_law_(finalv2)
Data quality applications_of_benford's_law_(finalv2)
Robert Hillard
 
Cartel detection and collusion screening: an empirical analysis of the London...
Cartel detection and collusion screening: an empirical analysis of the London...Cartel detection and collusion screening: an empirical analysis of the London...
Cartel detection and collusion screening: an empirical analysis of the London...
Dr Danilo Samà
 
Ballot Problem for Many Candidates
Ballot Problem for Many CandidatesBallot Problem for Many Candidates
Ballot Problem for Many Candidates
guest838786
 
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxHomework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
pooleavelina
 
Nov-Dec INTA -FAY TEPLITSKY
Nov-Dec INTA -FAY TEPLITSKYNov-Dec INTA -FAY TEPLITSKY
Nov-Dec INTA -FAY TEPLITSKY
Fay Teplitsky
 
1 Econometrics Final Exam Summer 2017 This exam i.docx
1  Econometrics Final Exam Summer 2017 This exam i.docx1  Econometrics Final Exam Summer 2017 This exam i.docx
1 Econometrics Final Exam Summer 2017 This exam i.docx
honey725342
 
Finding Patterns in Data Breaches
Finding Patterns in Data BreachesFinding Patterns in Data Breaches
Finding Patterns in Data Breaches
Rochester Security Summit
 
01 Descriptive Statistics for Exploring Data.pdf
01 Descriptive Statistics for Exploring Data.pdf01 Descriptive Statistics for Exploring Data.pdf
01 Descriptive Statistics for Exploring Data.pdf
SREDDINIRANJAN
 

Similar to Benford's law (20)

Mathematics ib ia example
Mathematics ib ia exampleMathematics ib ia example
Mathematics ib ia example
 
Final Initial Project Development With Discussion
Final Initial Project Development With DiscussionFinal Initial Project Development With Discussion
Final Initial Project Development With Discussion
 
Vote Counts and Benford's Law
Vote Counts and Benford's LawVote Counts and Benford's Law
Vote Counts and Benford's Law
 
Data quality applications_of_benford's_law_(finalv2)
Data quality applications_of_benford's_law_(finalv2)Data quality applications_of_benford's_law_(finalv2)
Data quality applications_of_benford's_law_(finalv2)
 
Benford's Law and Fraud Detection
Benford's Law and Fraud DetectionBenford's Law and Fraud Detection
Benford's Law and Fraud Detection
 
Benfords law o vs e
Benfords law o vs eBenfords law o vs e
Benfords law o vs e
 
Modelling Conformity of Nigeria’s Recent Population Censuses With Benford’s D...
Modelling Conformity of Nigeria’s Recent Population Censuses With Benford’s D...Modelling Conformity of Nigeria’s Recent Population Censuses With Benford’s D...
Modelling Conformity of Nigeria’s Recent Population Censuses With Benford’s D...
 
Data forensics with R and Power BI
Data forensics with R and Power BIData forensics with R and Power BI
Data forensics with R and Power BI
 
10 ways to identify Accounts Payable fraud Pt1
10 ways to identify Accounts Payable fraud Pt110 ways to identify Accounts Payable fraud Pt1
10 ways to identify Accounts Payable fraud Pt1
 
Cartel detection and collusion screening: an empirical analysis of the London...
Cartel detection and collusion screening: an empirical analysis of the London...Cartel detection and collusion screening: an empirical analysis of the London...
Cartel detection and collusion screening: an empirical analysis of the London...
 
Ballot Problem for Many Candidates
Ballot Problem for Many CandidatesBallot Problem for Many Candidates
Ballot Problem for Many Candidates
 
The Southbourne Tax Group: 10 Ways to Identify Accounts Payable Fraud
The Southbourne Tax Group: 10 Ways to Identify Accounts Payable FraudThe Southbourne Tax Group: 10 Ways to Identify Accounts Payable Fraud
The Southbourne Tax Group: 10 Ways to Identify Accounts Payable Fraud
 
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxHomework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
 
Nov-Dec INTA -FAY TEPLITSKY
Nov-Dec INTA -FAY TEPLITSKYNov-Dec INTA -FAY TEPLITSKY
Nov-Dec INTA -FAY TEPLITSKY
 
226 lec9 jda
226 lec9 jda226 lec9 jda
226 lec9 jda
 
1 Econometrics Final Exam Summer 2017 This exam i.docx
1  Econometrics Final Exam Summer 2017 This exam i.docx1  Econometrics Final Exam Summer 2017 This exam i.docx
1 Econometrics Final Exam Summer 2017 This exam i.docx
 
Finding Patterns in Data Breaches
Finding Patterns in Data BreachesFinding Patterns in Data Breaches
Finding Patterns in Data Breaches
 
01 Descriptive Statistics for Exploring Data.pdf
01 Descriptive Statistics for Exploring Data.pdf01 Descriptive Statistics for Exploring Data.pdf
01 Descriptive Statistics for Exploring Data.pdf
 
benfords Law
benfords Lawbenfords Law
benfords Law
 
assignment of statistics 2.pdf
assignment of statistics 2.pdfassignment of statistics 2.pdf
assignment of statistics 2.pdf
 

More from Gaetan Lion

More from Gaetan Lion (20)

DRU projections testing.pptx
DRU projections testing.pptxDRU projections testing.pptx
DRU projections testing.pptx
 
Climate Change in 24 US Cities
Climate Change in 24 US CitiesClimate Change in 24 US Cities
Climate Change in 24 US Cities
 
Compact Letter Display (CLD). How it works
Compact Letter Display (CLD).  How it worksCompact Letter Display (CLD).  How it works
Compact Letter Display (CLD). How it works
 
CalPERS pensions vs. Social Security
CalPERS pensions vs. Social SecurityCalPERS pensions vs. Social Security
CalPERS pensions vs. Social Security
 
Recessions.pptx
Recessions.pptxRecessions.pptx
Recessions.pptx
 
Inequality in the United States
Inequality in the United StatesInequality in the United States
Inequality in the United States
 
Housing Price Models
Housing Price ModelsHousing Price Models
Housing Price Models
 
Global Aging.pdf
Global Aging.pdfGlobal Aging.pdf
Global Aging.pdf
 
Cryptocurrencies as an asset class
Cryptocurrencies as an asset classCryptocurrencies as an asset class
Cryptocurrencies as an asset class
 
Can you Deep Learn the Stock Market?
Can you Deep Learn the Stock Market?Can you Deep Learn the Stock Market?
Can you Deep Learn the Stock Market?
 
Can Treasury Inflation Protected Securities predict Inflation?
Can Treasury Inflation Protected Securities predict Inflation?Can Treasury Inflation Protected Securities predict Inflation?
Can Treasury Inflation Protected Securities predict Inflation?
 
How overvalued is the Stock Market?
How overvalued is the Stock Market? How overvalued is the Stock Market?
How overvalued is the Stock Market?
 
The relationship between the Stock Market and Interest Rates
The relationship between the Stock Market and Interest RatesThe relationship between the Stock Market and Interest Rates
The relationship between the Stock Market and Interest Rates
 
Life expectancy
Life expectancyLife expectancy
Life expectancy
 
Comparing R vs. Python for data visualization
Comparing R vs. Python for data visualizationComparing R vs. Python for data visualization
Comparing R vs. Python for data visualization
 
Will Stock Markets survive in 200 years?
Will Stock Markets survive in 200 years?Will Stock Markets survive in 200 years?
Will Stock Markets survive in 200 years?
 
Standardization
StandardizationStandardization
Standardization
 
Is Tom Brady the greatest quarterback?
Is Tom Brady the greatest quarterback?Is Tom Brady the greatest quarterback?
Is Tom Brady the greatest quarterback?
 
Regularization why you should avoid them
Regularization why you should avoid themRegularization why you should avoid them
Regularization why you should avoid them
 
Basketball the 3 pt game
Basketball the 3 pt gameBasketball the 3 pt game
Basketball the 3 pt game
 

Recently uploaded

The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

....................Muslim-Law notes.pdf
....................Muslim-Law notes.pdf....................Muslim-Law notes.pdf
....................Muslim-Law notes.pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
 
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxslides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
 
Telling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdf
Telling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdfTelling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdf
Telling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdf
 
“O BEIJO” EM ARTE .
“O BEIJO” EM ARTE                       .“O BEIJO” EM ARTE                       .
“O BEIJO” EM ARTE .
 
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
NCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdfNCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdf
 
[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation
 
Salient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptxSalient features of Environment protection Act 1986.pptx
Salient features of Environment protection Act 1986.pptx
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdfINU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
 
Championnat de France de Tennis de table/
Championnat de France de Tennis de table/Championnat de France de Tennis de table/
Championnat de France de Tennis de table/
 
Features of Video Calls in the Discuss Module in Odoo 17
Features of Video Calls in the Discuss Module in Odoo 17Features of Video Calls in the Discuss Module in Odoo 17
Features of Video Calls in the Discuss Module in Odoo 17
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptx
 
Application of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matricesApplication of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matrices
 
The Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational ResourcesThe Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational Resources
 
Word Stress rules esl .pptx
Word Stress rules esl               .pptxWord Stress rules esl               .pptx
Word Stress rules esl .pptx
 

Benford's law

  • 1. Benford’s Law… Is it magic? Gaetan “Guy” Lion July 2010
  • 2.
  • 3. … The Correct Answer
  • 4. Countries populations follow Benford’s Law Chi Square P value the two distributions are the same: 0.8
  • 5.
  • 6. When does this law work? The data crosses at least one scale (or order of magnitude) as shown below: You preferably need a sample > 100.
  • 7. Demographic data follows Benford Law very closely The U.S. has over 3,000 counties. All shown demographic measures follow Benford’s Law pretty closely. This very large sample renders the Chi Square Goodness of fit test very (if not excessively) rigorous.
  • 8. NYSE Stocks volume This captures the first digit frequency of volume of over 2,000 NYSE stocks on June 21 st . The fit is excellent both visually and statistically.
  • 9. PG&E SmartMeter test This captures 91 observations between April and July 2010 of analog vs SmartMeter kWh consumption readings. Both the visual and statistical fit are pretty good.
  • 10. Tennis pros ATP points The number of ATP points of the first 1,600 professional tennis players follow closely Benford’s Law. Because of the large sample the associated P value is small.
  • 11. Even when it is not supposed to work… It kind of does. I investigated Bernie Madoff’s monthly returns vs its closest competitor (GATEX). Although those data sets were not fit to use Benford’s Law the visual fit was surprisingly good.
  • 12. Is Benford Law magic? Bacteria > No, a simple rule is that there are more small things than large things in the universe…
  • 13. … a simple explanation… The general principle is that there are more smaller observations vs larger ones. There are probably nearly twice as many 1s as there are 2s and three times as many 1s as there are 3s, etc… Using such a principle throughout gives us a frequency that is close to Benford’s Law. We would need a sample > 1,000 to reach statistical significance at the 0.05 level that those two distributions are different.
  • 14.
  • 15. Benford vs Simple rule for first two digits When dealing with first two digits (10 – 99), Benford’s Law and the Simple Rule have indistinguishable distributions. You would need samples > 700,000 to reach statistical significance at the 0.05 level that the two distributions are different.
  • 16. Time series growing by 2% per period A time series growing by 2% per period over 116 periods replicates almost exactly Benford’s Law frequency distribution. This makes sense. The difference between 1 and 2 is a 100% increase vs between 2 and 3 is only a a 50% increase, etc… This entails there will be a lot more 1s than other digits.
  • 17.
  • 18. The Ones Scaling Test Looking at tax return numbers that followed BL closely, someone used the Ones Scaling Test to see if the number of “1s” would remain the same if multiplied by a constant. In this case, they multiplied the set of numbers by 1.01 and did that 696 times. This corresponds to multiplying the numbers progressively up to a factor of 1,000 as 1.01^696 = 1,000. As shown, across all iterations the number of 1s remained very stable around the BL predicated level of 30.1%. Source: “The Scientist and Engineer’s Guide to Digital Signal Processing. Steve Smith, PhD.
  • 19.
  • 20.
  • 21. Iran Election Mahmoud Ahmadinejad's vote totals have more '2s' and fewer '1s' than expected. Roukema speculates Iranian officials replaced 1s by 2s. So, for instance, in some town where he received 1,954 votes, they would report his having received 2,954 votes. Source: Nate Silver. fivethirtyeight.com
  • 22. Franken Vote count “…This hugely violates Benford's Law -- there are not nearly enough totals beginning in 1 and too many beginning in numbers like 5, 6 and 7. The odds of these anomalies having occurred by chance are greater than a quadrillion to one against… the reason this pattern emerges is because precinct sizes in Minnesota are not truly random . There is a large number of precincts in Minnesota that are designed to serve between 1,000 and 2,000 voters; since Franken won about 42 percent of the votes statewide, this leads to a relatively high number of instances where his vote totals are in the high single digits (672, 704, 588, etc.)” Source: Nate Silver. fivethirtyeight.com Senator
  • 23. Inspector Clouseau demonstrates how to run a fraud investigation
  • 24. Detecting fraud (an example). Step 1 A company issued 483 checks in 2009 Q4 that was audited and everything checked out. It also issued 522 checks in 2010 Q1. A fraud investigator notes that 09 Q4 pattern fit Benford Law very closely (P value 0.84). He notes that the fit deteriorated in 010 Q1 9 (P value 0.06).
  • 25. Step 2. Focus on the difference As shown, the company has issued many more checks starting with the ‘6’ digit than expected (60 vs 35 for BL).
  • 26. Step 3. Focus on the 6s first two digits We have 28 checks out of 522 starting with the two digits 66 vs 3.4 expected per Benford’s Law. This calls for further investigation.
  • 27. Step 4. Focus on the 66s to three digits Carrying this analysis to the first three digits, we see an unusual # of checks starting with ‘666’ and ‘668.’ Later, we find that the checks starting with ‘666’ were legitimate ones that four employees wrote to pay for a monthly service that cost $5.95 per month plus tax or $6.66 with tax. Meanwhile, 9 of the 10 checks starting with ‘668’ were fraudulent ones.
  • 28.
  • 29.