How do auditors start their data analysis journey, this presentation to the 2018 Audit Conference of Chartered Accountants Australia & New Zealand provides an insight into testing for fraud using a "Benford's Test". We also address testing for significant and unusual transactions using stratification of data sets and highlight the basic requirements for performing data analysis in Excel, TeamMate, and IDEA.
2. 2experience. new thinking
What will you learn …
Basic data analysis requirements
Using Excel™
Using TeamMate™
Other computer options
Audit implications
4. 4experience. new thinking
McNamara – The Fog of War
1. Empathize with your enemy
2. Rationality will not save us
3. There's something beyond one's self
4. Maximize efficiency
5. Proportionality should be a guideline in war
6. Get the data
7. Belief and seeing are often both wrong
8. Be prepared to re-examine your reasoning
9. In order to do good, you may have to engage in
evil
10.Never say never
11.You can't change human nature
Eleven Lessons from the Life of Robert S. McNamara
These topics were selected by Errol Morris and highlighted in the 2003 documentary
film; they were not selected by McNamara.
Data analysis provides a window
into the data, enabling decisions
and analysis
5. 5experience. new thinking
Data isn’t information
A data set, such as a transaction list, does not
provide easily understood information. This is
why data analysis tools are particularly useful.
Source: Hanrick Curran data analysis training file
6. 6experience. new thinking
Today’s agenda …
1. Benford’s test
2. Efficiency benefits from using data analysis (using the
CaseWare sampling workpaper)
3. The basics for all data analysis work
4. Basic data analysis in Excel™ using pivot tables and
vlookup
5. Demonstrate basic use of TeamMate™
6. Using stratification for population analysis
7. The importance of the graphical display of information
8. Software options
9. Examine hurdles to adoption
7. 7experience. new thinkingexperience. new thinking 7
This isn’t new for auditors, this is what we do.
A brief history of data analysis
by auditors
8. 8experience. new thinking
Auditors and data analysis
“Auditors have been
doing ‘data analysis’
since before that term
was cool.
We just called it
‘Computer Assisted
Audit Techniques’
(“CAAT”).
9. 9experience. new thinking
Data mining and big data
Google data mining and you get 97,500,000 results
Google big data and you get 774,000,000 results
The explosion of data and its use is not going away
Auditors need to respond to the massive increase in data that
clients can make available
10. 10experience. new thinking
A history in data analysis
Proof of data analysis
work in our profession
dates back to at least
the 1979 book about
CAATs, issued by the
AICPA.
AICPA – American Institute of Certified
Public Accountants
11. 11experience. new thinking
In 1999 …
“We were moving
300MB transactional
data files using reel-to-
reel computer tapes
and our IT team had to
get involved in moving
the files around.
12. 12experience. new thinking
In 1999 …
Excel 97 had a capacity of 65,000 rows, so
Excel wasn’t the choice for most of our data
analysis.
13. 13experience. new thinkingexperience. new thinking 13
A statistical test for identifying potential
fraudulent and anomalous transactions in large
data sets.
Trivia question …
Name the 2016 movie where Benford’s Law was
a central element of the plot.
Benford’s Test
14. 14experience. new thinking
The history of Benford’s Law
The story of Benford's Law begins in 1881, when astronomer Simon Newcomb noticed that the page
numbers in a book of logarithm tables were worn (or smeared) more toward the front of the book and
progressively less worn toward the end of the book. Where others would simply dismiss the worn page
numbers, Newcomb recognized a distinct pattern related to the occurrence of lower versus higher
numbers. He published an article explaining his observations and postulated that the probability of a
single number n being the first digit of a number was equal to log(n+1) − log(n). Fifty-seven years
later, in 1938, physicist Frank Benford tested Newcomb's hypothesis against 20 sets of data and
published a scholarly paper verifying the law. Despite Newcomb's groundwork, Benford has garnered
much of the credit for the discovery now commonly referred to as Benford's Law.
The application of Benford's Law to spot signs of accounting fraud grew out of an article published in
1972 by economist Hal Varian, who wrote that Benford's Law might be used to detect the possibility of
fraud in socioeconomic data submitted in support of various public planning decisions. Varian's general
idea was that a simple comparison of first-digit frequency distributions ought to reveal anomalous
results (if any), per Benford's Law. In 1999, a JofA article ("I've Got Your Number," May 1999), written
by Mark J. Nigrini introduced how forensic accountants and auditors could apply Benford's Law to search
for indicators of potential accounting and expenses fraud.
Benford attempted to explain his law by saying that "it's easier to own one acre than nine acres,"
implying (perhaps) that when people purchase land, it is reasonable to assume that more people
purchase one acre as a starting point, rather than nine acres as their starting point.
Source: https://www.journalofaccountancy.com/issues/2017/apr/excel-and-benfords-law-to-detect-fraud.html
16. 16experience. new thinking
Benford's Test
Benford’s test can
be run on all data
sets to identify
anomalous
transactions that
may warrant
further
investigation
Source: HC Audit File – expenses testing
17. 17experience. new thinking
Investigating anomalies
We investigate anomalies, using the extraction ability inside TeamMate/Pivot tables.
Anomalous populations are extracted and analysed for further consideration.
Source: HC Audit File – expenses testing
19. 19experience. new thinking
Benford’s Test
Emailed client a selection of the data, with a
request to review and confirm whether further
investigation is required.
24. 24experience. new thinkingexperience. new thinking 24
Trivia question …
Name the 2016 movie where Benford’s Law was
a central element of the plot.
Movie question
28. 28experience. new thinkingexperience. new thinking 28
Understanding the requirements of ASA 500 and
how Data Analysis can help produce a more
efficient and effective audit. This is to help you
understand why data analysis is worth including
in your audit planning.
Audit Evidence
29. 29experience. new thinking
ISA 500 Audit Evidence
4. The objective of the auditor is to design
and perform audit procedures in such a way
as to enable the auditor to obtain sufficient
appropriate audit evidence to be able to draw
reasonable conclusions on which to base the
auditor’s opinion.
30. 30experience. new thinking
ISA 500 Audit Evidence
A52. … The means available to the auditor for
selecting items for testing are:
(a) selecting all items (100% testing)
(b) selecting specific items
(c) audit sampling
Testing
All items
Specific items
Sampling
31. 31experience. new thinking
ISA 500 Audit Evidence
When testing specific items, the question
that arises is “what evidence do you
have for the remainder of the
population?”
Data analysis can help you answer that
question.
34. 34experience. new thinking
Using a confidence factor
Sampling can be completed by
using a confidence factor to help
set the sample size. Confidence
factors are described in the CA
Australian Audit Manual (p.497)
35. 35experience. new thinking
Determine a selection interval
A selection interval (j) is determined using the
following formula:
𝑚𝑎𝑡𝑒𝑟𝑖𝑎𝑙𝑖𝑡𝑦
𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑓𝑎𝑐𝑡𝑜𝑟
= 𝑗
Also expressed as:
𝑚𝑝
𝑟
= 𝑗
36. 36experience. new thinking
Determine a sample size
A sample size is determined using the
following formula:
𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛
𝑗
= 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒
An example:
Population = $177,203
Materiality = $15,000
Confidence factor = 3
Sample interval = $15,000/3 = $5,000
Sample size = $177,203/$5,000 = 35.4 (roundup to 36)
42. 42experience. new thinkingexperience. new thinking 42
An understanding of the basics of performing
data analysis and computer assisted audit
techniques (“CAATs”) is a core part of any
auditors toolkit.
Basics
45. 45experience. new thinking
Terminology
Workbook – an excel file with multiple sheets
Worksheet – an individual sheet in an excel file
Row (record) – a row of data on a worksheet
Column (field) – a column of data on a worksheet
Table – a part of a worksheet containing data
Array – a type of data table
Control totals – details of the file to be imported that
provide certainty that all data has been received / imported
46. 46experience. new thinking
Audit benefits
Data analysis adds to the evidence that we
obtain during our audit procedures by
enabling us to look at data for the entire
population.
This analysis provides additional evidence
regarding our audit that is supplemental to
our detailed substantive testing.
47. 47experience. new thinking
Getting data from the client
Plan your request to the client
Set out your needs clearly (email / letter)
Allow plenty of time
Include a diary reminder (Outlook is great
for these)
48. 48experience. new thinking
Control totals
When requesting data from clients, have them
confirm control totals
Control totals are usually details such as the
number of rows / records and the value of a
control field (e.g., total payments)
Use the control total to confirm that all records
have been correctly imported and that the
value of transactions imported agrees with the
information provided by the client
49. 49experience. new thinking
Protecting evidence
When analysing data in excel, be careful to
maintain the integrity of the information
supplied by the client
Work on a copy of the client’s worksheet
Keep the original client datafile separate
from the working file
Do your backups
51. 51experience. new thinking
Audit documentation (ii)
You need to retain sufficient evidence in
your audit workpapers to meet the
standards
This does not mean you have to keep the
full client records (e.g., payments register)
52. 52experience. new thinking
Pro tip …
“Junior staff will typically keep unnecessary
data, both in the audit file and in their own
systems (i.e., c:drive and email). Consider
your Firm’s systems and processes to get
them to clean out this data.
53. 53experience. new thinking
CaseWare “do not retain” button
An easy way to manage
document retention is
using the “Retain on
Cleanup” checkbox in
CaseWare.
Unchecking the “Retain
on Cleanup” box means
that the documents can
be automatically
excluded as part of the
file lockdown procedures.
Source: Hanrick Curran audit file
56. 56experience. new thinking
Sort, Filter & Subtotal
Much can be
achieved with simple
functions of sort,
filter and subtotals
Source: Hanrick Curran data analysis training file
57. 57experience. new thinking
Filters
Various number filters
offer a great starting
point for basic data
analysis in Excel™.
Source: Hanrick Curran data analysis training file
58. 58experience. new thinking
Pivot tables
Pivot tables provide
an even more
powerful way to
analyse data in
Excel™.
Create pivot tables
from the INSERT tab
on the Excel Ribbon.
Source: Hanrick Curran data analysis training file
59. 59experience. new thinking
Pivot tables
When creating the pivot table, ensure that all
of the cells in your data table are selected.
This is a good opportunity to check your
control totals.
60. 60experience. new thinking
A pivot table example
We can use a pivot table to summarise a transaction
register.
In this example, we can summarise the payment register to
obtain insight into the types of transactions recorded.
Source: Hanrick Curran data analysis training file
61. 61experience. new thinking
A pivot table example
In this
example, we
can see the
types of
transactions
being
recorded in
the register,
by supplier.
Source: Hanrick Curran data analysis training file
62. 62experience. new thinking
A pivot table example
This can be done by
dragging and dropping
fields from the list of
fields into the four
boxes at the bottom,
which then results in
automatically sorting
and summarising the
list we saw on the
previous slide.
Source: Hanrick Curran data analysis training file
63. 63experience. new thinking
Vlookup – to join data tables
Vlookup can be used to join data tables
Key is to have a common data field (“key”)
that is in both tables
64. 64experience. new thinking
Vlookup example
To cross check employee bank accounts
against supplier bank accounts, start with the
employee bank accounts.
Source: Hanrick Curran data analysis training file
65. 65experience. new thinking
Vlookup example
Using our payments register from earlier, let’s
look at how we would do a vlookup.
Source: Hanrick Curran data analysis training file
66. 66experience. new thinking
Vlookup example
With the payables register, we have to tweak
the layout to move the bank account to the
beginning of the data table.
Source: Hanrick Curran data analysis training file
67. 67experience. new thinking
Vlookup example
Excel provides great
support around the
function, with this function
wizard.
The data field you are
looking up to, must be the
first field (column) in the
target data set.
Don’t forget to use $A$1
references when linking to
the “table array” (i.e., your
data table).
Source: Hanrick Curran data analysis training file
69. 69experience. new thinking
Vlookup
This shows us that Mr Alfred ‘Ginger’
Cotton, from our payroll data file, has
the same bank account as supplier
MAR004 in our payables transaction
register.
A matter for further audit investigation.
Source: Hanrick Curran data analysis training file
70. 73experience. new thinking
TB editing
TB’s come in many formats, often with the
“-” or credit sign at the back of numbers.
71. 74experience. new thinking
TB editing – instructions for back-to-front
minus signs
Steps to fix the TB
Use TeamMate / ASAP-Utilities ? (see http://www.asap-utilities.com/)
or
Sort the TB on the balance numbers
In the next column over, copy the numbers into the new column, leave the
debit numbers alone
For the credit numbers, include a formula “= - (main number)”
On the main numbers, highlight the credit numbers and use CTRL + H to
access “find and replace”
Find the minus sign (-) and replace with nothing
For the new column of numbers (which should now add to zero), copy the
column and use paste values (ALT-H-V-V) paste the values into the sheet
Delete the column with the original numbers and import to CaseWare TB
module
73. 77experience. new thinking
Screen clippings
(documenting your work)
Use the screen clipping button from the
Quick Access Toolbar** to capture screen
shots for documenting audit testing
**
74. 78experience. new thinkingexperience. new thinking 78
TeamMate is the data analysis software
that we have selected for our auditors.
This software works as an add-in to
Excel and is used as the basis for our
data analysis during audit assignments.
We have other tools available, which
include IDEA and Spreadsheet Detective.
TeamMate
76. 80experience. new thinking
Append / join sheets
Join sheets - Joins two sheets together, based
on a common matching column, e.g. common
bank test, inventory for NPV testing.
Append sheets - Joins multiple sheets that all
have the same column structure together. For
example, where you have separate reports for
each sales person, each warehouse, etc. that
you wish to combine into one report.
77. 81experience. new thinking
Manipulate fields
A tool inside TeamMate that can be used for
data manipulation
Can be used for different analysis and is
worth exploring
Highlights :
Normalize fields – tool to standardize fields
Debit and credit columns – splits/combines Dr and Cr columns
Merge wrap and autofit – combines those functions from Excel into
one
78. 82experience. new thinking
Stratify
Used to obtain information regarding the
characteristics of a data set
Should be run on all files as a preliminary
activity with the objective of understanding
the nature of the population under
investigation
79. 83experience. new thinking
Extract (duplicates / gaps / sample)
Can be used to extract relevant data from a
population (e.g., duplicates, gaps or samples)
Gaps extraction - Identifies for gaps in a sequence
of numbers or partly numeric references, such as
missing invoices or journals.
Duplicates extraction - Extracts duplicate records,
based on up to 3 fields you specify.
Can be used to isolate specific
transactions/records for further investigation
80. 84experience. new thinking
Outliers
Used to extract records from a data set that
are at the edge of the normal distribution
Can be used to extract records more than X
times the average or standard deviation
81. 85experience. new thinking
Sampling
Sampling is used to extract records for
further analysis as part of detailed
substantive testing
Various types of sampling can be
undertaken (refer additional slides)
82. 86experience. new thinking
Sampling - MUS
Our preferred method for sample selection
is the use of a MUS (Monetary Unit
Sample).
A MUS sample is preferred because of the
effectiveness of the audit testing that
results and the efficiency of the sample
selection (e.g., all ‘high value’ items will be
selected).
83. 88experience. new thinking
Random sample - Extracts a number of randomly
sampled items from your population.
Systematic sample - Extracts every ‘nth’ item from
the population.
Stratified sample – Stratifies data and select a
random sample from each strata band
Other types of sampling
84. 90experience. new thinking
If you want to explore more, visit video
demonstrations website:
http://www.teammatesolutions.com/data-analytics.aspx
Or search at: www.youtube.com
85. 91experience. new thinkingexperience. new thinking 91
Fantastic tool used to understand the data set
and audit population. Best performed using
some type of audit / data analysis software.
Stratification is especially useful for identifying
significant and unusual items.
Stratification
86. 92experience. new thinking
Stratify
Used to obtain information regarding the
characteristics of a data set
Should be run on all files as a preliminary
activity with the objective of understanding
the nature of the population under
investigation
87. 93experience. new thinking
Section audit approach
Our standard
approach to auditing
a section includes
data analysis.
Our team are
expected to run
stratification and
Benford’s tests on all
large data files (e.g.,
revenue and
expense transaction
files.
Leadsheet
Programs
Reconciliations
{PBC}
Analytical
review
Data analysis
Substantive
Testing
Self-review
88. 94experience. new thinking
Stratify
Example of revenue stratification from small
client with significant related entities
Source: Hanrick Curran audit file
94. 100experience. new thinkingexperience. new thinking 100
Graphing outputs of spreadsheets is important.
Some examples of why follow…
A segue into the graphic display
of information
95. 101experience. new thinking
Anscombe’s Quartet
Four data sets with
similar characteristics
X average = 9.0
Y average = 7.5
X sum = 99.0
Y sum = 82.5
x y x y x y x y
10.00 8.04 10.00 9.14 10.00 7.46 8.00 6.58
8.00 6.95 8.00 8.14 8.00 6.77 8.00 5.76
13.00 7.58 13.00 8.74 13.00 12.74 8.00 7.71
9.00 8.81 9.00 8.77 9.00 7.11 8.00 8.84
11.00 8.33 11.00 9.26 11.00 7.81 8.00 8.47
14.00 9.96 14.00 8.10 14.00 8.84 8.00 7.04
6.00 7.24 6.00 6.13 6.00 6.08 8.00 5.25
4.00 4.26 4.00 3.10 4.00 5.39 19.00 12.50
12.00 10.84 12.00 9.13 12.00 8.15 8.00 5.56
7.00 4.82 7.00 7.26 7.00 6.42 8.00 7.91
5.00 5.68 5.00 4.74 5.00 5.73 8.00 6.89
sum
99.0 82.5 99.0 82.5 99.0 82.5 99.0 82.5
Average
9.0 7.5 9.0 7.5 9.0 7.5 9.0 7.5
I II III IV
Source: Wikipedia
96. 102experience. new thinking
Anscombe’s Quartet
-
2.00
4.00
6.00
8.00
10.00
12.00
- 5.00 10.00 15.00
Series I
-
2.00
4.00
6.00
8.00
10.00
- 5.00 10.00 15.00
Series II
-
5.00
10.00
15.00
- 5.00 10.00 15.00
Series III
-
5.00
10.00
15.00
- 5.00 10.00 15.00 20.00
Series IV
97. 103experience. new thinking
A common audit test
A common audit test is to graph revenue, looking for spikes,
trends and seasonality.
In these examples, two audit clients, displaying seasonality in
accordance with underlying business model.
One factor we look for is a year-end spike.
$-
$100,000
$200,000
$300,000
July
August
September
October
November
December
January
February
March
April
May
June
Letting fees
$-
$50,000
$100,000
$150,000
$200,000
July
August
Septem…
October
November
December
January
February
March
April
May
June
Management Fees
Source: Hanrick Curran audit file
98. 104experience. new thinking
Discontinuity
Other common issues
include discontinuities
such as spikes, slope
changes and steps.
Graphing outputs can
also help with
identifying spikes from
data entry or formula
errors.
Source: F1F9, 31 day on-line learning
99. 105experience. new thinking
Stephen Few, Perceptual Edge
Stephen Few’s work on visual communication is well worth
investigating as part of developing your team’s use of excel.
Typically a board paper might include a table such as exhibit A.
The problem with this is that the data does not provide the
reader with any insight into the data.
Using Excel’s graphs, providing a visual presentation of the
graph allows insights (see next slide).
Sales ($'000) Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
Domestic 1,893 2,343 2,593 2,283 2,574 2,838 2,382 2,634 2,938 2,739 2,983 3,493
International 574 636 673 593 644 679 593 139 599 583 602 690
2,467 2,979 3,266 2,876 3,218 3,517 2,975 2,773 3,537 3,322 3,585 4,183
Exhibit A: Sales data table
Source: Stephen Few “Visual Communication” IBM Whitepaper, April 2009 (p. 2)
100. 106experience. new thinking
Stephen Few, Perceptual Edge
From the data at right for a typical sales graph, we can
observe:
Domesitc sales trend upwards across the year
International sales are relatively flat across the year
An exception in international sales is noted in August
There is a cyclical pattern in domestic sales, being
lowest in the first month of the quarter and then
growing through the quarter
From the graph, we might infer:
Sales staff may be going light in the first month of
the quarter and start working harder as the quarter
progresses in order to meet their quarterly targets.
Perhaps there is an element of ‘channel stuffing’
happening at the end of the quarter.
Why the year-end spike?
Using Excel’s full potential enables this analysis.
-
500
1,000
1,500
2,000
2,500
3,000
3,500
4,000
Jan Mar May Jul Sep Nov
Domestic InternationalSource: Stephen Few “Visual Communication” IBM Whitepaper, April 2009 (p. 2)
102. 108experience. new thinking
What is IDEA
IDEA is a data analysis application (tool)
IDEA can be run on a laptop (microcomputer) or be loaded onto a
clients network/servers/mainframe computers
(subject to licensing)
IDEA is designed to analyse data sets (i.e., rows and columns of
data)
103. 109experience. new thinking
Where can you use IDEA
If IDEA is loaded to a laptop, you can work from the clients
premises.
The advantage of this is that as issues are identified, they can be
discussed with the client.
More commonly, the analysis is performed as part of the audit
field work.
104. 110experience. new thinking
What hardware is needed for IDEA
Generally any reasonably modern laptop should be able to handle
IDEA
The better the machine, the better the analysis performance
When I as a junior auditor, sometimes I would set up tests and
then let them run overnight . . . (but I think computer
performance has improved since then)
105. 111experience. new thinking
What does it cost $$$$$
Not sure . . .
In my experience, the biggest costs are:
training and time to use
108. 114experience. new thinking
114
j
IDEA allows you to import
and export data from and
into a multitude of formats,
including files originating
from large mainframe
computers and accounting
software.
109. 115experience. new thinking
115
• IDEA protects the source data by allowing read-only access to the
client's data to avoid any unwanted changes, and maintain data
integrity.
• IDEA allows you to import and export data into a multitude of formats,
including files originating from large mainframe computers and
accounting software.
• IDEA allows you to easily manage your files and results and showing the
source of your results
• IDEA can read and process millions of records in seconds. There is no
limit to the number of records that IDEA can process.
110. 116experience. new thinking
116
IDEA creates a record of all changes made to a file
(database) and maintains an audit trail or log of all
operations carried out on a database, including the
import and each audit test.
112. 118experience. new thinking
118
Testing entering data controls
Data dictionary errors (undefined codes)
System weaknesses (Code 99, Free style fields)
113. 119experience. new thinking
119
Audit Types:
Quantitive analysis
Testing efficiency and effectiveness (e.g.,
delays)
Identify specific transactions and trends
(e.g., fraud)
Testing controls (e.g., electronic signature)
Compliance with regulation or policies
Analytical reviews (e.g., salary)
Matching information between different files
114. 120experience. new thinking
Fraud investigations
Purchasing, payroll, money
laundering
Financial and other analyses
Customer and other sales
profiling
Performance measures
(Delays)
Inventory – movement,
pricing
Security reviews
Network, phone, firewall logs
Operations audits
Policy compliance
Value for Money
System audits
Reliability of the data
System weakness (code 99,
free style)
115. 121experience. new thinking
Fraud investigations
Payroll
Ghost employees
Changes to salary
Changes to benefits
Overtime payments
Purchasing
Favorable treatment of vendors
Transactions at or near spending authorities
Large purchases over several invoices
Phantom goods and phantom vendors
Money Laundering
Applying Benford’s Law
Funds from various accounts
Accounts with a large average value of
transactions.
Multiple accounts for particular individuals
117. 123experience. new thinking
IMHO the biggest are:
People’s skill and experience in working
with data
Time required to complete analysis vs
audit budgets
118. 124experience. new thinking
Pro tip …
“ To overcome these problems, you need to
be prepared to take a risk, try something
different and make some mistakes. A culture
inside an audit firm that doesn’t allow people
to ‘fail’ won’t see a rapid, widespread
adoption of data analysis tools.
119. 125experience. new thinking
Other roadblocks
Data access – client consent required and reluctance to give
access to systems
Data access – requirements of privacy legislation
Data access – format of data
(but has gotten better over the years)
A lack of time to properly plan the procedures
120. 126experience. new thinking
Other hurdles
Lots of data laying around = litigation risk
Teams need good discipline in cleaning up client data files
We suggest trying to limit “CYA” files kept by junior staff
121. 127experience. new thinkingexperience. new thinking 127
Some of the benefits of using data analysis
include an improvement in audit effectiveness
and efficiency.
When to use data analysis
122. 128experience. new thinking
Data analysis enables testing
Can test the entire population
Limits sampling risk
Still need to pay attention to accuracy and completeness of source
data
One of the benefits of IDEA vs Excel is that IDEA doesn’t change the
source data
IDEA also tracks the work performed with an audit trail
But, Excel has a significantly lower entry cost than IDEA.
123. 129experience. new thinking
KEY areas to use data analysis
Sampling (i.e., CMA / DUS / MUS )
Stratification
Extraction of unusual items
Benford’s test
124. 130experience. new thinking
Client size for data analysis
Audit fee < $10,000 = probably not worth it, unless you are
looking for something or want to perform a simple sample
selection
Audit fee > $30,000 = starts to be useful for expenditure and
payroll testing for selecting samples for testing
125. 131experience. new thinking
Client industries for data analysis
Generally good for industries with large transaction volumes
where transaction amounts are small dollar value
Useful for financial services and retail clients
Useful with services and manufacturing clients
126. 132experience. new thinking
Planning is key!
When you are going to use data analysis, plan for it
Discuss it with the client and get their buy-in
Identify what tests will be performed with CAATs / data analysis
Identify what data will be needed,
in what format,
at what date it is required, and
who will provide it
127. 133experience. new thinking
What do clients think?
We don’t usually go through the testing approach with them, but
they love it when we have data that they don’t.
It is especially useful to have the data to back-up management
letter comments (e.g., “56% of receivables population is outside
your determined credit limits”).
130. 137experience. new thinking
Pro tip …
“Junior staff will typically keep unnecessary
data, both in the audit file and in their own
systems (i.e., c:drive and email). Consider
your Firm’s systems and processes to get
them to clean out this data.
131. 138experience. new thinking
Pro tip …
“ With data analaysis:
Plan ahead
Track control totals
Protect evidence
Document as you go
132. 139experience. new thinking
Pro tip …
“ To overcome these problems, you need to
be prepared to take a risk, try something
different and make some mistakes. A culture
inside an audit firm that doesn’t allow people
to ‘fail’ won’t see a rapid, widespread
adoption of data analysis tools.
135. 142experience. new thinking
More resources
www.f1f9.com
“Well worth trying their 31 day free online course for a brush-up on your excel skills.”
www.spreadsheetdetective.com
“Use the tools we use, to understand and self-audit your model.”
www.asap-utilities.com
“Great tools for every excel user. If you ever work with data, you need these tools.”
www.perceptualedge.com
“For enlightening analysis and communication.”
137. 144experience. new thinking
Presenter details
Source: LinkedIn
Matthew has been running
computer assisted audit
techniques to perform data
analysis since before the turn of
the century and has prepared and
conducted instruction courses for
auditors in the use of IDEA, ACL
and Excel.
139. 146experience. new thinkingexperience. new thinking 146
Summary slides for key requirements from
Australian Auditing Standards
Appendix – ASA requirements
140. 147experience. new thinking
ISA 500 Audit Evidence
4. The objective of the auditor is to design
and perform audit procedures in such a way
as to enable the auditor to obtain sufficient
appropriate audit evidence to be able to draw
reasonable conclusions on which to base the
auditor’s opinion.
141. 148experience. new thinking
ISA 500 Audit Evidence
5(b). Appropriateness means the measure of the
quality of audit evidence; that is, its relevance and its
reliability in providing support for the conclusions on
which the auditor’s opinion is based.
5(e). Sufficiency means the measure of the quantity
of audit evidence. The quantity of the audit evidence
is affected by the auditor’s assessment of the risks of
material misstatement and also by the quality of such
audit evidence.
142. 149experience. new thinking
ISA 500 Audit Evidence
9. When using information produced by the entity, the
auditor shall evaluate whether the information is
sufficiently reliable for the auditor’s purposes,
including as necessary in the circumstances:
(a) Obtaining audit evidence about the accuracy and
completeness of the information; and
(b) Evaluating whether the information is sufficiently
precise and detailed for the auditor’s purposes.
“ASIC’s Top 10 issues for auditors includes a failure to apply
professional scepticism and document work.
143. 150experience. new thinking
ISA 500 Audit Evidence
11. if:
(a) audit evidence obtained from one source is
inconsistent with that obtained from another;
or
(b) the auditor has doubts over the reliability of
information to be used as audit evidence,
the auditor shall determine what modifications or
additional to audit procedures are necessary to
resolve the matter, and shall consider the effect
of the matter, if any, on other aspects of the
audit.
144. 151experience. new thinking
ISA 500 Audit Evidence
A2. … procedures to obtain audit evidence can
include:
Inspection
Observation
Confirmation
Re-calculation
Re-performance
Analytical procedures
Enquiry
145. 152experience. new thinking
ISA 530 Audit Sampling
5(a). Audit sampling means the application of audit
procedures to less than 100% of items within a
population of audit relevance such that all sampling
units have a chance of selection in order to provide
the auditor with a reasonable basis on which to draw
conclusions about the entire population.
5(b). Population means the entire set of data from
which a sample is selected and about which the
auditor wishes to draw conclusions.
146. 153experience. new thinking
ISA 530 Audit Sampling
5(c). Sampling risk means the risk that the
auditor’s conclusions based on a sample may be
different from the conclusion if the entire
population were subjected to the same audit
procedure.
5(d). Non-sampling risk means the risk that the
auditor reaches an erroneous conclusion for any
reason not related to sampling risk.
147. 154experience. new thinking
ISA 530 Audit Sampling
5(e). Anomaly means a misstatement or
deviation that is demonstrably not
representative of misstatements or deviations
in a population.
148. 155experience. new thinking
ISA 530 Audit Sampling
5(g). Statistical sampling means an approach to
sampling that has the following characteristics:
(i)Random selection of sample items; and
(ii)The use of probability theory to evaluate
sample results, including measurement of
sampling risks.
A sampling approach that does not have
characteristics (i) and (ii) is considered non-
statistical sampling.
149. 156experience. new thinking
ISA 530 Audit Sampling
5(i). Tolerable misstatement means a monetary
amount set by the auditor in respect of which the
auditor seeks to obtain an appropriate level of
assurance that the monetary amount set by the
auditor is not exceeded by the actual misstatement in
the population.
“Typically this would be the amount
determined as materiality, specifically the
performance materiality, or a lower amount.
150. 157experience. new thinking
ISA 530 Audit Sampling
12. The auditor shall investigate the nature and cause of any
deviations or misstatements identified, and evaluate their
possible effect on the purpose of the audit procedure and on
other areas of the audit
13. In the extremely rare circumstances when the auditor
considers a misstatement or deviation discovered in a sample
to be an anomaly, the auditor shall obtain a high degree of
certainty that such misstatement or deviation is not
representative of the population. The auditor shall obtain this
degree of certainty by performing additional audit procedures
to obtain sufficient appropriate evidence that the misstatement
or deviation does not affect the remainder of the population.
151. 158experience. new thinking
Sample selection methods
(ASA 530.App 4)
(a) Random selection (applied through random number
generators or random number tables)
152. 159experience. new thinking
Sample selection methods
(ASA 530.App 4)
(b) Systematic selection, in which the number of sampling
units in the population is divided by the sample size to give a
sampling interval, for example 50, and having determined a
starting point within the first 50, each 50th sampling unit
thereafter is selected. Although the starting point may be
determined haphazardly, the sample is more likely to be truly
random if it is determined by use of a computerised random
number generator or random number tables. When using
systematic selection, the auditor would need to determine
that sampling units within the population are not structured
in such a way that the sampling interval corresponds with a
particular pattern in the population.
153. 160experience. new thinking
Sample selection methods
(ASA 530.App 4)
(c) Monetary Unit Sampling is a type of value-weighted
selection (as described in Appendix 1) in which sample size,
selection and evaluation results in a conclusion in monetary
amounts.
“Also known as Dollar Unit Sampling (DUS) or Constant Monetary
Amount (CMA) sampling.
154. 161experience. new thinking
Sample selection methods
(ASA 530.App 4)
(d) Haphazard selection, in which the auditor selects the
sample without following a structured technique. Although no
structured technique is used, the auditor would nonetheless avoid
any conscious bias or predictability (for example, avoiding difficult to
locate items, or always choosing or avoiding the first or last entries
on a page) and thus attempt to ensure that all items in the
population have a chance of selection. Haphazard selection is not
appropriate when using statistical sampling.
155. 162experience. new thinking
Sample selection methods
(ASA 530.App 4)
(e) Block selection involves selection of a block(s) of
contiguous items from within the population. Block selection cannot
ordinarily be used in audit sampling because most populations are
structured such that items in a sequence can be expected to have
similar characteristics to each other, but different characteristics from
items elsewhere in the population. Although in some circumstances it
may be an appropriate audit procedure to examine a block of items,
it would rarely be an appropriate sample selection technique when
the auditor intends to draw valid inferences about the entire
population based on the sample.
“Good morning and welcome to our sessions this morning on data analysis.
“Our session this morning will examine various issues associated with data analysis and provide an overview of the types of tests that can be performed using basic Excel functions and more advanced software, in this case, TeamMate and ASAP Utilities.
“We will also examine some of the audit implications that arise from using data analytics.
Mr Robert McNamara, US Secretary of Defence, an MBA graduate of the Harvard Business School and alumnus of Price Waterhouse. When the US joined WWII, Mr McNamara jointed the Air Corp, serving with the Office of Statistical Control, with one of his major responsibilities being the analysis of US bomber efficiency and effectiveness.
“Today we are going to cover:
“With only 65,000 rows in 1999, we had to use other tools for the data analysis. This meant learning about ACL and Idea.
Benford’s test is a test we run, looking for anomalous transactions.
“We run Benford’s tests on all large data sets that we audit. Our objective is to identify anomalous transactions and potentially highlight examples of fraud.
“This example is from a client file for FY17, in this case showing issues identified in the 6 bracket. This test was run on a population with 2,890 observations/transactions.
In this example for an important and distribution company, we determined to investigate payments commencing with a value of either 8 or 9. These were selected out of 2,400 payment transactions.
“This Benford’s analysis shows how anomalies arise. In this case, we investigated the spikes and noted that they arise from the ex-GST amount for particular price points, for example at $90 for $99.
“Not an issue from an audit perspective, but good to know that we are not identifying material instances of fraud or error.
“This test was run on the first two digits in the population generated from sales for a large independent retailer with over $50 million in sales.
Anna Kendrick and Ben Afflick running the Benfords analysis manually. Who knew auditing could be such fun.
“Can you guess the connection between this fairly average 2016 Ben Affleck movie and auditing?
“Bonus points if you can name the type of rifle he is holding.
“Before we get into the data analysis, we need to talk about audit evidence, the reason we need to talk about this is so that you can understand why data analysis will be beneficial to your audit work.
** highlight, sampling is only one type of way of selecting items for testing.
“When we look at the standard CaseWare sampling workpapers, data analysis can be used to complete the “Other Substantive Tests” part of the workpaper and this can lead to a reduction in the final sample sizes required for auditing.
“CAATs are a basic part of the auditors toolkit and have been since microcomputers were being used in the early 1990s. One of the first books on the subject that I read was published by the Canadian Institute of Chartered Accountants in 1994, this isn’t a new topic and is one that chartered accounts have been interested in for over 20 years.
“Some of the terminology that we will use during this presentation is outlined here.
“Precision in the use of terminology is encouraged as it assists in clear communication between team members. We need to be clear on workbook, worksheet, rows / records and columns / fields.
“The benefits of using data analysis techniques in audit assignments is that the entire population can be assessed by the auditor for evidence of material misstatement.
“Data analysis also provides an opportunity for the auditor to improve audit coverage while at the same time gathering evidence by alternative methods that would allow a reduction in detailed substantive testing, meaning less sampling of invoices.
“Big data may be a recent buzzword but data analysis has been part of the auditors toolkit for over 20 years. Today’s tools are more advanced and easier to use, meaning that we see greater benefits from this approach today and it can be more widely utilised than in the mid ‘90s.
“Our tips for obtaining data from the client includes: planning ahead, being proactive and being clear about what you are asking for. Clarifying and following-up with requests are also part of what the auditor can do to make life easy for everyone.
“A key part of any data analysis program is the use of control totals.
“Control totals are used to ensure that the data we are supplied is appropriate, complete and that it is received, imported and prepared for analysis correctly. There is no point doing the analysis on a population that is missing a chunk of information.
“As an example, we recently had a project where a client supplied us with an invoice list that totalled $20 million whereas their reported revenue was only $14 million. Investigation identified that the extra $6 million turned out to be instances of where the company had allowed discounts to standard pricing, that had been included in the invoice listing but which were subsequently deducted for the purpose of determining reported revenue.
“We have some pointers for performing data analysis in Excel. Some of the more advanced software systems protect the initial information so that these issues are less prevalent.
“One of the most important tips for protecting the evidence is to keep the original data supplied by the client in a safe place and to work on a copy of that data.
“When we look at what needs to be retained, we draw on the details in ASA 230 Audit Documentation.
“Audit documentation needs to retain sufficient information to enable an assessment of the audit test, but does not require that we retain all of the information we are supplied.“Keeping all of the information that we are provided by the client can lead to significant amounts of data in our systems that may not need to be kept and which adds to the costs of maintaining out backup data in our systems.
“Excel is a very powerful tool for performing data analysis.
“Some basic features in Excel are found in the DATA tab and include Filter functions.
“Pivot tables are a powerful data analysis tool that is available from the INSERT tab.
“In Excel, the vlookup function can be used to join data from different sheets.
Absolute references - $$$$
“We often find excel workbooks with links between them. We recommend that links between workbooks be avoided if possible. They can cause lots of issue later and can be difficult to manage.
“My tip on links between workbooks is to collect all links onto a single sheet.
{show example in excel file}
“We often find excel workbooks with links between them. We recommend that links between workbooks be avoided if possible. They can cause lots of issue later and can be difficult to manage.
“My tip on links between workbooks is to collect all links onto a single sheet.
{show example in excel file}
“We have a short example here of the tools available in ASAP-Utilities that can be used to correct data formats.
Provides tools to use to lookup supplier details against the ABN register.
Ability to lookup against database.
Apologies but as I don’t work in NZ, does anyone have any ideas as to how this can be automated?
“If you haven’t used screen clippings before, I am sure that you will love this function and it is available in all Office applications. It can be inserted into the quick access toolbar to make life much easier in using each of the applications (very useful in Outlook, Word and Excel).
“TeamMate is commercially available software that we use with our audit team to analyse large client data sets.
“TeamMate and ASAP can be used to join multiple sheets into a single sheet.
“TeamMate and ASAP can be used to ‘normalise’ data in excel to make it useable.
“TeamMate includes the stratify function.
“Excel, TeamMate and ASAP can be used to identify duplicates in payments. Lets have a look at an example.
{Activity, show how to identify duplicates in example file}
“Using the TeamMate software, we are able to select and extract representative samples that can be used for the audit file.
“TeamMate enables us to apply MUS (or Monetary Unit Sampling) to populations to achieve an efficient sampling approach.
“Other types of sampling can also be supported by TeamMate and is another main use in our audit practice.
“More information on TeamMate can be found here or through a standard search engine.
“TeamMate includes the stratify function.
“This example from an entity with a small revenue base and a related party that receives many of its services resulted in a small number of high value invoices. This can be contrasted with out next example.
“This stratification is from a large retail client, and we can see that over 150,000 transactions were processed for less than $250 each.
Speaker Notes:
Why Use IDEA – well, IDEA has the ability to practically import data from any source:
- Text files (flat files generated from ERP systems)
- Microsoft products such as Access and Excel
- IDEA can also import data directly from SAP using SmartExporter (if interested, provide additional information), or by using an ODBC connection with Oracle, SQL, JD Edwards etc…
- If you have reports in native PDF formats, IDEA has the capability to import this information as well.
Speaker Notes:
If we use a spreadsheet program for the analysis, we might override a cell without our knowledge.
However, IDEA protects the source data by allowing read-only access to the client's data to avoid any unwanted changes, and maintain data integrity.
IDEA creates a record of all changes made to a file (database) and maintains an audit trail or log of all operations carried out on a database, including the import and each audit test.
IDEA performs the following functions, features that are useful for accountants, auditors, systems and financial professionals:
Compares, joins, appends and connects different files from different sources
Extracts specific transactions, identifies gaps (e.g., check number) or duplicates
Profiles data by summarizing, stratifying or aging the files
Creates useful field statistics automatically
Creates samples using several different sampling methods
IDEA allows you to import and export data from and into a multitude of formats, including files originating from large mainframe computers and accounting software.
It also allows you to easily manage your files and results, as well as view the source of your results
IDEA can read and process millions of records in seconds. There is no limit to the number of records that IDEA can process.
Speaker Notes:
IDEA creates a record of all changes made to a file and maintains an audit trail or log of all operations carried out on a database, including the import and each test preformed.
Often we deal with staff turnover and situations where transfer of knowledge was not performed. With Project Overview, you can view what file and format was imported, the type of analysis performed and the type of result created.
IDEA gives you the ability to modify or create processes without a single line of programming.
Speaker Notes:
Here is the IDEA Process:
1) As I previously mentioned, IDEA has the capability to import information from any source.
2) You will be able to perform all your analytic processes directly in the software, such as:
Extracting specific exceptions based on your business rules
Sorting the data
Searching for keywords in descriptions
Grouping Data by segments
Adding calculated fields (i.e. data tax, quantity multiplied by unit price, etc.)
Stratifying, summarizing or aging the data
Identifying gaps or duplicates in the data for potential errors or fraud
Using our various sampling methodologies
Using our advanced statistical methods such Time Series, Trend Analysis, and Correlation
Even joining, appending, and comparing different databases
IDEA gives you the ability to review 100% of your transactions
Greater coverage than sampling
Deeper coverage from automated testing
The IIA recommends the use of CAATs with the following statement: “Consider the use of computer-assisted audit tools and other data analysis technique”
3) You have the ability to review the results using pivot tables, custom reports, and charts. You can export the information into another format such as PDF or Excel if you wish. History and Project Overview will allow you to review the process in a table or graphical format.
Here is something new and exciting. You have the ability to automate the entire or part of the process with just a few clicks. This can be done by IDEAScript (programing side) or Visual Script (non-programing side). Visual Script is unique to IDEA. You have the ability to create and automate processes without a single line of programming.
Speaker Notes:
Prior to applying IDEA for a specific engagement, you should review the following:
Availability of the data:
Do you have access to the raw data required for the analysis?
What formats are available to export from the system?
Do you have the user credentials to create an ODBC connection (if applicable)?
Understanding the data:
Do you have access to the data dictionary?
What fields are required for the audit?
(i.e. SAP might have 92,000 tables but you only require 20 tables for the analysis)
Reliability of the data
Is the data you received reliable?
Has it changed hands multiple times?
You will want to extract the information as close as possible to the source. When importing data from reports, the user might have the ability to omit suspicious data relevant to the analysis
Reconciling the data
Do you require information from various systems?
Do you require information for various months?
IDEA has the ability to append multiple databases for global analysis. You can also use IDEA to join data from different systems as well
Speaker Notes:
Explain the types of audit you can employ IDEA for
Speaker Notes:
Explain where you can use IDEA and how it is used currently in the market.
Speaker Notes:
Explain each bullet
** highlight
“Too often we see that our staff can have a tendency to call things an anomaly when perhaps we should consider the further guidance.
“Tolerable Misstatement”, aka “Tolerable Error”
CAATs provide a unique ability to perform random sampling. As well as random sampling we have some other sampling methods set out in the coming slides.
“We like MUS or CMA sampling. It is robust in the application of our audit procedures and effective in the completion of our audit work.