De-Duplication
leaders in digital evidence
Quality ISO 9001
near de-duplication, email threading & text compare technology
de-duplication technology

The Exact Duplicate Problem
Research shows that 30% to 50% or more of the documents in electronic document collections are exact duplicates. If they are not identified and removed, duplicate documents significantly increase e.discovery processing costs and legal review time.

The Exact Duplicate Solution
Electronic documents have their own DNA and their own fingerprints. We can use this information to identify electronic documents that are exact duplicates of each other, significantly reducing the volume of documents that must be processed and reviewed.

The Near Duplicate Problem
It is estimated that in enterprise environments, 20% to 50% of all electronic information consists of near duplicates. Near-duplicate files are documents with minor differences, for example, contract versions that differ by a few words.

The Near Duplicate Solution
Near duplicate detection technology (NDD) can be used to detect files that have the same content but are in different formats, e.g., MS Word and PDF versions of the same document. NDD can also identify files that have the same content but different formatting. Near de-duplication creates order from chaos by grouping documents with similar content together and highlighting this to the user. Whilst exact de-duplication can result in the removal of up to 50% of duplicates in potentially discoverable electronic file repositories, near de-duplication can result in finding up to a further 50%. This means faster review and thereby greater time and cost savings. Less cost, less time and less risk!

WE CREATE ORDER FROM THE CHAOS

email threading technology

The Email Thread Problem
It is estimated that over 250 billion emails are sent and received each day worldwide. A large portion of those are replies to, or forwards of, other emails, creating an email thread. An email thread can contain multiple copies of earlier emails; however, the emails in a thread are not exact duplicates of one another, because each is unique in its own right.

The Email Thread Solution
In the context of litigation, being able to follow an email thread is very important. Email threading technology captures and reconstructs email conversations. By identifying the unique emails in a collection, the tool drastically reduces the number of emails that need to be reviewed. Email threading simplifies the review of emails, while allowing each email to be reviewed within its original context.
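The idea of reconstructing conversations can be illustrated with a minimal Python sketch. It is not a description of how any particular threading product works. It assumes each email carries a `message_id` and an optional `in_reply_to` value (as in standard email headers), and the `Email` class and both function names are hypothetical names chosen for this example. Emails are grouped by walking each reply chain back to its first message, and the emails that nothing else replies to are listed, on the assumption that a reply quotes the messages before it.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Email:
    # Hypothetical record type for this sketch only.
    message_id: str
    in_reply_to: Optional[str]  # None for the first email in a conversation
    subject: str


def build_threads(emails):
    """Group emails into threads: {root message_id: [message_ids in thread]}."""
    by_id = {e.message_id: e for e in emails}

    def root_of(email):
        # Walk up the reply chain until the parent is unknown or missing.
        seen = set()
        while email.in_reply_to in by_id and email.message_id not in seen:
            seen.add(email.message_id)
            email = by_id[email.in_reply_to]
        return email.message_id

    threads = defaultdict(list)
    for e in emails:
        threads[root_of(e)].append(e.message_id)
    return dict(threads)


def end_of_branch_emails(emails):
    """Return IDs of emails that no other email in the collection replies to."""
    replied_to = {e.in_reply_to for e in emails if e.in_reply_to}
    return [e.message_id for e in emails if e.message_id not in replied_to]


emails = [
    Email("m1", None, "Contract"),
    Email("m2", "m1", "RE: Contract"),
    Email("m3", "m2", "RE: Contract"),
    Email("m4", "m1", "FW: Contract"),
    Email("m5", None, "Lunch"),
]
print(build_threads(emails))
print(end_of_branch_emails(emails))
```

In this toy collection of five emails, only three sit at the end of a branch, which is the sense in which threading can reduce the number of emails a reviewer has to open. Real collections often lack reliable headers, so a production tool would need to rely on more than this.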
Visit elaw.com.au for more information Contact e.law: info@elaw.com.au
text compare technology
Equivio>Compare is a software application that highlights the textual differences between two documents and can compare documents of any two file types. e.law has integrated the Equivio>Compare technology into our near duplicate and email threading solutions for a unique document review experience.

Figures: Chaotic email collection; Re-built email thread; Focus on “inclusives”

Using Text Compare with Near Duplicate Sets
Text compare is used when reviewing documents that have been grouped into near-duplicate sets. For example, suppose a set of near-duplicates comprises 10 versions of a 50-page contract. The pivot document, which is the most representative document of the near-duplicate set, is identified. The lawyer starts by reading the pivot document. Having read the pivot, the lawyer can decide whether it is necessary to continue reviewing the remaining nine versions of the contract in the near-duplicate set. If the lawyer decides that the other nine versions do require review, it is not necessary to read the 50-page contract another nine times. Using Equivio>Compare, the lawyer can simply review the differences between each document and the pivot document.
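The review-by-difference idea can be illustrated with a short Python sketch. It uses the standard library's `difflib` module rather than Equivio>Compare, whose internals are not described here. The function name `changes_against_pivot` and the sample contract text are hypothetical, and the sketch assumes the documents have already been converted to plain text.

```python
import difflib


def changes_against_pivot(pivot_text, version_text):
    """Return only the lines that differ between the pivot and another version.

    Lines starting with "- " appear only in the pivot;
    lines starting with "+ " appear only in the other version.
    """
    diff = difflib.ndiff(pivot_text.splitlines(), version_text.splitlines())
    # ndiff also emits unchanged lines ("  ") and hint lines ("? "); drop those.
    return [line for line in diff if line.startswith(("- ", "+ "))]


# Hypothetical two-clause "contract" standing in for a 50-page document.
pivot = (
    "Clause 1: Payment is due in 30 days.\n"
    "Clause 2: Governing law is New South Wales."
)
version = (
    "Clause 1: Payment is due in 45 days.\n"
    "Clause 2: Governing law is New South Wales."
)

for line in changes_against_pivot(pivot, version):
    print(line)
```

Only the changed clause is reported, so a reviewer who has already read the pivot sees one differing line per version instead of the whole document again.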
Using Text Compare with Email Threads
Equivio>Compare also compares emails. The compare function is useful for highlighting the differences between two inclusives within an email thread. The two inclusives typically share a common ancestry; that is, both emails originate from the same original email thread, which at some point split into two sub-threads. Equivio>Compare identifies the common part of the chain and the unique elements in both inclusives.

READ LESS, THINK MORE, WIN BIG™

Equivio™, Equivio>NearDuplicates™, Equivio>EmailThreads™, Equivio>Compare™ and Read less, Think more, Win big™ are trademarks of Equivio. Other product names mentioned in this document may be trademarks or registered trademarks of their respective owners. All specifications in this document are subject to change without prior notice.