SlideShare a Scribd company logo
1 of 4
Download to read offline
2014 WHITE PAPER
DEMYSTIFYING ANALYTICS
IN EDISCOVERY
Steven Toole
Vice President of Marketing, Content Analyst Company, LLC.
1© 2014 Content Analyst,LLC.All rights reserved. Content Analyst,CAATand the Content Analyst and CAATlogos are registered trademarks of Content Analyst,LLC in the
United States. All other marks are the property of their respective owners.
A recent eDiscovery Journal blog entry by Greg
Buckles pointed out that the number of companies
providing the analytics technology in early case
assessment (ECA) and review platforms is few, and
the specific analytics capabilities can vary wildly
from one platform to the next. While this may not
be breaking news to those closest to analytics in
eDiscovery, it did reveal some mysticism about
how the market uses analytics in the early stages
of eDiscovery. In addition, it raised questions about
how analytics fit into the eDiscovery workflow, and
ultimately, the return on investment that analytics
have on eDiscovery and information governance in
general.
This overview is designed to further unveil this
mysticism surrounding analytics in eDiscovery and
information governance, and provide insights about
the return on investment analytics can enable for
those who embrace these capabilities. Corporate
counsel that get ahead of the curve today with
forward-thinking strategies such as these will be
the ultimate heroes and beneficiaries of eDiscovery
analytics, leading their field with a much more
proactive and cost-effective approach to information
governance and legal technology.
The Analytics Land Grab
While there are plenty of stakes in the ground across the
eDiscoverylandscape,lawfirmsandserviceprovidersarelooking
”to the West” for unclaimed territory. The gigabyte gold rush is all
about applying analytics to the data further upstream in order
to “own” the data long before it’s needed in a matter. ECA was
the first frontier law firms and service providers looked toward in
ordertomoveupstream,butthegreenestpasturesarestillfurther
upstream, in what some call “pre-discovery.” The Golden Rule in
eDiscovery is simple: Those who ”rule” the content get the gold.
Translation:thevendorthatcanapplyanalyticsearliest–beforeit’s
needed in a matter – provides the most value to corporations, and
therefore is at a great competitive advantage.
“ECA was the first frontier law
firms and service providers looked
toward in order to move upstream,
but the greenest pastures are still
further upstream, in what some
call “pre-discovery.”
”
Dynamic Clustering – This is a good place to start, especially if
you know little to nothing about the content. Clustering “buckets”
the content (documents, emails, etc.) into natural groupings of
conceptually related materials. One major benefit of clustering is
that it provides a very fast map of the document landscape in a
highly objective, consistent, concept-aware fashion. As a result,
thereviewercanjumpstraighttotheclusterthat’sofmostinterest
(conceptually relevant), and avoid spending time in clusters of no
conceptual relevance. In terms of information governance, it can
also help identify and weed out the ROT (redundant, outdated
and transient) content quickly and easily, thus reducing costs and
helping to increase ROI.
Term Expansion – You have a keyword, name or technical term,
abbreviation, or acronym, and you want a list of all similar,
or highly related, terms so you can expand your search for
documents containing those terms as well. Term expansion
identifies conceptually related terms, customized to your content,
and ranked in order of relevance. For example, Barack Obama
might produce a list such as President Obama, Commander-in-
Chief, Senator Obama, Michelle Obama, the Oval Office, Office of
the President, POTUS, etc. In a matter, that means finding more
conceptually related content faster and easier, saving time and
money. In information governance, it helps identify content
relatedtocorporaterecords,intellectualproperty,andcompliance,
as well as, of course, more ROT for defensible deletion.
Conceptual Search – You’ve identified a key document or
paragraph, now you want to find similar ones. Keyword search
will give you documents containing the specific keywords as best
as you can write the Boolean search string, and as long as those
keywords are included in the resulting documents. But writing
Boolean search strings can be time-consuming and still may miss
key documents containing the ”unknown” terms not included in
your search string. To find the documents you’d otherwise miss
with keyword searches, you’ll need to use conceptual search.
Applying mathematical algorithms to your example document or
text selection, conceptual search looks for matching patterns in
the “map” of the data called the conceptual space. The benefit is
thatconceptualsearchcanfindsimilarresultsevenifthematching
document doesn’t contain any of the same terms as the example
text. Think abbreviations, misspellings, acronyms, code words,
andrelatedtermsyouhadn’theard.Thentakeawayfalsepositives
from synonyms and polysemes. Translation: uncover highly
relevant yet latent relationships, saving time and costs.
Auto-Categorization – Predictive coding is one area that’s
gained a lot of attention in eDiscovery over the past two years.
Auto-categorization is what makes predictive coding possible.
Predictive coding is applying machine learning to a corpus of
documents to intelligently categorize them any number of ways,
such as, as privileged, responsive, nonresponsive. For example,
users can categorize documents as responsive, then categorize
the responsive documents even further into relevant issue sub-
categories. Auto-categorization uses the same conceptual space
and sample document exemplars to find conceptually similar
documents and label them as appropriate. Again, the big benefit
here is a tremendous amount of time saving (and cost saving) by
letting the technology bring the most relevant documents to the
forefront, and into the hands of the domain experts, as quickly and
easily as possible.
Email Threading – The concept of email threading is fairly
simple – find the subset of emails at the end of each branch of
a conversation thread. Rather than reading 30 emails back and
forth – as well as sideways among forwarded branches of the
conversation – email threading finds the subset of emails that
includeallofthepreviousreplies(called”inclusive”emailsbecause
these six, for example, include the whole history). Time and cost
savings of using email threading are self-evident, but it’s also
important to note that threading reveals exactly who knew what,
when – pretty critical in piecing together the course of events that
unfolded surrounding a matter.
Near-DuplicateIdentification–Asimilarbenefittoemailthreading
exists with near-duplicate identification.While conceptual search,
clustering or categorization can identify documents that are
relevant to the case, many could be various versions of the same
document. Knowing that they’re near duplicates of each other
can save the time of having to review each one. If it’s important to
know what changed from one version to the next, when, and by
whom, difference highlighting shows these changes, again saving
time and reducing cost. Batching near duplicates together from
theoutsetofamatteralsoprovidesreviewerswithamorefocused
set of documents.
Recipe for Success
This all sounds good, but what does this really mean? You can’t bake a cake until you know what the ingredients are, what they do, and how
they can affect the output. And since each eDiscovery solution has a different set of analytics capabilities, here’s a brief tutorial on the key
ingredients of analytics for eDiscovery and information governance.
© 2014 Content Analyst,LLC.All rights reserved. Content Analyst,CAATand the Content Analyst and CAATlogos are registered trademarks of Content Analyst,LLC in the
United States. All other marks are the property of their respective owners.
2
3© 2014 Content Analyst,LLC.All rights reserved. Content Analyst,CAATand the Content Analyst and CAATlogos are registered trademarks of Content Analyst,LLC in the
United States. All other marks are the property of their respective owners.
Putting Your Data on a Diet
Putting these analytics capabilities to work for you may cause
serious weight loss in your data. In eDiscovery, that means fewer
documents to review by expensive reviewers. It also means that
the documents they are reviewing are nothing but the absolutely
most conceptually relevant documents to the case. Moreover,
reviewers are being presented with documents that are not
batched haphazardly, allowing for a more focused review, driving
accuracy to an all-time high and costs and time even lower.
But Wait – There’s More!
Remember the gigabyte gold rush from above? The Golden Rule
of data? This is where analytics really are the key to unlocking all
those hidden insights in a company’s data – long before they’re
needed in eDiscovery. Applying text analytics to a company’s
electronic records proactively through pre-discovery means that
data is already organized, reduced, and ready to be presented
if and when a matter arises. Corporate counsel love this idea
because it keeps litigation costs as low as possible, decreases the
crucial time it takes to investigate or decide whether to settle a
case, and helps them present their side in the very best possible
light.
The Bottom Line: ROI for eDiscovery Analytics
Measuring the ROI of using analytics in eDiscovery comes down
to this: Review is the greatest cost factor in eDiscovery. Expert
reviewers don’t come cheap – their expertise is clearly of utmost
value in a case. But if a large percentage of their time is spent
reviewing documents not relevant to a case, then you’re not
getting the most value out of them in the first place. Their hours
cost the same whether they’re looking at the smoking gun
document in a case or something completely unrelated. You
wouldn’t wear gloves during a palm reading – they’d just get
in the way of the psychic doing her job. Presenting a corpus of
documents to expensive reviewers without applying machine
learning first makes no more sense. Further, finding documents
otherwise missed without analytics can also hinder the experts’
ability to formulate your case strategy.
But the ROI of applying analytics to your documents and email
pre-discovery goes even further. The cost benefits of organizing
and analyzing your content proactively are huge, helping to
drive decision making and information governance practices for
compliance, risk mitigation and cost avoidance. Applying these
strategies early can provide a tremendous advantage long term
through a much more proactive and cost-effective approach to
information governance and eDiscovery.
Steven Toole is vice
president of marketing
at Content Analyst
Company where his unique
combination of business
acumen, creativity,
strategic vision and tactical
execution yield impactful
results toward the
company's mission.
“Measuring the ROI of using
analytics in eDiscovery
comes down to this: Review
is the greatest cost factor in
eDiscovery.
”

More Related Content

What's hot

Connectivity to business outcomes
Connectivity to business outcomesConnectivity to business outcomes
Connectivity to business outcomesAndrey Karpov
 
Rethinking the eDiscovery Process by Kelly Twigger
Rethinking the eDiscovery Process by Kelly TwiggerRethinking the eDiscovery Process by Kelly Twigger
Rethinking the eDiscovery Process by Kelly TwiggerESI Attorneys LLC
 
Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...
Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...
Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...Dana Gardner
 
Searching for Meaning
Searching for MeaningSearching for Meaning
Searching for MeaningTrey Grainger
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPeter Wang
 
South Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis PanelSouth Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis PanelTrey Grainger
 
Semantic based Enterprise Search Solution in Networking Domain
Semantic based Enterprise Search Solution in Networking DomainSemantic based Enterprise Search Solution in Networking Domain
Semantic based Enterprise Search Solution in Networking DomainRanjan Baisak
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Editor IJARCET
 
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingCodemotion
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond Rajesh Kumar
 
Big Data-Survey
Big Data-SurveyBig Data-Survey
Big Data-Surveyijeei-iaes
 
The Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge GraphThe Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge GraphTrey Grainger
 
Reinvent Law London Grady Presentation
Reinvent Law London Grady PresentationReinvent Law London Grady Presentation
Reinvent Law London Grady PresentationKenneth Grady
 
A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...
A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...
A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...IJARBEST JOURNAL
 
Implementing Semantic Search
Implementing Semantic SearchImplementing Semantic Search
Implementing Semantic SearchPaul Wlodarczyk
 

What's hot (16)

Connectivity to business outcomes
Connectivity to business outcomesConnectivity to business outcomes
Connectivity to business outcomes
 
Rethinking the eDiscovery Process by Kelly Twigger
Rethinking the eDiscovery Process by Kelly TwiggerRethinking the eDiscovery Process by Kelly Twigger
Rethinking the eDiscovery Process by Kelly Twigger
 
Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...
Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...
Complex Carrier Network Performance Data on Vertica Yields Performance and Cu...
 
Searching for Meaning
Searching for MeaningSearching for Meaning
Searching for Meaning
 
Starting a search application
Starting a search applicationStarting a search application
Starting a search application
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data Analysis
 
South Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis PanelSouth Big Data Hub: Text Data Analysis Panel
South Big Data Hub: Text Data Analysis Panel
 
Semantic based Enterprise Search Solution in Networking Domain
Semantic based Enterprise Search Solution in Networking DomainSemantic based Enterprise Search Solution in Networking Domain
Semantic based Enterprise Search Solution in Networking Domain
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020
 
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
 
Big Data-Survey
Big Data-SurveyBig Data-Survey
Big Data-Survey
 
The Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge GraphThe Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge Graph
 
Reinvent Law London Grady Presentation
Reinvent Law London Grady PresentationReinvent Law London Grady Presentation
Reinvent Law London Grady Presentation
 
A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...
A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...
A SURVEY ON QUESTION AND ANSWER SYSTEM BY RETRIEVING THE DESCRIPTIONS USING L...
 
Implementing Semantic Search
Implementing Semantic SearchImplementing Semantic Search
Implementing Semantic Search
 

Viewers also liked

Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa
Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa
Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa Worldcom Public Relations Group
 
Mobile Tools for Immigration Journalists
Mobile Tools for Immigration JournalistsMobile Tools for Immigration Journalists
Mobile Tools for Immigration JournalistsMandy Jenkins
 
What does-it-take-to-follow-christ
What does-it-take-to-follow-christWhat does-it-take-to-follow-christ
What does-it-take-to-follow-christSSMC
 
Church in Community
Church in CommunityChurch in Community
Church in CommunitySSMC
 
Knowing god-through-1-peter
Knowing god-through-1-peterKnowing god-through-1-peter
Knowing god-through-1-peterSSMC
 
Translation: Vietnam taps reserves but dong still likely to slide
Translation: Vietnam taps reserves but dong still likely to slideTranslation: Vietnam taps reserves but dong still likely to slide
Translation: Vietnam taps reserves but dong still likely to slideAn Nguyen
 
5分で始める XS - tsukuba.xs#1
5分で始める XS - tsukuba.xs#15分で始める XS - tsukuba.xs#1
5分で始める XS - tsukuba.xs#1Yuji Shimada
 
Loyalty Presentation
Loyalty PresentationLoyalty Presentation
Loyalty Presentationwendyrousseau
 
Apostolic church 21 apr 2013
Apostolic church 21 apr 2013Apostolic church 21 apr 2013
Apostolic church 21 apr 2013SSMC
 
After you say i do sermon12 feb2012
After you say i do   sermon12 feb2012After you say i do   sermon12 feb2012
After you say i do sermon12 feb2012SSMC
 
Social Media for the New News Brand
Social Media for the New News BrandSocial Media for the New News Brand
Social Media for the New News BrandMandy Jenkins
 
Adatvizualizáció online eszközökkel
Adatvizualizáció online eszközökkelAdatvizualizáció online eszközökkel
Adatvizualizáció online eszközökkelSzilvia Tóth-Mózer
 
Eval1 Mª I Sabel Berrio Rojo
Eval1 Mª I Sabel Berrio RojoEval1 Mª I Sabel Berrio Rojo
Eval1 Mª I Sabel Berrio RojoSilvia Pañeda
 
جدول مباريات كأس العالم كاملا
جدول مباريات كأس العالم كاملاجدول مباريات كأس العالم كاملا
جدول مباريات كأس العالم كاملاamr magdy
 
愉快的性格是成功的靈魂
愉快的性格是成功的靈魂愉快的性格是成功的靈魂
愉快的性格是成功的靈魂guestc38345
 
The Muffis' washing
The Muffis' washingThe Muffis' washing
The Muffis' washingCons1
 

Viewers also liked (20)

Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa
Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa
Markkinointiviestintä – Mitä tehdä itse, mitä ulkoistaa
 
Gazedaslide
GazedaslideGazedaslide
Gazedaslide
 
Mobile Tools for Immigration Journalists
Mobile Tools for Immigration JournalistsMobile Tools for Immigration Journalists
Mobile Tools for Immigration Journalists
 
Rddfs instant insight
Rddfs instant insightRddfs instant insight
Rddfs instant insight
 
What does-it-take-to-follow-christ
What does-it-take-to-follow-christWhat does-it-take-to-follow-christ
What does-it-take-to-follow-christ
 
Church in Community
Church in CommunityChurch in Community
Church in Community
 
Knowing god-through-1-peter
Knowing god-through-1-peterKnowing god-through-1-peter
Knowing god-through-1-peter
 
Translation: Vietnam taps reserves but dong still likely to slide
Translation: Vietnam taps reserves but dong still likely to slideTranslation: Vietnam taps reserves but dong still likely to slide
Translation: Vietnam taps reserves but dong still likely to slide
 
5分で始める XS - tsukuba.xs#1
5分で始める XS - tsukuba.xs#15分で始める XS - tsukuba.xs#1
5分で始める XS - tsukuba.xs#1
 
Loyalty Presentation
Loyalty PresentationLoyalty Presentation
Loyalty Presentation
 
2010 BMW 535i Boston
2010 BMW 535i Boston2010 BMW 535i Boston
2010 BMW 535i Boston
 
Apostolic church 21 apr 2013
Apostolic church 21 apr 2013Apostolic church 21 apr 2013
Apostolic church 21 apr 2013
 
After you say i do sermon12 feb2012
After you say i do   sermon12 feb2012After you say i do   sermon12 feb2012
After you say i do sermon12 feb2012
 
Social Media for the New News Brand
Social Media for the New News BrandSocial Media for the New News Brand
Social Media for the New News Brand
 
Adatvizualizáció online eszközökkel
Adatvizualizáció online eszközökkelAdatvizualizáció online eszközökkel
Adatvizualizáció online eszközökkel
 
Eval1 Mª I Sabel Berrio Rojo
Eval1 Mª I Sabel Berrio RojoEval1 Mª I Sabel Berrio Rojo
Eval1 Mª I Sabel Berrio Rojo
 
جدول مباريات كأس العالم كاملا
جدول مباريات كأس العالم كاملاجدول مباريات كأس العالم كاملا
جدول مباريات كأس العالم كاملا
 
愉快的性格是成功的靈魂
愉快的性格是成功的靈魂愉快的性格是成功的靈魂
愉快的性格是成功的靈魂
 
Gatos 1
Gatos 1Gatos 1
Gatos 1
 
The Muffis' washing
The Muffis' washingThe Muffis' washing
The Muffis' washing
 

Similar to Demystifying analytics in e discovery white paper 06-30-14

Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...Findwise
 
How new ai based analytics ignite a productivity revolution in e discovery-final
How new ai based analytics ignite a productivity revolution in e discovery-finalHow new ai based analytics ignite a productivity revolution in e discovery-final
How new ai based analytics ignite a productivity revolution in e discovery-finaljcscholtes
 
ACEDS - ZyLAB webinar - AI Based eDiscovery Analytics
ACEDS - ZyLAB webinar - AI Based eDiscovery AnalyticsACEDS - ZyLAB webinar - AI Based eDiscovery Analytics
ACEDS - ZyLAB webinar - AI Based eDiscovery AnalyticsAnnelore van der Lint
 
The search engine index
The search engine indexThe search engine index
The search engine indexCJ Jenkins
 
Ten steps for early awareness and relevance/non-relevance selection
Ten steps for early awareness and relevance/non-relevance selectionTen steps for early awareness and relevance/non-relevance selection
Ten steps for early awareness and relevance/non-relevance selectionJeffJohnson442
 
Technology in Legal Collection - RMA - presentation - Francois Sauvageau
Technology in Legal Collection - RMA - presentation - Francois SauvageauTechnology in Legal Collection - RMA - presentation - Francois Sauvageau
Technology in Legal Collection - RMA - presentation - Francois SauvageauFrançois Sauvageau
 
Information Architecture Primer - Integrating search,tagging, taxonomy and us...
Information Architecture Primer - Integrating search,tagging, taxonomy and us...Information Architecture Primer - Integrating search,tagging, taxonomy and us...
Information Architecture Primer - Integrating search,tagging, taxonomy and us...Dan Keldsen
 
Document Types Explained: Structured, Semi-Structured and Unstructured
Document Types Explained: Structured, Semi-Structured and UnstructuredDocument Types Explained: Structured, Semi-Structured and Unstructured
Document Types Explained: Structured, Semi-Structured and UnstructuredInfrrd
 
Accelerate Data Discovery
Accelerate Data Discovery   Accelerate Data Discovery
Accelerate Data Discovery Attivio
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”voginip
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”VOGIN-academie
 
Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engineankur881120
 
Hvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtHvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtKristian Norling
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data TipsQubole
 
Data quality and data profiling
Data quality and data profilingData quality and data profiling
Data quality and data profilingShailja Khurana
 
Data Science - Part XI - Text Analytics
Data Science - Part XI - Text AnalyticsData Science - Part XI - Text Analytics
Data Science - Part XI - Text AnalyticsDerek Kane
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
 
Structured Document Search and Retrieval
Structured Document Search and RetrievalStructured Document Search and Retrieval
Structured Document Search and RetrievalOptum
 
USING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATION
USING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATIONUSING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATION
USING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATIONIJDKP
 

Similar to Demystifying analytics in e discovery white paper 06-30-14 (20)

Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
 
How new ai based analytics ignite a productivity revolution in e discovery-final
How new ai based analytics ignite a productivity revolution in e discovery-finalHow new ai based analytics ignite a productivity revolution in e discovery-final
How new ai based analytics ignite a productivity revolution in e discovery-final
 
ACEDS - ZyLAB webinar - AI Based eDiscovery Analytics
ACEDS - ZyLAB webinar - AI Based eDiscovery AnalyticsACEDS - ZyLAB webinar - AI Based eDiscovery Analytics
ACEDS - ZyLAB webinar - AI Based eDiscovery Analytics
 
The search engine index
The search engine indexThe search engine index
The search engine index
 
Ten steps for early awareness and relevance/non-relevance selection
Ten steps for early awareness and relevance/non-relevance selectionTen steps for early awareness and relevance/non-relevance selection
Ten steps for early awareness and relevance/non-relevance selection
 
Technology in Legal Collection - RMA - presentation - Francois Sauvageau
Technology in Legal Collection - RMA - presentation - Francois SauvageauTechnology in Legal Collection - RMA - presentation - Francois Sauvageau
Technology in Legal Collection - RMA - presentation - Francois Sauvageau
 
Information Architecture Primer - Integrating search,tagging, taxonomy and us...
Information Architecture Primer - Integrating search,tagging, taxonomy and us...Information Architecture Primer - Integrating search,tagging, taxonomy and us...
Information Architecture Primer - Integrating search,tagging, taxonomy and us...
 
Mighty Guides- Data Disruption
Mighty Guides- Data DisruptionMighty Guides- Data Disruption
Mighty Guides- Data Disruption
 
Document Types Explained: Structured, Semi-Structured and Unstructured
Document Types Explained: Structured, Semi-Structured and UnstructuredDocument Types Explained: Structured, Semi-Structured and Unstructured
Document Types Explained: Structured, Semi-Structured and Unstructured
 
Accelerate Data Discovery
Accelerate Data Discovery   Accelerate Data Discovery
Accelerate Data Discovery
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
 
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”Smartlogic, Semaphore and Semantically Enhanced Search –  For “Discovery”
Smartlogic, Semaphore and Semantically Enhanced Search – For “Discovery”
 
Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
 
Hvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtHvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivt
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data Tips
 
Data quality and data profiling
Data quality and data profilingData quality and data profiling
Data quality and data profiling
 
Data Science - Part XI - Text Analytics
Data Science - Part XI - Text AnalyticsData Science - Part XI - Text Analytics
Data Science - Part XI - Text Analytics
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Structured Document Search and Retrieval
Structured Document Search and RetrievalStructured Document Search and Retrieval
Structured Document Search and Retrieval
 
USING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATION
USING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATIONUSING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATION
USING GOOGLE’S KEYWORD RELATION IN MULTIDOMAIN DOCUMENT CLASSIFICATION
 

Recently uploaded

_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 

Recently uploaded (20)

_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 

Demystifying analytics in e discovery white paper 06-30-14

  • 1. 2014 WHITE PAPER DEMYSTIFYING ANALYTICS IN EDISCOVERY Steven Toole Vice President of Marketing, Content Analyst Company, LLC.
  • 2. 1© 2014 Content Analyst,LLC.All rights reserved. Content Analyst,CAATand the Content Analyst and CAATlogos are registered trademarks of Content Analyst,LLC in the United States. All other marks are the property of their respective owners. A recent eDiscovery Journal blog entry by Greg Buckles pointed out that the number of companies providing the analytics technology in early case assessment (ECA) and review platforms is few, and the specific analytics capabilities can vary wildly from one platform to the next. While this may not be breaking news to those closest to analytics in eDiscovery, it did reveal some mysticism about how the market uses analytics in the early stages of eDiscovery. In addition, it raised questions about how analytics fit into the eDiscovery workflow, and ultimately, the return on investment that analytics have on eDiscovery and information governance in general. This overview is designed to further unveil this mysticism surrounding analytics in eDiscovery and information governance, and provide insights about the return on investment analytics can enable for those who embrace these capabilities. Corporate counsel that get ahead of the curve today with forward-thinking strategies such as these will be the ultimate heroes and beneficiaries of eDiscovery analytics, leading their field with a much more proactive and cost-effective approach to information governance and legal technology. The Analytics Land Grab While there are plenty of stakes in the ground across the eDiscoverylandscape,lawfirmsandserviceprovidersarelooking ”to the West” for unclaimed territory. The gigabyte gold rush is all about applying analytics to the data further upstream in order to “own” the data long before it’s needed in a matter. ECA was the first frontier law firms and service providers looked toward in ordertomoveupstream,butthegreenestpasturesarestillfurther upstream, in what some call “pre-discovery.” The Golden Rule in eDiscovery is simple: Those who ”rule” the content get the gold. Translation:thevendorthatcanapplyanalyticsearliest–beforeit’s needed in a matter – provides the most value to corporations, and therefore is at a great competitive advantage. “ECA was the first frontier law firms and service providers looked toward in order to move upstream, but the greenest pastures are still further upstream, in what some call “pre-discovery.” ”
  • 3. Dynamic Clustering – This is a good place to start, especially if you know little to nothing about the content. Clustering “buckets” the content (documents, emails, etc.) into natural groupings of conceptually related materials. One major benefit of clustering is that it provides a very fast map of the document landscape in a highly objective, consistent, concept-aware fashion. As a result, thereviewercanjumpstraighttotheclusterthat’sofmostinterest (conceptually relevant), and avoid spending time in clusters of no conceptual relevance. In terms of information governance, it can also help identify and weed out the ROT (redundant, outdated and transient) content quickly and easily, thus reducing costs and helping to increase ROI. Term Expansion – You have a keyword, name or technical term, abbreviation, or acronym, and you want a list of all similar, or highly related, terms so you can expand your search for documents containing those terms as well. Term expansion identifies conceptually related terms, customized to your content, and ranked in order of relevance. For example, Barack Obama might produce a list such as President Obama, Commander-in- Chief, Senator Obama, Michelle Obama, the Oval Office, Office of the President, POTUS, etc. In a matter, that means finding more conceptually related content faster and easier, saving time and money. In information governance, it helps identify content relatedtocorporaterecords,intellectualproperty,andcompliance, as well as, of course, more ROT for defensible deletion. Conceptual Search – You’ve identified a key document or paragraph, now you want to find similar ones. Keyword search will give you documents containing the specific keywords as best as you can write the Boolean search string, and as long as those keywords are included in the resulting documents. But writing Boolean search strings can be time-consuming and still may miss key documents containing the ”unknown” terms not included in your search string. To find the documents you’d otherwise miss with keyword searches, you’ll need to use conceptual search. Applying mathematical algorithms to your example document or text selection, conceptual search looks for matching patterns in the “map” of the data called the conceptual space. The benefit is thatconceptualsearchcanfindsimilarresultsevenifthematching document doesn’t contain any of the same terms as the example text. Think abbreviations, misspellings, acronyms, code words, andrelatedtermsyouhadn’theard.Thentakeawayfalsepositives from synonyms and polysemes. Translation: uncover highly relevant yet latent relationships, saving time and costs. Auto-Categorization – Predictive coding is one area that’s gained a lot of attention in eDiscovery over the past two years. Auto-categorization is what makes predictive coding possible. Predictive coding is applying machine learning to a corpus of documents to intelligently categorize them any number of ways, such as, as privileged, responsive, nonresponsive. For example, users can categorize documents as responsive, then categorize the responsive documents even further into relevant issue sub- categories. Auto-categorization uses the same conceptual space and sample document exemplars to find conceptually similar documents and label them as appropriate. Again, the big benefit here is a tremendous amount of time saving (and cost saving) by letting the technology bring the most relevant documents to the forefront, and into the hands of the domain experts, as quickly and easily as possible. Email Threading – The concept of email threading is fairly simple – find the subset of emails at the end of each branch of a conversation thread. Rather than reading 30 emails back and forth – as well as sideways among forwarded branches of the conversation – email threading finds the subset of emails that includeallofthepreviousreplies(called”inclusive”emailsbecause these six, for example, include the whole history). Time and cost savings of using email threading are self-evident, but it’s also important to note that threading reveals exactly who knew what, when – pretty critical in piecing together the course of events that unfolded surrounding a matter. Near-DuplicateIdentification–Asimilarbenefittoemailthreading exists with near-duplicate identification.While conceptual search, clustering or categorization can identify documents that are relevant to the case, many could be various versions of the same document. Knowing that they’re near duplicates of each other can save the time of having to review each one. If it’s important to know what changed from one version to the next, when, and by whom, difference highlighting shows these changes, again saving time and reducing cost. Batching near duplicates together from theoutsetofamatteralsoprovidesreviewerswithamorefocused set of documents. Recipe for Success This all sounds good, but what does this really mean? You can’t bake a cake until you know what the ingredients are, what they do, and how they can affect the output. And since each eDiscovery solution has a different set of analytics capabilities, here’s a brief tutorial on the key ingredients of analytics for eDiscovery and information governance. © 2014 Content Analyst,LLC.All rights reserved. Content Analyst,CAATand the Content Analyst and CAATlogos are registered trademarks of Content Analyst,LLC in the United States. All other marks are the property of their respective owners. 2
  • 4. 3© 2014 Content Analyst,LLC.All rights reserved. Content Analyst,CAATand the Content Analyst and CAATlogos are registered trademarks of Content Analyst,LLC in the United States. All other marks are the property of their respective owners. Putting Your Data on a Diet Putting these analytics capabilities to work for you may cause serious weight loss in your data. In eDiscovery, that means fewer documents to review by expensive reviewers. It also means that the documents they are reviewing are nothing but the absolutely most conceptually relevant documents to the case. Moreover, reviewers are being presented with documents that are not batched haphazardly, allowing for a more focused review, driving accuracy to an all-time high and costs and time even lower. But Wait – There’s More! Remember the gigabyte gold rush from above? The Golden Rule of data? This is where analytics really are the key to unlocking all those hidden insights in a company’s data – long before they’re needed in eDiscovery. Applying text analytics to a company’s electronic records proactively through pre-discovery means that data is already organized, reduced, and ready to be presented if and when a matter arises. Corporate counsel love this idea because it keeps litigation costs as low as possible, decreases the crucial time it takes to investigate or decide whether to settle a case, and helps them present their side in the very best possible light. The Bottom Line: ROI for eDiscovery Analytics Measuring the ROI of using analytics in eDiscovery comes down to this: Review is the greatest cost factor in eDiscovery. Expert reviewers don’t come cheap – their expertise is clearly of utmost value in a case. But if a large percentage of their time is spent reviewing documents not relevant to a case, then you’re not getting the most value out of them in the first place. Their hours cost the same whether they’re looking at the smoking gun document in a case or something completely unrelated. You wouldn’t wear gloves during a palm reading – they’d just get in the way of the psychic doing her job. Presenting a corpus of documents to expensive reviewers without applying machine learning first makes no more sense. Further, finding documents otherwise missed without analytics can also hinder the experts’ ability to formulate your case strategy. But the ROI of applying analytics to your documents and email pre-discovery goes even further. The cost benefits of organizing and analyzing your content proactively are huge, helping to drive decision making and information governance practices for compliance, risk mitigation and cost avoidance. Applying these strategies early can provide a tremendous advantage long term through a much more proactive and cost-effective approach to information governance and eDiscovery. Steven Toole is vice president of marketing at Content Analyst Company where his unique combination of business acumen, creativity, strategic vision and tactical execution yield impactful results toward the company's mission. “Measuring the ROI of using analytics in eDiscovery comes down to this: Review is the greatest cost factor in eDiscovery. ”