SlideShare a Scribd company logo
1 of 39
Download to read offline
HOW NEW AI-BASED ANALYTICS IGNITE A
PRODUCTIVITY REVOLUTION IN
EDISCOVERY
ACEDS Webinar - August 24th, 2017
TODAY’S SPEAKERS
Mary Mack
Executive Director ACEDS
Paul Starrett
Specialist in electronic
evidence and data science in
the legal profession
Johannes Scholtes
CSO at ZyLAB
Professor Text-Mining
University of Maastricht
SLIDE / 3
 Tools from the field of Artificial Intelligence and Data Science
accelerate truth-finding missions in regulatory requests and
internal investigations.
 New AI-based analytics have drastically increased the speed
and improved the quality of the eDiscovery process.
 But what exactly are these new AI techniques and how do they
compare to all the other analytics we have been using for
years?
TODAY’S AGENDA
THE BUZZ
SLIDE / 5
e-Discovery & Artificial Intelligence The new reality
AI becomes good business practice
WHAT ARE WE TALKING ABOUT?
“Analytics” is the discovery,
interpretation, and communication
of meaningful patterns in data.
The terms “analytics” or “analysis”
describe functions ranging from
reporting and review metrics to
sophisticated search and
advanced data, text-mining and
machine learning applications.
Benefits also range across various
dimensions.
“Artificial Intelligence (AI) is a
broad, complex field of research.
AI includes tasks such as
reasoning, problem solving,
knowledge representation,
planning, machine learning,
natural language processing,
perception, motion, social
intelligence, and even creativity.
The ultimate goal is the creation
of some form of general
intelligence.
SLIDE / 6
The Usual Suspects:
 Exploding data volumes;
 New types of data (multi-media, social, BYOD);
 Exploding eDiscovery costs;
 New regulations and compliance requirements
 GDPR
 Cyber-security requirements
 More enthusiastic regulators, especially outside of the US.
SLIDE / 7
WHY WE SHOULD CARE
DEALING WITH THE EDISCOVERY DATA WAVE
In eDiscovery, you never know in
advance:
 How much data you will have;
 What type of data it will be and thus
what type of processing is required;
 What workflow and iterations you will
have;
 Automation, AI and Data Science are
very CPU and computers memory
intensive;
So, you need intelligent and extremely
load-balancing and resource allocation to
prevent bottlenecks and deal effectively
with the “Data Wave” in eDiscovery.
 Better understand your data: the ability to make better strategic
decisions.
 Early Case Assessment: build and justify eDiscovery budget,
resources and timelines.
 Reduce data volumes: cut through the noise and zero in on
documents of interest.
 Take an investigative approach: organize and prioritize documents.
 Reduce your eDiscovery cost: improve productivity and precision of
your team.
 Better quality: see greater consistency in coding decisions across
similar documents.
 Speed up litigation.
SLIDE / 9
WHY ANALYTICS?
 Humans have cognitive limitations when processing and
deriving insights from large-scale document sets; humans
simply cannot successfully synthesize large volumes of data.
 Technology will help lawyers work more efficiently, effectively,
and enjoyably.
 Grossman & Cormack* : “TAR was not only more effective than
human review at finding relevant documents, but also much
cheaper … Overall, the myth that exhaustive manual review is
the most effective—and therefore the most defensible—
approach to document review is strongly refuted.”
SLIDE / 10
WHY AI-BASED ANALYTICS?
* TECHNOLOGY-ASSISTED REVIEW IN E-DISCOVERY CAN BE MORE EFFECTIVE AND MORE EFFICIENT THAN EXHAUSTIVE MANUAL REVIEW
By Maura R. Grossman* & Gordon V. Cormack. Richmond Journal of Law and Technology. Vol. XVII, Issue 3.
SLIDE / 11
 Structural: aka syntactic analytics
 File-, Document and Forensic Property extraction, Meta-data
filtering, Saved (full-text) Searches, Email Thread detection,
Email Thread reduction, Missing emails in thread, Duplicate- and
Near Duplicate detection, Language identification,
Communication Analysis, Time-line Visualizations, Geo-mapping,
…
 Conceptual: aka semantic or meaning based analytics
 Keyword Expansion (taxonomy), Content Clustering, Content-
based Categorization, Conceptual Search, Sentiment & Emotion
Mining, Semantic Content Analysis, Word-Cloud, Topic Modeling,
…
 Machine Learning: data driven (predictive) analytics
 Technology Assisted Review, Contract clause detection &
classification, Privileged detection, …
SLIDE / 12
WHAT KIND OF ANALYTICS HAVE WE SEEN?
STRUCTURE OF DATA
MEANING OF DATA
LEARN FROM DATA
WHAT IS THE RELATION BETWEEN AI AND ANALYTICS?
eDiscovery needs:
 Perception
 Reading: OCR, handwriting detection, signature
recognition,
 Listening: Audio search
 Vision: Image classification
 Language: Machine Translation
 Intelligent Search
 Machine Learning for search
 Concept Clustering
 Data Visualization
 Text classification and categorization
 Document
 Paragraph (clause)
 Sentence or phrase
AI provides the algorithms and evaluation methods:
 Machine Learning
 Decision trees
 Support Vector Machines
 Deep Learning (CNN)
 Topic Modeling / Concept Search
 Hierarchical Clustering
 LSI
 LDA
 NMF
 Natural Language Processing (NLP)
 Shallow Parsing
 Deep Parsing
 Co-reference resolution
SLIDE / 13
PERCEPTION: AUDIO SEARCH
ZyLAB: automatic Audio
Search on all detected
(embedded) audio and
video files.
ZyLAB: embedded
machine translation
on every (embedded)
document or
document section.
PERCEPTION: MACHINE TRANSLATION
SLIDE / 16
PERCEPTION: HANDWRITING & SIGNATURE DETECTION (R&D)
SLIDE / 17
PERCEPTION: VISUAL CLASSIFICATION OF IMAGES FOR
EDISCOVERY (R&D)
PERCEPTION: OCR ON BITMAPS
ZyLAB: people often screenshot or take
pictures from such information, just in case
or to remember…. ZyLAB will pick up such
images, OCR and find them…
STRUCTURAL: UNPACK EMBEDDED CONTENT
ZyLAB:
• Every embedded item is extracted and OCR-ed if needed.
• Search & Find
• Show in document family
STRUCTURAL: ONE-ON-ONE COMMUNICATION
STRUCTURAL: MISSING EMAIL IN THREAD
ZyLAB:
 Identify gaps in
collected emails
 Compare gaps among
suspects
 Restore email from
backup’s
CONCEPTUAL: SEMANTICS AND SENTIMENTS
FIND EVEN WHEN YOU DO NOT KNOW WHAT TO LOOK FOR
Question Entities or patterns to address this question
Who is it about? PERSON, COMPANY, ORGANIZATION. EMAIL
ADDRESS
What is it about? Result of Topic Modeling and Concept Clustering
When did it happen? DATE, TIME, MONTH, DAY WEEK, YEAR
Where did it happen? ADDRESS, CITY, COUNTRY, CONTINENT,
DEPARTMENT and other geo-locations
Why did it happen? Sentiments, emotions and cursing
How did it happen? Combining entities and facts
How much/often did it happen? Quantitative measures such as amounts,
currencies, and other numbers. Also frequency
and averages on entity occurrences.
SLIDE / 24
MORE DETAILED INSIGHTS
SLIDE / 25
More interesting is to combine the W’s. For instance, why
not look for Who is Where, or What happened When.
Who – Who
Who – Why
When – What
The era of traditional keyword and Boolean search
seems to be over. Even the most brilliant query results
in too many hits. Reviewing these takes too much
time and resources.
 People do not know exactly what to look for, what
keywords to use or how to spell them.
 The quality of traditional search is much lower than
the searchers think (80% perceived versus 20-40%
actual quality).
 Only highly skilled searchers who manage all
(advanced) query options are able to get close to
80%. Even then, they cannot be sure that they did in
fact found 80% of all relevant documents. This is
another problem measuring recall: you never know
what you miss.
MACHINE LEARNING: THE NEW SEARCH
 Document Classification (TAR)
 Find responsive documents
 Boost recall
 Measure recall
 Paragraph Classification
 Privileged review
 Document clause classification
 Contract clause classification
 GDPR – Privacy detection – Redaction – Pseudomization
SLIDE / 28
DIFFERENT USE CASES OF MACHINE LEARNING
 Have we found all relevant
information? How complete
is the data we sent to the
regulator? Machine
learning!
 During this process, several
quantitative measures can
be calculated such as
precision, recall, F-values
and precision of the return
set. Based on these
measurements, one can
describe exactly how much
of the relevant information
has been found at which
moment in the process.
HOW CAN WE MEASURE RECALL
0
200
400
600
800
1000
1200
1400
1600
ZyLAB Assisted Review Manual Review
Hours
MACHINE LEARNING
 15-20 faster than manual review
 10-20% more accurate, fully defensible
 Privileged
information:
automatically identify
communications with
our lawyers.
 PII, PHI, and GDPR:
redaction and
pseudonymization
CLAUSE DETECTION
Detailed reporting
on content of
contracts, Reporting
on extraction of key
information, Higher
precision search
 ZyLAB’s Direct Collecting makes tremendous time savings to get data ready for early
case assessment and (first) pass review. Direct Collection drastically reduces the cost
and risks of downloading / uploading data or the shipping around of tapes and hard disks.
 ZyLAB’s Deep Processing allows you to automatically reduce your data volumes before
you send them on for review, without getting in trouble or being accused of data
spoliation. If every component of data is searchable, only then can one use automated
tools to reduce data.
 Using ZyLAB’s Review Accelerators you can minimize the most expensive and time
consuming part of the eDiscovery process. TAR, batch tagging, sampling, redaction,
email trails, …
 Litigants use ZyLAB’s Early Case Assessment to quickly understand the facts and
merits of a case, identify key custodians and recognize critical information so they can
develop an effective and realistic litigation strategy.
SLIDE / 34
BENEFITS TO IN-HOUSE COUNSEL
BENEFITS TO LAW FIRMS
 ZyLAB covers multiple eDiscovery use
cases. One platform: More cases, more
volume, better pricing.
 No need to involve any 3rd parties.
 Bill the hours for project management and
data science (machine learning) as well.
 DIY: upload data and almost immediately
start reviewing with your team and bill the
hours.
 Find out what really happened with
ZyLAB’s deep search and analytics.
Expand review team.
 Replace the bottom of the traditional
earnings pyramid with “review robots”:
make more margin.
 Be more competitive.
 Do more work with your current team:
never have to pass on new opportunities
because of capacity problems.
 less risk of errors and missing out on key
issues. So, less risk for liability claims and
higher insurance premiums.
“ZYLAB TAKES CARE OF THE PROCESS, SUPPORTS THE LAWYER BY
THINKING COMMERCIALLY AND PROVIDES COMFORT WITH THE
USE OF ADVANCED TECHNOLOGY”
Ruben Elkerbout, anti-trust lawyer and partner with Stek Lawyers
MORE READING – WWW.ZYLAB.COM/RESOURCES/EBOOKS/
Q&A
MORE INFORMATION: WWW.ZYLAB.COM
39
More ZyLAB Webinars and events:
https://zylab.com/company/event-calendar/

More Related Content

What's hot

Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyPeter Kua
 
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
 Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Mindshappiestmindstech
 
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Happiest Minds Technologies
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist ToolboxAndrei Savu
 
2015 data-science-salary-survey
2015 data-science-salary-survey2015 data-science-salary-survey
2015 data-science-salary-surveyAdam Rabinovitch
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” diannepatricia
 
A Primer for a layman about Big Data, Business Analytics and Cloud
A Primer for a layman  about Big Data, Business Analytics and CloudA Primer for a layman  about Big Data, Business Analytics and Cloud
A Primer for a layman about Big Data, Business Analytics and CloudRajagopalan V
 
Technology Intelligence for R&D
Technology Intelligence for R&DTechnology Intelligence for R&D
Technology Intelligence for R&DJoe Buzzanga
 
KM - Cognitive Computing overview by Ken Martin 13Apr2016
KM - Cognitive Computing overview by Ken Martin 13Apr2016KM - Cognitive Computing overview by Ken Martin 13Apr2016
KM - Cognitive Computing overview by Ken Martin 13Apr2016HCL Technologies
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052Gilbert Rozario
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big DataIndu Khemchandani
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...mark madsen
 
Data Mining and Data Warehousing (MAKAUT)
Data Mining and Data Warehousing (MAKAUT)Data Mining and Data Warehousing (MAKAUT)
Data Mining and Data Warehousing (MAKAUT)Bikramjit Sarkar, Ph.D.
 
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...DATAVERSITY
 
How the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI drivenHow the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI drivenSteven Nooijen
 
SMART Seminar - The Future of Business Intelligence: Information 2020
SMART Seminar - The Future of Business Intelligence: Information 2020SMART Seminar - The Future of Business Intelligence: Information 2020
SMART Seminar - The Future of Business Intelligence: Information 2020SMART Infrastructure Facility
 
iTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun SukhaniiTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun SukhaniiTrain
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data ScienceSanghamitra Deb
 
Ai and Legal Industy - Executive Overview
Ai and Legal Industy - Executive OverviewAi and Legal Industy - Executive Overview
Ai and Legal Industy - Executive OverviewGraeme Wood
 

What's hot (20)

Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi Periasamy
 
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
 Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
 
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist Toolbox
 
2015 data-science-salary-survey
2015 data-science-salary-survey2015 data-science-salary-survey
2015 data-science-salary-survey
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services”
 
A Primer for a layman about Big Data, Business Analytics and Cloud
A Primer for a layman  about Big Data, Business Analytics and CloudA Primer for a layman  about Big Data, Business Analytics and Cloud
A Primer for a layman about Big Data, Business Analytics and Cloud
 
Technology Intelligence for R&D
Technology Intelligence for R&DTechnology Intelligence for R&D
Technology Intelligence for R&D
 
KM - Cognitive Computing overview by Ken Martin 13Apr2016
KM - Cognitive Computing overview by Ken Martin 13Apr2016KM - Cognitive Computing overview by Ken Martin 13Apr2016
KM - Cognitive Computing overview by Ken Martin 13Apr2016
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big Data
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...
 
Data Mining and Data Warehousing (MAKAUT)
Data Mining and Data Warehousing (MAKAUT)Data Mining and Data Warehousing (MAKAUT)
Data Mining and Data Warehousing (MAKAUT)
 
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
 
How the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI drivenHow the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI driven
 
SMART Seminar - The Future of Business Intelligence: Information 2020
SMART Seminar - The Future of Business Intelligence: Information 2020SMART Seminar - The Future of Business Intelligence: Information 2020
SMART Seminar - The Future of Business Intelligence: Information 2020
 
iTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun SukhaniiTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun Sukhani
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data Science
 
Ai and Legal Industy - Executive Overview
Ai and Legal Industy - Executive OverviewAi and Legal Industy - Executive Overview
Ai and Legal Industy - Executive Overview
 

Similar to How new ai based analytics ignite a productivity revolution in e discovery-final

Week-1-Introduction to Data Mining.pptx
Week-1-Introduction to Data Mining.pptxWeek-1-Introduction to Data Mining.pptx
Week-1-Introduction to Data Mining.pptxTake1As
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksLucidworks
 
Efficiently Handling Subject Access Requests
Efficiently Handling Subject Access RequestsEfficiently Handling Subject Access Requests
Efficiently Handling Subject Access Requestsjcscholtes
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data scienceVipul Kalamkar
 
Theres No Crying In Baseball...Or In E Discovery 04.30.10
Theres No Crying In Baseball...Or In E Discovery 04.30.10Theres No Crying In Baseball...Or In E Discovery 04.30.10
Theres No Crying In Baseball...Or In E Discovery 04.30.10knugent
 
Evidence Data Preprocessing for Forensic and Legal Analytics
Evidence Data Preprocessing for Forensic and Legal AnalyticsEvidence Data Preprocessing for Forensic and Legal Analytics
Evidence Data Preprocessing for Forensic and Legal AnalyticsCSCJournals
 
Demystifying analytics in e discovery white paper 06-30-14
Demystifying analytics in e discovery   white paper 06-30-14Demystifying analytics in e discovery   white paper 06-30-14
Demystifying analytics in e discovery white paper 06-30-14Steven Toole
 
Introduction of Data Science and Data Analytics
Introduction of Data Science and Data AnalyticsIntroduction of Data Science and Data Analytics
Introduction of Data Science and Data AnalyticsVrushaliSolanke
 
District Office of Info and KM - Proposed - by Joel Magnussen - 2004
District Office of Info and KM - Proposed - by Joel Magnussen - 2004District Office of Info and KM - Proposed - by Joel Magnussen - 2004
District Office of Info and KM - Proposed - by Joel Magnussen - 2004Peter Stinson
 
Data-Mining-ppt (1).pptx
Data-Mining-ppt (1).pptxData-Mining-ppt (1).pptx
Data-Mining-ppt (1).pptxParvathyparu25
 
Data-Mining-ppt.pptx
Data-Mining-ppt.pptxData-Mining-ppt.pptx
Data-Mining-ppt.pptxayush309565
 
Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014Marten den Haring
 
What is Data Science?
What is Data Science?What is Data Science?
What is Data Science?Ahmed Banafa
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Miningdataminers.ir
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining Phi Jack
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 

Similar to How new ai based analytics ignite a productivity revolution in e discovery-final (20)

Data mining
Data miningData mining
Data mining
 
Week-1-Introduction to Data Mining.pptx
Week-1-Introduction to Data Mining.pptxWeek-1-Introduction to Data Mining.pptx
Week-1-Introduction to Data Mining.pptx
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
 
Efficiently Handling Subject Access Requests
Efficiently Handling Subject Access RequestsEfficiently Handling Subject Access Requests
Efficiently Handling Subject Access Requests
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
Theres No Crying In Baseball...Or In E Discovery 04.30.10
Theres No Crying In Baseball...Or In E Discovery 04.30.10Theres No Crying In Baseball...Or In E Discovery 04.30.10
Theres No Crying In Baseball...Or In E Discovery 04.30.10
 
Evidence Data Preprocessing for Forensic and Legal Analytics
Evidence Data Preprocessing for Forensic and Legal AnalyticsEvidence Data Preprocessing for Forensic and Legal Analytics
Evidence Data Preprocessing for Forensic and Legal Analytics
 
Demystifying analytics in e discovery white paper 06-30-14
Demystifying analytics in e discovery   white paper 06-30-14Demystifying analytics in e discovery   white paper 06-30-14
Demystifying analytics in e discovery white paper 06-30-14
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
 
Introduction of Data Science and Data Analytics
Introduction of Data Science and Data AnalyticsIntroduction of Data Science and Data Analytics
Introduction of Data Science and Data Analytics
 
District Office of Info and KM - Proposed - by Joel Magnussen - 2004
District Office of Info and KM - Proposed - by Joel Magnussen - 2004District Office of Info and KM - Proposed - by Joel Magnussen - 2004
District Office of Info and KM - Proposed - by Joel Magnussen - 2004
 
Data-Mining-ppt (1).pptx
Data-Mining-ppt (1).pptxData-Mining-ppt (1).pptx
Data-Mining-ppt (1).pptx
 
Data-Mining-ppt.pptx
Data-Mining-ppt.pptxData-Mining-ppt.pptx
Data-Mining-ppt.pptx
 
Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014
 
What is Data Science?
What is Data Science?What is Data Science?
What is Data Science?
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Mining
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 

More from jcscholtes

Legal tech Alliance Workshop 20191029
Legal tech Alliance Workshop 20191029Legal tech Alliance Workshop 20191029
Legal tech Alliance Workshop 20191029jcscholtes
 
LegalTech Alliance eDiscovery keynote Scholtes
LegalTech Alliance eDiscovery keynote ScholtesLegalTech Alliance eDiscovery keynote Scholtes
LegalTech Alliance eDiscovery keynote Scholtesjcscholtes
 
Text mining scholtes - big data congress utrecht 2019
Text mining   scholtes - big data congress utrecht 2019Text mining   scholtes - big data congress utrecht 2019
Text mining scholtes - big data congress utrecht 2019jcscholtes
 
Target-Based Sentiment Anaysis as a Sequence-Tagging Task
Target-Based Sentiment Anaysis as a Sequence-Tagging TaskTarget-Based Sentiment Anaysis as a Sequence-Tagging Task
Target-Based Sentiment Anaysis as a Sequence-Tagging Taskjcscholtes
 
Ai and applications in the legal domain studium generale maastricht 20191101
Ai and applications in the legal domain studium generale maastricht 20191101Ai and applications in the legal domain studium generale maastricht 20191101
Ai and applications in the legal domain studium generale maastricht 20191101jcscholtes
 
Augmented intelligence and the impact on your world in 2030
Augmented intelligence and the impact on your world in 2030Augmented intelligence and the impact on your world in 2030
Augmented intelligence and the impact on your world in 2030jcscholtes
 
Text mining voor Business Intelligence toepassingen
Text mining voor Business Intelligence toepassingenText mining voor Business Intelligence toepassingen
Text mining voor Business Intelligence toepassingenjcscholtes
 
How can text-mining leverage developments in Deep Learning? Presentation at ...
How can text-mining leverage developments in Deep Learning?  Presentation at ...How can text-mining leverage developments in Deep Learning?  Presentation at ...
How can text-mining leverage developments in Deep Learning? Presentation at ...jcscholtes
 
Hogeschool Den Haag Legal Analytics
Hogeschool Den Haag Legal AnalyticsHogeschool Den Haag Legal Analytics
Hogeschool Den Haag Legal Analyticsjcscholtes
 
HvA Legaltech Lab Opening
HvA Legaltech Lab OpeningHvA Legaltech Lab Opening
HvA Legaltech Lab Openingjcscholtes
 
Big Data en Data Science en de Rechtspraak
Big Data en Data Science en de RechtspraakBig Data en Data Science en de Rechtspraak
Big Data en Data Science en de Rechtspraakjcscholtes
 
How can Artificial Intelligence help me on the Battlefield?
How can Artificial Intelligence help me on the Battlefield?How can Artificial Intelligence help me on the Battlefield?
How can Artificial Intelligence help me on the Battlefield?jcscholtes
 
Big data analytics for legal fact finding
Big data analytics for legal fact findingBig data analytics for legal fact finding
Big data analytics for legal fact findingjcscholtes
 
Text mining scholtes - big data congress utrecht 2018
Text mining   scholtes - big data congress utrecht 2018Text mining   scholtes - big data congress utrecht 2018
Text mining scholtes - big data congress utrecht 2018jcscholtes
 
Waarom LegalTech de toekomst heeft
Waarom LegalTech de toekomst heeftWaarom LegalTech de toekomst heeft
Waarom LegalTech de toekomst heeftjcscholtes
 

More from jcscholtes (15)

Legal tech Alliance Workshop 20191029
Legal tech Alliance Workshop 20191029Legal tech Alliance Workshop 20191029
Legal tech Alliance Workshop 20191029
 
LegalTech Alliance eDiscovery keynote Scholtes
LegalTech Alliance eDiscovery keynote ScholtesLegalTech Alliance eDiscovery keynote Scholtes
LegalTech Alliance eDiscovery keynote Scholtes
 
Text mining scholtes - big data congress utrecht 2019
Text mining   scholtes - big data congress utrecht 2019Text mining   scholtes - big data congress utrecht 2019
Text mining scholtes - big data congress utrecht 2019
 
Target-Based Sentiment Anaysis as a Sequence-Tagging Task
Target-Based Sentiment Anaysis as a Sequence-Tagging TaskTarget-Based Sentiment Anaysis as a Sequence-Tagging Task
Target-Based Sentiment Anaysis as a Sequence-Tagging Task
 
Ai and applications in the legal domain studium generale maastricht 20191101
Ai and applications in the legal domain studium generale maastricht 20191101Ai and applications in the legal domain studium generale maastricht 20191101
Ai and applications in the legal domain studium generale maastricht 20191101
 
Augmented intelligence and the impact on your world in 2030
Augmented intelligence and the impact on your world in 2030Augmented intelligence and the impact on your world in 2030
Augmented intelligence and the impact on your world in 2030
 
Text mining voor Business Intelligence toepassingen
Text mining voor Business Intelligence toepassingenText mining voor Business Intelligence toepassingen
Text mining voor Business Intelligence toepassingen
 
How can text-mining leverage developments in Deep Learning? Presentation at ...
How can text-mining leverage developments in Deep Learning?  Presentation at ...How can text-mining leverage developments in Deep Learning?  Presentation at ...
How can text-mining leverage developments in Deep Learning? Presentation at ...
 
Hogeschool Den Haag Legal Analytics
Hogeschool Den Haag Legal AnalyticsHogeschool Den Haag Legal Analytics
Hogeschool Den Haag Legal Analytics
 
HvA Legaltech Lab Opening
HvA Legaltech Lab OpeningHvA Legaltech Lab Opening
HvA Legaltech Lab Opening
 
Big Data en Data Science en de Rechtspraak
Big Data en Data Science en de RechtspraakBig Data en Data Science en de Rechtspraak
Big Data en Data Science en de Rechtspraak
 
How can Artificial Intelligence help me on the Battlefield?
How can Artificial Intelligence help me on the Battlefield?How can Artificial Intelligence help me on the Battlefield?
How can Artificial Intelligence help me on the Battlefield?
 
Big data analytics for legal fact finding
Big data analytics for legal fact findingBig data analytics for legal fact finding
Big data analytics for legal fact finding
 
Text mining scholtes - big data congress utrecht 2018
Text mining   scholtes - big data congress utrecht 2018Text mining   scholtes - big data congress utrecht 2018
Text mining scholtes - big data congress utrecht 2018
 
Waarom LegalTech de toekomst heeft
Waarom LegalTech de toekomst heeftWaarom LegalTech de toekomst heeft
Waarom LegalTech de toekomst heeft
 

Recently uploaded

Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝soniya singh
 
如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书
如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书
如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书Fir L
 
Andrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top Boutique
Andrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top BoutiqueAndrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top Boutique
Andrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top BoutiqueSkyLaw Professional Corporation
 
如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书
如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书
如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书FS LS
 
Understanding Social Media Bullying: Legal Implications and Challenges
Understanding Social Media Bullying: Legal Implications and ChallengesUnderstanding Social Media Bullying: Legal Implications and Challenges
Understanding Social Media Bullying: Legal Implications and ChallengesFinlaw Associates
 
一比一原版旧金山州立大学毕业证学位证书
 一比一原版旧金山州立大学毕业证学位证书 一比一原版旧金山州立大学毕业证学位证书
一比一原版旧金山州立大学毕业证学位证书SS A
 
A Short-ppt on new gst laws in india.pptx
A Short-ppt on new gst laws in india.pptxA Short-ppt on new gst laws in india.pptx
A Short-ppt on new gst laws in india.pptxPKrishna18
 
Constitutional Values & Fundamental Principles of the ConstitutionPPT.pptx
Constitutional Values & Fundamental Principles of the ConstitutionPPT.pptxConstitutional Values & Fundamental Principles of the ConstitutionPPT.pptx
Constitutional Values & Fundamental Principles of the ConstitutionPPT.pptxsrikarna235
 
定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一
定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一
定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一jr6r07mb
 
How You Can Get a Turkish Digital Nomad Visa
How You Can Get a Turkish Digital Nomad VisaHow You Can Get a Turkish Digital Nomad Visa
How You Can Get a Turkish Digital Nomad VisaBridgeWest.eu
 
如何办理(Lincoln文凭证书)林肯大学毕业证学位证书
如何办理(Lincoln文凭证书)林肯大学毕业证学位证书如何办理(Lincoln文凭证书)林肯大学毕业证学位证书
如何办理(Lincoln文凭证书)林肯大学毕业证学位证书Fs Las
 
如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书
 如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书 如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书
如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书Sir Lt
 
Test Identification Parade & Dying Declaration.pptx
Test Identification Parade & Dying Declaration.pptxTest Identification Parade & Dying Declaration.pptx
Test Identification Parade & Dying Declaration.pptxsrikarna235
 
如何办理美国波士顿大学(BU)毕业证学位证书
如何办理美国波士顿大学(BU)毕业证学位证书如何办理美国波士顿大学(BU)毕业证学位证书
如何办理美国波士顿大学(BU)毕业证学位证书Fir L
 
如何办理佛蒙特大学毕业证学位证书
 如何办理佛蒙特大学毕业证学位证书 如何办理佛蒙特大学毕业证学位证书
如何办理佛蒙特大学毕业证学位证书Fir sss
 
如何办理澳洲南澳大学(UniSA)毕业证学位证书
如何办理澳洲南澳大学(UniSA)毕业证学位证书如何办理澳洲南澳大学(UniSA)毕业证学位证书
如何办理澳洲南澳大学(UniSA)毕业证学位证书Fir L
 
FINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.ppt
FINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.pptFINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.ppt
FINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.pptjudeplata
 
Essentials of a Valid Transfer.pptxmmmmmm
Essentials of a Valid Transfer.pptxmmmmmmEssentials of a Valid Transfer.pptxmmmmmm
Essentials of a Valid Transfer.pptxmmmmmm2020000445musaib
 
Offences against property (TRESPASS, BREAKING
Offences against property (TRESPASS, BREAKINGOffences against property (TRESPASS, BREAKING
Offences against property (TRESPASS, BREAKINGPRAKHARGUPTA419620
 

Recently uploaded (20)

Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Haqiqat Nagar Delhi reach out to us at 🔝8264348440🔝
 
如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书
如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书
如何办理新加坡南洋理工大学毕业证(本硕)NTU学位证书
 
Andrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top Boutique
Andrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top BoutiqueAndrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top Boutique
Andrea Hill Featured in Canadian Lawyer as SkyLaw Recognized as a Top Boutique
 
如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书
如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书
如何办理伦敦南岸大学毕业证(本硕)LSBU学位证书
 
Understanding Social Media Bullying: Legal Implications and Challenges
Understanding Social Media Bullying: Legal Implications and ChallengesUnderstanding Social Media Bullying: Legal Implications and Challenges
Understanding Social Media Bullying: Legal Implications and Challenges
 
一比一原版旧金山州立大学毕业证学位证书
 一比一原版旧金山州立大学毕业证学位证书 一比一原版旧金山州立大学毕业证学位证书
一比一原版旧金山州立大学毕业证学位证书
 
A Short-ppt on new gst laws in india.pptx
A Short-ppt on new gst laws in india.pptxA Short-ppt on new gst laws in india.pptx
A Short-ppt on new gst laws in india.pptx
 
Constitutional Values & Fundamental Principles of the ConstitutionPPT.pptx
Constitutional Values & Fundamental Principles of the ConstitutionPPT.pptxConstitutional Values & Fundamental Principles of the ConstitutionPPT.pptx
Constitutional Values & Fundamental Principles of the ConstitutionPPT.pptx
 
定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一
定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一
定制(WMU毕业证书)美国西密歇根大学毕业证成绩单原版一比一
 
How You Can Get a Turkish Digital Nomad Visa
How You Can Get a Turkish Digital Nomad VisaHow You Can Get a Turkish Digital Nomad Visa
How You Can Get a Turkish Digital Nomad Visa
 
如何办理(Lincoln文凭证书)林肯大学毕业证学位证书
如何办理(Lincoln文凭证书)林肯大学毕业证学位证书如何办理(Lincoln文凭证书)林肯大学毕业证学位证书
如何办理(Lincoln文凭证书)林肯大学毕业证学位证书
 
young Call Girls in Pusa Road🔝 9953330565 🔝 escort Service
young Call Girls in  Pusa Road🔝 9953330565 🔝 escort Serviceyoung Call Girls in  Pusa Road🔝 9953330565 🔝 escort Service
young Call Girls in Pusa Road🔝 9953330565 🔝 escort Service
 
如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书
 如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书 如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书
如何办理(MSU文凭证书)密歇根州立大学毕业证学位证书
 
Test Identification Parade & Dying Declaration.pptx
Test Identification Parade & Dying Declaration.pptxTest Identification Parade & Dying Declaration.pptx
Test Identification Parade & Dying Declaration.pptx
 
如何办理美国波士顿大学(BU)毕业证学位证书
如何办理美国波士顿大学(BU)毕业证学位证书如何办理美国波士顿大学(BU)毕业证学位证书
如何办理美国波士顿大学(BU)毕业证学位证书
 
如何办理佛蒙特大学毕业证学位证书
 如何办理佛蒙特大学毕业证学位证书 如何办理佛蒙特大学毕业证学位证书
如何办理佛蒙特大学毕业证学位证书
 
如何办理澳洲南澳大学(UniSA)毕业证学位证书
如何办理澳洲南澳大学(UniSA)毕业证学位证书如何办理澳洲南澳大学(UniSA)毕业证学位证书
如何办理澳洲南澳大学(UniSA)毕业证学位证书
 
FINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.ppt
FINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.pptFINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.ppt
FINALTRUEENFORCEMENT OF BARANGAY SETTLEMENT.ppt
 
Essentials of a Valid Transfer.pptxmmmmmm
Essentials of a Valid Transfer.pptxmmmmmmEssentials of a Valid Transfer.pptxmmmmmm
Essentials of a Valid Transfer.pptxmmmmmm
 
Offences against property (TRESPASS, BREAKING
Offences against property (TRESPASS, BREAKINGOffences against property (TRESPASS, BREAKING
Offences against property (TRESPASS, BREAKING
 

How new ai based analytics ignite a productivity revolution in e discovery-final

  • 1. HOW NEW AI-BASED ANALYTICS IGNITE A PRODUCTIVITY REVOLUTION IN EDISCOVERY ACEDS Webinar - August 24th, 2017
  • 2. TODAY’S SPEAKERS Mary Mack Executive Director ACEDS Paul Starrett Specialist in electronic evidence and data science in the legal profession Johannes Scholtes CSO at ZyLAB Professor Text-Mining University of Maastricht
  • 4.  Tools from the field of Artificial Intelligence and Data Science accelerate truth-finding missions in regulatory requests and internal investigations.  New AI-based analytics have drastically increased the speed and improved the quality of the eDiscovery process.  But what exactly are these new AI techniques and how do they compare to all the other analytics we have been using for years? TODAY’S AGENDA
  • 5. THE BUZZ SLIDE / 5 e-Discovery & Artificial Intelligence The new reality AI becomes good business practice
  • 6. WHAT ARE WE TALKING ABOUT? “Analytics” is the discovery, interpretation, and communication of meaningful patterns in data. The terms “analytics” or “analysis” describe functions ranging from reporting and review metrics to sophisticated search and advanced data, text-mining and machine learning applications. Benefits also range across various dimensions. “Artificial Intelligence (AI) is a broad, complex field of research. AI includes tasks such as reasoning, problem solving, knowledge representation, planning, machine learning, natural language processing, perception, motion, social intelligence, and even creativity. The ultimate goal is the creation of some form of general intelligence. SLIDE / 6
  • 7. The Usual Suspects:  Exploding data volumes;  New types of data (multi-media, social, BYOD);  Exploding eDiscovery costs;  New regulations and compliance requirements  GDPR  Cyber-security requirements  More enthusiastic regulators, especially outside of the US. SLIDE / 7 WHY WE SHOULD CARE
  • 8. DEALING WITH THE EDISCOVERY DATA WAVE In eDiscovery, you never know in advance:  How much data you will have;  What type of data it will be and thus what type of processing is required;  What workflow and iterations you will have;  Automation, AI and Data Science are very CPU and computers memory intensive; So, you need intelligent and extremely load-balancing and resource allocation to prevent bottlenecks and deal effectively with the “Data Wave” in eDiscovery.
  • 9.  Better understand your data: the ability to make better strategic decisions.  Early Case Assessment: build and justify eDiscovery budget, resources and timelines.  Reduce data volumes: cut through the noise and zero in on documents of interest.  Take an investigative approach: organize and prioritize documents.  Reduce your eDiscovery cost: improve productivity and precision of your team.  Better quality: see greater consistency in coding decisions across similar documents.  Speed up litigation. SLIDE / 9 WHY ANALYTICS?
  • 10.  Humans have cognitive limitations when processing and deriving insights from large-scale document sets; humans simply cannot successfully synthesize large volumes of data.  Technology will help lawyers work more efficiently, effectively, and enjoyably.  Grossman & Cormack* : “TAR was not only more effective than human review at finding relevant documents, but also much cheaper … Overall, the myth that exhaustive manual review is the most effective—and therefore the most defensible— approach to document review is strongly refuted.” SLIDE / 10 WHY AI-BASED ANALYTICS? * TECHNOLOGY-ASSISTED REVIEW IN E-DISCOVERY CAN BE MORE EFFECTIVE AND MORE EFFICIENT THAN EXHAUSTIVE MANUAL REVIEW By Maura R. Grossman* & Gordon V. Cormack. Richmond Journal of Law and Technology. Vol. XVII, Issue 3.
  • 12.  Structural: aka syntactic analytics  File-, Document and Forensic Property extraction, Meta-data filtering, Saved (full-text) Searches, Email Thread detection, Email Thread reduction, Missing emails in thread, Duplicate- and Near Duplicate detection, Language identification, Communication Analysis, Time-line Visualizations, Geo-mapping, …  Conceptual: aka semantic or meaning based analytics  Keyword Expansion (taxonomy), Content Clustering, Content- based Categorization, Conceptual Search, Sentiment & Emotion Mining, Semantic Content Analysis, Word-Cloud, Topic Modeling, …  Machine Learning: data driven (predictive) analytics  Technology Assisted Review, Contract clause detection & classification, Privileged detection, … SLIDE / 12 WHAT KIND OF ANALYTICS HAVE WE SEEN? STRUCTURE OF DATA MEANING OF DATA LEARN FROM DATA
  • 13. WHAT IS THE RELATION BETWEEN AI AND ANALYTICS? eDiscovery needs:  Perception  Reading: OCR, handwriting detection, signature recognition,  Listening: Audio search  Vision: Image classification  Language: Machine Translation  Intelligent Search  Machine Learning for search  Concept Clustering  Data Visualization  Text classification and categorization  Document  Paragraph (clause)  Sentence or phrase AI provides the algorithms and evaluation methods:  Machine Learning  Decision trees  Support Vector Machines  Deep Learning (CNN)  Topic Modeling / Concept Search  Hierarchical Clustering  LSI  LDA  NMF  Natural Language Processing (NLP)  Shallow Parsing  Deep Parsing  Co-reference resolution SLIDE / 13
  • 14. PERCEPTION: AUDIO SEARCH ZyLAB: automatic Audio Search on all detected (embedded) audio and video files.
  • 15. ZyLAB: embedded machine translation on every (embedded) document or document section. PERCEPTION: MACHINE TRANSLATION
  • 16. SLIDE / 16 PERCEPTION: HANDWRITING & SIGNATURE DETECTION (R&D)
  • 17. SLIDE / 17 PERCEPTION: VISUAL CLASSIFICATION OF IMAGES FOR EDISCOVERY (R&D)
  • 18. PERCEPTION: OCR ON BITMAPS ZyLAB: people often screenshot or take pictures from such information, just in case or to remember…. ZyLAB will pick up such images, OCR and find them…
  • 19. STRUCTURAL: UNPACK EMBEDDED CONTENT ZyLAB: • Every embedded item is extracted and OCR-ed if needed. • Search & Find • Show in document family
  • 21. STRUCTURAL: MISSING EMAIL IN THREAD ZyLAB:  Identify gaps in collected emails  Compare gaps among suspects  Restore email from backup’s
  • 23. FIND EVEN WHEN YOU DO NOT KNOW WHAT TO LOOK FOR
  • 24. Question Entities or patterns to address this question Who is it about? PERSON, COMPANY, ORGANIZATION. EMAIL ADDRESS What is it about? Result of Topic Modeling and Concept Clustering When did it happen? DATE, TIME, MONTH, DAY WEEK, YEAR Where did it happen? ADDRESS, CITY, COUNTRY, CONTINENT, DEPARTMENT and other geo-locations Why did it happen? Sentiments, emotions and cursing How did it happen? Combining entities and facts How much/often did it happen? Quantitative measures such as amounts, currencies, and other numbers. Also frequency and averages on entity occurrences. SLIDE / 24
  • 25. MORE DETAILED INSIGHTS SLIDE / 25 More interesting is to combine the W’s. For instance, why not look for Who is Where, or What happened When. Who – Who Who – Why When – What
  • 26. The era of traditional keyword and Boolean search seems to be over. Even the most brilliant query results in too many hits. Reviewing these takes too much time and resources.  People do not know exactly what to look for, what keywords to use or how to spell them.  The quality of traditional search is much lower than the searchers think (80% perceived versus 20-40% actual quality).  Only highly skilled searchers who manage all (advanced) query options are able to get close to 80%. Even then, they cannot be sure that they did in fact found 80% of all relevant documents. This is another problem measuring recall: you never know what you miss. MACHINE LEARNING: THE NEW SEARCH
  • 27.
  • 28.  Document Classification (TAR)  Find responsive documents  Boost recall  Measure recall  Paragraph Classification  Privileged review  Document clause classification  Contract clause classification  GDPR – Privacy detection – Redaction – Pseudomization SLIDE / 28 DIFFERENT USE CASES OF MACHINE LEARNING
  • 29.  Have we found all relevant information? How complete is the data we sent to the regulator? Machine learning!  During this process, several quantitative measures can be calculated such as precision, recall, F-values and precision of the return set. Based on these measurements, one can describe exactly how much of the relevant information has been found at which moment in the process. HOW CAN WE MEASURE RECALL
  • 30.
  • 31. 0 200 400 600 800 1000 1200 1400 1600 ZyLAB Assisted Review Manual Review Hours MACHINE LEARNING  15-20 faster than manual review  10-20% more accurate, fully defensible
  • 32.  Privileged information: automatically identify communications with our lawyers.  PII, PHI, and GDPR: redaction and pseudonymization
  • 33. CLAUSE DETECTION Detailed reporting on content of contracts, Reporting on extraction of key information, Higher precision search
  • 34.  ZyLAB’s Direct Collecting makes tremendous time savings to get data ready for early case assessment and (first) pass review. Direct Collection drastically reduces the cost and risks of downloading / uploading data or the shipping around of tapes and hard disks.  ZyLAB’s Deep Processing allows you to automatically reduce your data volumes before you send them on for review, without getting in trouble or being accused of data spoliation. If every component of data is searchable, only then can one use automated tools to reduce data.  Using ZyLAB’s Review Accelerators you can minimize the most expensive and time consuming part of the eDiscovery process. TAR, batch tagging, sampling, redaction, email trails, …  Litigants use ZyLAB’s Early Case Assessment to quickly understand the facts and merits of a case, identify key custodians and recognize critical information so they can develop an effective and realistic litigation strategy. SLIDE / 34 BENEFITS TO IN-HOUSE COUNSEL
  • 35. BENEFITS TO LAW FIRMS  ZyLAB covers multiple eDiscovery use cases. One platform: More cases, more volume, better pricing.  No need to involve any 3rd parties.  Bill the hours for project management and data science (machine learning) as well.  DIY: upload data and almost immediately start reviewing with your team and bill the hours.  Find out what really happened with ZyLAB’s deep search and analytics. Expand review team.  Replace the bottom of the traditional earnings pyramid with “review robots”: make more margin.  Be more competitive.  Do more work with your current team: never have to pass on new opportunities because of capacity problems.  less risk of errors and missing out on key issues. So, less risk for liability claims and higher insurance premiums.
  • 36.
  • 37. “ZYLAB TAKES CARE OF THE PROCESS, SUPPORTS THE LAWYER BY THINKING COMMERCIALLY AND PROVIDES COMFORT WITH THE USE OF ADVANCED TECHNOLOGY” Ruben Elkerbout, anti-trust lawyer and partner with Stek Lawyers
  • 38. MORE READING – WWW.ZYLAB.COM/RESOURCES/EBOOKS/
  • 39. Q&A MORE INFORMATION: WWW.ZYLAB.COM 39 More ZyLAB Webinars and events: https://zylab.com/company/event-calendar/