SlideShare a Scribd company logo
DemystifyingPredictive
CodingTechnology
Date: Wednesday,August13,2014
Time: 1p.m.ET/NoonCT/11a.m.
MT/10a.m.PT
Anita Engles, VP Products and Marketing
Daegis
Doug Stewart, VP Sales Support
Daegis
TAR Defined
A process for prioritizing or coding a collection of
electronic documents using a computerized
system that harnesses human judgments of one
or more Subject Matter Expert(s) on a smaller set
of documents and then extrapolates those
judgments to the remaining Document Population.
* Grossman & Cormack 2012
The TAR Frontlines
• Evaluation of Machine-Learning
Protocols for Technology-Assisted
Review in Electronic Discovery
(2014)
• Maura R. Grossman and Gordon V. Cormack
• http://cormack.uwaterloo.ca/cormack/calstudy/
Key Findings
• Non-Random Selection Methods
Work Best for Seed Set
• Active Learning Better than Passive
Learning
• Senior Level Subject Matter Experts
are NOT Required to Train System
TAR Steps
Process Overview
ProducingTraining
Assessing
Results
Creating the
Seed Set
Keyword
Searching
Relatedness
Scoring
Identifying
the
Population
Relatedness Scoring
Building the Map
• Build the MapStep
• Measure
Relationships
Purpose
• AlgorithmsVariations
• Core to Predictive
Functionality
Why It
Matters
Keyword Searching
Tried and True
• Validated & Iterative
Keyword SearchingStep
• Inexpensive TrainingPurpose
• Not used in All
ApproachesVariations
• Drives Efficiency
Why It
Matters
motorcycle or bike AND ((throttle or accel*) w/10 stick)
Seed Set
Building the Seed Set
• Review Strategically
Sampled DocsStep
• Generates High-level
Relevancy “Heat Map”Purpose
• Random, Strategic,
Judgmental SamplesVariations
• Drives Efficiency
Why It
Matters
Predicting Responsiveness
The Prediction Engine
Prediction
Engine
Relatedness
Map
Seed Set /
Search
Training
Definitely
Predictive Calls
Responsive?
Definitely Not
The three categories of information we know are fed into the system’s algorithm, which
evaluates the data to score the likelihood of each document’s being responsive.
Assessing the Results
Building the Answer Key
•Assess Accuracy Based on
Industry Standard MetricsStep
•Informs Decision to Stop TARPurpose
•Simple and Stratified Sampling
•Sample Once or Multiple TimesVariations
•Defensibility
Why It
Matters
Definitely
Predictive Calls
Responsive?
Definitely Not
Training / Learning
Continual Refinement
Definitely
Predictive Calls
Responsive?
Definitely Not
Refining keyword searches and manually reviewing documents with highest
levels of uncertainty moves docs from the middle toward the endpoints.
• Reviewers Train and
System LearnsStep
• Transfer Subject Matter
Expertise to TAR SystemPurpose
• Active Learning
• Passive LearningVariations
• Dramatic Cost Savings
Why It
Matters
Post-TAR
Producing the Responsive Documents
• Terminate TAR Review
• Decision based on Accuracy and Cost Metrics
• “Stabilization”
• Harvest Predicted Calls
• Review Responsive Docs
• Sample Non-Responsive Docs
• Document Entire Process
Accuracy Metrics
How Accuracy is Measured TAR improves
the F1 score by
moving
documents
from false
(incorrect) bins
to the true bins
where they
belong.
Selected TAR Bibliography
TAR Resources
1. Search, Forward: Will Manual Document Review and Keyword Searches
be Replaced by Computer-assisted Coding? (2011)
• Judge Andrew Peck
• http://www.law.com/jsp/lawtechnologynews/PubArticleLTN.jsp?id=120251653
0534
2. Technology-Assisted Review in E-Discovery can be More Effective and
More Efficient than Exhaustive Manual Review (2011)
• Maura R. Grossman and Gordon V. Cormack
• http://jolt.richmond.edu/v17i3/article11.pdf
3. Where the Money Goes: Understanding Litigant Expenditures for
Producing Electronic Discovery (2012)
• RAND Institute for Civil Justice: Nicholas M. Pace, Laura Zakaras
• http://www.rand.org/pubs/monographs/MG1208.html#abstract
Thank You!
Q&A
15

More Related Content

What's hot

General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative Methodology
Statistics Solutions
 
Empirical research methods for software engineering
Empirical research methods for software engineeringEmpirical research methods for software engineering
Empirical research methods for software engineering
sarfraznawaz
 
Chapter7
Chapter7Chapter7
Chapter7
Cheryl Lawson
 
Confidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power AnalysisConfidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power Analysis
Statistics Solutions
 
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
alessio_ferrari
 
Cec2010 araujo pereziglesias
Cec2010 araujo pereziglesiasCec2010 araujo pereziglesias
Cec2010 araujo pereziglesias
Lourdes Araujo
 
SPSS Data Cleaning and Management
SPSS Data Cleaning and ManagementSPSS Data Cleaning and Management
SPSS Data Cleaning and Management
Statistics Solutions
 
RFID use in Libraries: ROI
RFID use in Libraries: ROIRFID use in Libraries: ROI
RFID use in Libraries: ROI
jeffnarver
 
RESEARCH in software engineering
RESEARCH in software engineeringRESEARCH in software engineering
RESEARCH in software engineering
Ivano Malavolta
 
Chapter 3 Methodology (Capstone Research)
Chapter 3   Methodology (Capstone Research)Chapter 3   Methodology (Capstone Research)
Chapter 3 Methodology (Capstone Research)
school
 
UX research
UX researchUX research
UX research
Billy Choi
 
Introduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchIntroduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey Research
Caroline Jarrett
 
Fact-Finding + the Lean Six Sigma World
Fact-Finding + the Lean Six Sigma WorldFact-Finding + the Lean Six Sigma World
Fact-Finding + the Lean Six Sigma WorldSteven Norton
 
Computer adaptive testing
Computer adaptive testingComputer adaptive testing
Computer adaptive testing
lilly Yz
 
How to Conduct and Interpret Tests of Differences
How to Conduct and Interpret Tests of DifferencesHow to Conduct and Interpret Tests of Differences
How to Conduct and Interpret Tests of Differences
Statistics Solutions
 
MSR End of Internship Talk
MSR End of Internship TalkMSR End of Internship Talk
MSR End of Internship TalkRay Buse
 
Systematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping StudiesSystematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping Studies
alessio_ferrari
 
Case Study Research in Software Engineering
Case Study Research in Software EngineeringCase Study Research in Software Engineering
Case Study Research in Software Engineering
alessio_ferrari
 

What's hot (20)

General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative Methodology
 
Data Analysis
Data AnalysisData Analysis
Data Analysis
 
Ch09
Ch09Ch09
Ch09
 
Empirical research methods for software engineering
Empirical research methods for software engineeringEmpirical research methods for software engineering
Empirical research methods for software engineering
 
Chapter7
Chapter7Chapter7
Chapter7
 
Confidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power AnalysisConfidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power Analysis
 
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
Qualitative Studies in Software Engineering - Interviews, Observation, Ground...
 
Cec2010 araujo pereziglesias
Cec2010 araujo pereziglesiasCec2010 araujo pereziglesias
Cec2010 araujo pereziglesias
 
SPSS Data Cleaning and Management
SPSS Data Cleaning and ManagementSPSS Data Cleaning and Management
SPSS Data Cleaning and Management
 
RFID use in Libraries: ROI
RFID use in Libraries: ROIRFID use in Libraries: ROI
RFID use in Libraries: ROI
 
RESEARCH in software engineering
RESEARCH in software engineeringRESEARCH in software engineering
RESEARCH in software engineering
 
Chapter 3 Methodology (Capstone Research)
Chapter 3   Methodology (Capstone Research)Chapter 3   Methodology (Capstone Research)
Chapter 3 Methodology (Capstone Research)
 
UX research
UX researchUX research
UX research
 
Introduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchIntroduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey Research
 
Fact-Finding + the Lean Six Sigma World
Fact-Finding + the Lean Six Sigma WorldFact-Finding + the Lean Six Sigma World
Fact-Finding + the Lean Six Sigma World
 
Computer adaptive testing
Computer adaptive testingComputer adaptive testing
Computer adaptive testing
 
How to Conduct and Interpret Tests of Differences
How to Conduct and Interpret Tests of DifferencesHow to Conduct and Interpret Tests of Differences
How to Conduct and Interpret Tests of Differences
 
MSR End of Internship Talk
MSR End of Internship TalkMSR End of Internship Talk
MSR End of Internship Talk
 
Systematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping StudiesSystematic Literature Reviews and Systematic Mapping Studies
Systematic Literature Reviews and Systematic Mapping Studies
 
Case Study Research in Software Engineering
Case Study Research in Software EngineeringCase Study Research in Software Engineering
Case Study Research in Software Engineering
 

Similar to Demystifying Predictive Coding Technology

Measuring Data Quality with DataOps
Measuring Data Quality with DataOpsMeasuring Data Quality with DataOps
Measuring Data Quality with DataOps
Steven Ensslen
 
Can we induce change with what we measure?
Can we induce change with what we measure?Can we induce change with what we measure?
Can we induce change with what we measure?
Michaela Greiler
 
Building a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management SolutionBuilding a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management Solution
Saama
 
Dare to Explore: Discover ET!
Dare to Explore: Discover ET!Dare to Explore: Discover ET!
Dare to Explore: Discover ET!
Raj Indugula
 
Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdf
SaketBansal9
 
Data and data collection procedures
Data and data collection proceduresData and data collection procedures
Data and data collection procedures
Diana Ashandy Pool Antonio
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Simon Hughes
 
staffing chapter no 8 external selection part 1, by heneman
staffing chapter no 8 external selection part 1, by henemanstaffing chapter no 8 external selection part 1, by heneman
staffing chapter no 8 external selection part 1, by heneman
fareeha zanib
 
Bab 6 Tool Support For Testing
Bab 6 Tool Support For TestingBab 6 Tool Support For Testing
Bab 6 Tool Support For Testing
lolayoriva
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
Roger Barga
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
National Information Standards Organization (NISO)
 
Quantitative Methods- Dr Ryan Thomas Williams
Quantitative Methods- Dr Ryan Thomas WilliamsQuantitative Methods- Dr Ryan Thomas Williams
Quantitative Methods- Dr Ryan Thomas Williams
Ryan Williams
 
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
Jin Young Kim
 
Các phương pháp nghiên cứu thị trường - Market research methods
Các phương pháp nghiên cứu thị trường - Market research methodsCác phương pháp nghiên cứu thị trường - Market research methods
Các phương pháp nghiên cứu thị trường - Market research methods
InfoQ - GMO Research
 
Taxonomy Validation
Taxonomy ValidationTaxonomy Validation
Taxonomy Validation
Dave Cooksey
 
2013 7 24 TAR Webinar 5 Tips & Myths Sigler
2013 7 24 TAR Webinar 5 Tips & Myths Sigler2013 7 24 TAR Webinar 5 Tips & Myths Sigler
2013 7 24 TAR Webinar 5 Tips & Myths Sigler
Sonya Sigler
 
Using Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps PracticesUsing Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps Practices
Peter Varhol
 
Fundamental of Quality Data - Anthony Ndungu
Fundamental of Quality Data - Anthony NdunguFundamental of Quality Data - Anthony Ndungu
Fundamental of Quality Data - Anthony Ndungu
World Agroforestry (ICRAF)
 
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
InsightInnovation
 
Optimising Clinical Trials Monitoring Data review - Neill Barron
Optimising Clinical Trials Monitoring Data review - Neill BarronOptimising Clinical Trials Monitoring Data review - Neill Barron
Optimising Clinical Trials Monitoring Data review - Neill Barron
Neill Barron
 

Similar to Demystifying Predictive Coding Technology (20)

Measuring Data Quality with DataOps
Measuring Data Quality with DataOpsMeasuring Data Quality with DataOps
Measuring Data Quality with DataOps
 
Can we induce change with what we measure?
Can we induce change with what we measure?Can we induce change with what we measure?
Can we induce change with what we measure?
 
Building a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management SolutionBuilding a Next Generation Clinical and Scientific Data Management Solution
Building a Next Generation Clinical and Scientific Data Management Solution
 
Dare to Explore: Discover ET!
Dare to Explore: Discover ET!Dare to Explore: Discover ET!
Dare to Explore: Discover ET!
 
Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdf
 
Data and data collection procedures
Data and data collection proceduresData and data collection procedures
Data and data collection procedures
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
 
staffing chapter no 8 external selection part 1, by heneman
staffing chapter no 8 external selection part 1, by henemanstaffing chapter no 8 external selection part 1, by heneman
staffing chapter no 8 external selection part 1, by heneman
 
Bab 6 Tool Support For Testing
Bab 6 Tool Support For TestingBab 6 Tool Support For Testing
Bab 6 Tool Support For Testing
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
 
Quantitative Methods- Dr Ryan Thomas Williams
Quantitative Methods- Dr Ryan Thomas WilliamsQuantitative Methods- Dr Ryan Thomas Williams
Quantitative Methods- Dr Ryan Thomas Williams
 
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
 
Các phương pháp nghiên cứu thị trường - Market research methods
Các phương pháp nghiên cứu thị trường - Market research methodsCác phương pháp nghiên cứu thị trường - Market research methods
Các phương pháp nghiên cứu thị trường - Market research methods
 
Taxonomy Validation
Taxonomy ValidationTaxonomy Validation
Taxonomy Validation
 
2013 7 24 TAR Webinar 5 Tips & Myths Sigler
2013 7 24 TAR Webinar 5 Tips & Myths Sigler2013 7 24 TAR Webinar 5 Tips & Myths Sigler
2013 7 24 TAR Webinar 5 Tips & Myths Sigler
 
Using Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps PracticesUsing Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps Practices
 
Fundamental of Quality Data - Anthony Ndungu
Fundamental of Quality Data - Anthony NdunguFundamental of Quality Data - Anthony Ndungu
Fundamental of Quality Data - Anthony Ndungu
 
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
 
Optimising Clinical Trials Monitoring Data review - Neill Barron
Optimising Clinical Trials Monitoring Data review - Neill BarronOptimising Clinical Trials Monitoring Data review - Neill Barron
Optimising Clinical Trials Monitoring Data review - Neill Barron
 

More from Daegis

Finding the Right Information Governance Solution for IT
Finding the Right Information Governance Solution for ITFinding the Right Information Governance Solution for IT
Finding the Right Information Governance Solution for IT
Daegis
 
5 Information Governance Budgeting Pitfalls to Avoid
5 Information Governance Budgeting Pitfalls to Avoid5 Information Governance Budgeting Pitfalls to Avoid
5 Information Governance Budgeting Pitfalls to AvoidDaegis
 
Office 365 Emails & Archiving
Office 365 Emails & ArchivingOffice 365 Emails & Archiving
Office 365 Emails & Archiving
Daegis
 
The Benefits of Hosted Archive
The Benefits of Hosted ArchiveThe Benefits of Hosted Archive
The Benefits of Hosted Archive
Daegis
 
Judicial Acceptance of Technology Assisted Review (TAR)
Judicial Acceptance of Technology Assisted Review (TAR)Judicial Acceptance of Technology Assisted Review (TAR)
Judicial Acceptance of Technology Assisted Review (TAR)
Daegis
 
Technology is the Best Defense
Technology is the Best DefenseTechnology is the Best Defense
Technology is the Best Defense
Daegis
 
Learning from Big Data – Simplify Your Workflow Using Technology Assisted Review
Learning from Big Data – Simplify Your Workflow Using Technology Assisted ReviewLearning from Big Data – Simplify Your Workflow Using Technology Assisted Review
Learning from Big Data – Simplify Your Workflow Using Technology Assisted Review
Daegis
 
Technology Assisted Review (TAR): Opening, Exploring and Bringing Transparen...
Technology Assisted Review (TAR):  Opening, Exploring and Bringing Transparen...Technology Assisted Review (TAR):  Opening, Exploring and Bringing Transparen...
Technology Assisted Review (TAR): Opening, Exploring and Bringing Transparen...
Daegis
 
Effective Internal Investigations
Effective Internal InvestigationsEffective Internal Investigations
Effective Internal Investigations
Daegis
 
Information Security in the eDiscovery Process
Information Security in the eDiscovery ProcessInformation Security in the eDiscovery Process
Information Security in the eDiscovery Process
Daegis
 
Native eDiscovery for Lotus Notes
Native eDiscovery for Lotus NotesNative eDiscovery for Lotus Notes
Native eDiscovery for Lotus Notes
Daegis
 

More from Daegis (11)

Finding the Right Information Governance Solution for IT
Finding the Right Information Governance Solution for ITFinding the Right Information Governance Solution for IT
Finding the Right Information Governance Solution for IT
 
5 Information Governance Budgeting Pitfalls to Avoid
5 Information Governance Budgeting Pitfalls to Avoid5 Information Governance Budgeting Pitfalls to Avoid
5 Information Governance Budgeting Pitfalls to Avoid
 
Office 365 Emails & Archiving
Office 365 Emails & ArchivingOffice 365 Emails & Archiving
Office 365 Emails & Archiving
 
The Benefits of Hosted Archive
The Benefits of Hosted ArchiveThe Benefits of Hosted Archive
The Benefits of Hosted Archive
 
Judicial Acceptance of Technology Assisted Review (TAR)
Judicial Acceptance of Technology Assisted Review (TAR)Judicial Acceptance of Technology Assisted Review (TAR)
Judicial Acceptance of Technology Assisted Review (TAR)
 
Technology is the Best Defense
Technology is the Best DefenseTechnology is the Best Defense
Technology is the Best Defense
 
Learning from Big Data – Simplify Your Workflow Using Technology Assisted Review
Learning from Big Data – Simplify Your Workflow Using Technology Assisted ReviewLearning from Big Data – Simplify Your Workflow Using Technology Assisted Review
Learning from Big Data – Simplify Your Workflow Using Technology Assisted Review
 
Technology Assisted Review (TAR): Opening, Exploring and Bringing Transparen...
Technology Assisted Review (TAR):  Opening, Exploring and Bringing Transparen...Technology Assisted Review (TAR):  Opening, Exploring and Bringing Transparen...
Technology Assisted Review (TAR): Opening, Exploring and Bringing Transparen...
 
Effective Internal Investigations
Effective Internal InvestigationsEffective Internal Investigations
Effective Internal Investigations
 
Information Security in the eDiscovery Process
Information Security in the eDiscovery ProcessInformation Security in the eDiscovery Process
Information Security in the eDiscovery Process
 
Native eDiscovery for Lotus Notes
Native eDiscovery for Lotus NotesNative eDiscovery for Lotus Notes
Native eDiscovery for Lotus Notes
 

Recently uploaded

The Main Procedures for Obtaining Cypriot Citizenship
The Main Procedures for Obtaining Cypriot CitizenshipThe Main Procedures for Obtaining Cypriot Citizenship
The Main Procedures for Obtaining Cypriot Citizenship
BridgeWest.eu
 
NATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptx
NATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptxNATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptx
NATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptx
anvithaav
 
Daftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdf
Daftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdfDaftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdf
Daftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdf
akbarrasyid3
 
WINDING UP of COMPANY, Modes of Dissolution
WINDING UP of COMPANY, Modes of DissolutionWINDING UP of COMPANY, Modes of Dissolution
WINDING UP of COMPANY, Modes of Dissolution
KHURRAMWALI
 
定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样
定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样
定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样
9ib5wiwt
 
Military Commissions details LtCol Thomas Jasper as Detailed Defense Counsel
Military Commissions details LtCol Thomas Jasper as Detailed Defense CounselMilitary Commissions details LtCol Thomas Jasper as Detailed Defense Counsel
Military Commissions details LtCol Thomas Jasper as Detailed Defense Counsel
Thomas (Tom) Jasper
 
怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样
怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样
怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样
9ib5wiwt
 
new victimology of indonesian law. Pptx.
new victimology of indonesian law. Pptx.new victimology of indonesian law. Pptx.
new victimology of indonesian law. Pptx.
niputusriwidiasih
 
1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样
1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样
1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样
9ib5wiwt
 
办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样
办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样
办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样
9ib5wiwt
 
Secure Your Brand: File a Trademark Today
Secure Your Brand: File a Trademark TodaySecure Your Brand: File a Trademark Today
Secure Your Brand: File a Trademark Today
Trademark Quick
 
Donald_J_Trump_katigoritirio_stormi_daniels.pdf
Donald_J_Trump_katigoritirio_stormi_daniels.pdfDonald_J_Trump_katigoritirio_stormi_daniels.pdf
Donald_J_Trump_katigoritirio_stormi_daniels.pdf
ssuser5750e1
 
Rokita Releases Soccer Stadium Legal Opinion
Rokita Releases Soccer Stadium Legal OpinionRokita Releases Soccer Stadium Legal Opinion
Rokita Releases Soccer Stadium Legal Opinion
Abdul-Hakim Shabazz
 
EMPLOYMENT LAW AN OVERVIEW in Malawi.pptx
EMPLOYMENT LAW  AN OVERVIEW in Malawi.pptxEMPLOYMENT LAW  AN OVERVIEW in Malawi.pptx
EMPLOYMENT LAW AN OVERVIEW in Malawi.pptx
MwaiMapemba
 
Responsibilities of the office bearers while registering multi-state cooperat...
Responsibilities of the office bearers while registering multi-state cooperat...Responsibilities of the office bearers while registering multi-state cooperat...
Responsibilities of the office bearers while registering multi-state cooperat...
Finlaw Consultancy Pvt Ltd
 
ALL EYES ON RAFAH BUT WHY Explain more.pdf
ALL EYES ON RAFAH BUT WHY Explain more.pdfALL EYES ON RAFAH BUT WHY Explain more.pdf
ALL EYES ON RAFAH BUT WHY Explain more.pdf
46adnanshahzad
 
The Reserve Bank of India Act, 1934.pptx
The Reserve Bank of India Act, 1934.pptxThe Reserve Bank of India Act, 1934.pptx
The Reserve Bank of India Act, 1934.pptx
nehatalele22st
 
Car Accident Injury Do I Have a Case....
Car Accident Injury Do I Have a Case....Car Accident Injury Do I Have a Case....
Car Accident Injury Do I Have a Case....
Knowyourright
 
一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理
一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理
一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理
o6ov5dqmf
 
Notes-on-Prescription-Obligations-and-Contracts.doc
Notes-on-Prescription-Obligations-and-Contracts.docNotes-on-Prescription-Obligations-and-Contracts.doc
Notes-on-Prescription-Obligations-and-Contracts.doc
BRELGOSIMAT
 

Recently uploaded (20)

The Main Procedures for Obtaining Cypriot Citizenship
The Main Procedures for Obtaining Cypriot CitizenshipThe Main Procedures for Obtaining Cypriot Citizenship
The Main Procedures for Obtaining Cypriot Citizenship
 
NATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptx
NATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptxNATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptx
NATURE, ORIGIN AND DEVELOPMENT OF INTERNATIONAL LAW.pptx
 
Daftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdf
Daftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdfDaftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdf
Daftar Rumpun, Pohon, dan Cabang Ilmu (28 Mei 2024).pdf
 
WINDING UP of COMPANY, Modes of Dissolution
WINDING UP of COMPANY, Modes of DissolutionWINDING UP of COMPANY, Modes of Dissolution
WINDING UP of COMPANY, Modes of Dissolution
 
定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样
定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样
定制(nus毕业证书)新加坡国立大学毕业证学位证书实拍图原版一模一样
 
Military Commissions details LtCol Thomas Jasper as Detailed Defense Counsel
Military Commissions details LtCol Thomas Jasper as Detailed Defense CounselMilitary Commissions details LtCol Thomas Jasper as Detailed Defense Counsel
Military Commissions details LtCol Thomas Jasper as Detailed Defense Counsel
 
怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样
怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样
怎么购买(massey毕业证书)新西兰梅西大学毕业证学位证书注册证明信原版一模一样
 
new victimology of indonesian law. Pptx.
new victimology of indonesian law. Pptx.new victimology of indonesian law. Pptx.
new victimology of indonesian law. Pptx.
 
1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样
1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样
1比1制作(swansea毕业证书)英国斯旺西大学毕业证学位证书托业成绩单原版一模一样
 
办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样
办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样
办理(waikato毕业证书)新西兰怀卡托大学毕业证双学位证书原版一模一样
 
Secure Your Brand: File a Trademark Today
Secure Your Brand: File a Trademark TodaySecure Your Brand: File a Trademark Today
Secure Your Brand: File a Trademark Today
 
Donald_J_Trump_katigoritirio_stormi_daniels.pdf
Donald_J_Trump_katigoritirio_stormi_daniels.pdfDonald_J_Trump_katigoritirio_stormi_daniels.pdf
Donald_J_Trump_katigoritirio_stormi_daniels.pdf
 
Rokita Releases Soccer Stadium Legal Opinion
Rokita Releases Soccer Stadium Legal OpinionRokita Releases Soccer Stadium Legal Opinion
Rokita Releases Soccer Stadium Legal Opinion
 
EMPLOYMENT LAW AN OVERVIEW in Malawi.pptx
EMPLOYMENT LAW  AN OVERVIEW in Malawi.pptxEMPLOYMENT LAW  AN OVERVIEW in Malawi.pptx
EMPLOYMENT LAW AN OVERVIEW in Malawi.pptx
 
Responsibilities of the office bearers while registering multi-state cooperat...
Responsibilities of the office bearers while registering multi-state cooperat...Responsibilities of the office bearers while registering multi-state cooperat...
Responsibilities of the office bearers while registering multi-state cooperat...
 
ALL EYES ON RAFAH BUT WHY Explain more.pdf
ALL EYES ON RAFAH BUT WHY Explain more.pdfALL EYES ON RAFAH BUT WHY Explain more.pdf
ALL EYES ON RAFAH BUT WHY Explain more.pdf
 
The Reserve Bank of India Act, 1934.pptx
The Reserve Bank of India Act, 1934.pptxThe Reserve Bank of India Act, 1934.pptx
The Reserve Bank of India Act, 1934.pptx
 
Car Accident Injury Do I Have a Case....
Car Accident Injury Do I Have a Case....Car Accident Injury Do I Have a Case....
Car Accident Injury Do I Have a Case....
 
一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理
一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理
一比一原版麻省理工学院毕业证(MIT毕业证)成绩单如何办理
 
Notes-on-Prescription-Obligations-and-Contracts.doc
Notes-on-Prescription-Obligations-and-Contracts.docNotes-on-Prescription-Obligations-and-Contracts.doc
Notes-on-Prescription-Obligations-and-Contracts.doc
 

Demystifying Predictive Coding Technology

  • 1. DemystifyingPredictive CodingTechnology Date: Wednesday,August13,2014 Time: 1p.m.ET/NoonCT/11a.m. MT/10a.m.PT Anita Engles, VP Products and Marketing Daegis Doug Stewart, VP Sales Support Daegis
  • 2. TAR Defined A process for prioritizing or coding a collection of electronic documents using a computerized system that harnesses human judgments of one or more Subject Matter Expert(s) on a smaller set of documents and then extrapolates those judgments to the remaining Document Population. * Grossman & Cormack 2012
  • 3. The TAR Frontlines • Evaluation of Machine-Learning Protocols for Technology-Assisted Review in Electronic Discovery (2014) • Maura R. Grossman and Gordon V. Cormack • http://cormack.uwaterloo.ca/cormack/calstudy/
  • 4. Key Findings • Non-Random Selection Methods Work Best for Seed Set • Active Learning Better than Passive Learning • Senior Level Subject Matter Experts are NOT Required to Train System
  • 5. TAR Steps Process Overview ProducingTraining Assessing Results Creating the Seed Set Keyword Searching Relatedness Scoring Identifying the Population
  • 6. Relatedness Scoring Building the Map • Build the MapStep • Measure Relationships Purpose • AlgorithmsVariations • Core to Predictive Functionality Why It Matters
  • 7. Keyword Searching Tried and True • Validated & Iterative Keyword SearchingStep • Inexpensive TrainingPurpose • Not used in All ApproachesVariations • Drives Efficiency Why It Matters motorcycle or bike AND ((throttle or accel*) w/10 stick)
  • 8. Seed Set Building the Seed Set • Review Strategically Sampled DocsStep • Generates High-level Relevancy “Heat Map”Purpose • Random, Strategic, Judgmental SamplesVariations • Drives Efficiency Why It Matters
  • 9. Predicting Responsiveness The Prediction Engine Prediction Engine Relatedness Map Seed Set / Search Training Definitely Predictive Calls Responsive? Definitely Not The three categories of information we know are fed into the system’s algorithm, which evaluates the data to score the likelihood of each document’s being responsive.
  • 10. Assessing the Results Building the Answer Key •Assess Accuracy Based on Industry Standard MetricsStep •Informs Decision to Stop TARPurpose •Simple and Stratified Sampling •Sample Once or Multiple TimesVariations •Defensibility Why It Matters Definitely Predictive Calls Responsive? Definitely Not
  • 11. Training / Learning Continual Refinement Definitely Predictive Calls Responsive? Definitely Not Refining keyword searches and manually reviewing documents with highest levels of uncertainty moves docs from the middle toward the endpoints. • Reviewers Train and System LearnsStep • Transfer Subject Matter Expertise to TAR SystemPurpose • Active Learning • Passive LearningVariations • Dramatic Cost Savings Why It Matters
  • 12. Post-TAR Producing the Responsive Documents • Terminate TAR Review • Decision based on Accuracy and Cost Metrics • “Stabilization” • Harvest Predicted Calls • Review Responsive Docs • Sample Non-Responsive Docs • Document Entire Process
  • 13. Accuracy Metrics How Accuracy is Measured TAR improves the F1 score by moving documents from false (incorrect) bins to the true bins where they belong.
  • 14. Selected TAR Bibliography TAR Resources 1. Search, Forward: Will Manual Document Review and Keyword Searches be Replaced by Computer-assisted Coding? (2011) • Judge Andrew Peck • http://www.law.com/jsp/lawtechnologynews/PubArticleLTN.jsp?id=120251653 0534 2. Technology-Assisted Review in E-Discovery can be More Effective and More Efficient than Exhaustive Manual Review (2011) • Maura R. Grossman and Gordon V. Cormack • http://jolt.richmond.edu/v17i3/article11.pdf 3. Where the Money Goes: Understanding Litigant Expenditures for Producing Electronic Discovery (2012) • RAND Institute for Civil Justice: Nicholas M. Pace, Laura Zakaras • http://www.rand.org/pubs/monographs/MG1208.html#abstract

Editor's Notes

  1. Doug transitions to this slide in explaining the TAR process without Seniors doing the training Anita asks Doug to explain the process with an example—DP NMSIC, this may have to hold until all the deep dive slides have been explained. Deep dive into the process either by anticipating that Doug is talking about a process deeply or by his verbal request to advance to the next slide.
  2. Doug explains process Anita asks clarifying questions if appropriate
  3. Doug, key point spend some time Anita, will ask clarifying questions only to steer direction
  4. Doug, segue from previous slide Anita, ask clarifying questions if needed
  5. Doug
  6. Doug explains process Anita asks clarifying questions if appropriate
  7. Doug explains process Anita asks clarifying questions if appropriate
  8. Questions to ask Doug: How do you know when to stop? What is harvesting predicated calls exactly? How would you sample non-responsive docs? How would you document the whole process? If we have not already described how successfully this worked for DP NMSIC, then this is the time to do so briefly and plug for judicial acceptance.
  9. Just in case you didn’t have enough info on TAR let’s dive into what the accuracy measurements mean to you and your review. Time permitting
  10. To get more info….