SlideShare a Scribd company logo
1 of 18
A FAMILIAR STORY 
HERE IS A MOUNTAIN OF PAPER DOCUMENTS. . . 
By: Brad Stauber, MSU College of Law
WHAT DO WE SEE? 
• Long and Expensive 
Process 
• Humans get tired, upset, 
hungover, etc. 
• Mistakes can happen
MAJOR SHIFTS IN 
• The Economy 
• The Customer 
• The Technology 
• The Data
THE 
ECONOMY 
• THE GREAT RECESSION 
• Individuals, Corporations, and 
Law Firms 
• Hurt financially 
• Forced to institute belt-tightening 
measures 
• Tighter financing requirements
THE 
CUSTOMER 
• No longer willing to hand 
out exorbitant amounts of 
money for e-discovery 
• Greater emphasis on: 
• Better, Faster, Cheaper 
• “Going Green” 
• More tech savvy
THE 
TECHNOLOGY 
• Emergence of the 
“Cloud” 
• Increased Computing 
Power 
• Decrease in Data 
Storage Costs 
• Moore’s Law
THE DATA
THE EFFECT OF 
THESE MAJOR 
SHIFTS 
• In The Past, 
• All You Needed was Keyword 
Searches and Manual Review 
• Now and In Future, 
• You Would Be Wise to Have 
Predictive Coding and Clustering 
In Your E-Discovery Arsenal
PREDICTIVE CODING 
• Combines the Efficiencies of: 
• A Computerized Sampling System, and 
• A Human Expert 
• Components: 
• Data 
• Complex Algorithms 
• Software/Programs 
• Human Input 
• Samples and Tests
HOW DOES THE PROCESS WORK? 
• THIS TRAINING PROCESS CAN BE 
REPEATED AND THE PROGRAM CAN 
CONTINUE TO LEARN. 
• THUS, IMPROVING THE ACCURACY OF ITS 
OUTPUT
WHERE HAVE WE SEEN 
PREDICTIVE 
ANALYTICS BEFORE? 
• Google 
• Translate, Spell Check, 
Searches 
• Netflix 
• Movie Suggestions 
• Pandora 
• Predicts songs that the 
user should like 
…and the list goes on and on.
PROS OF PREDICTIVE CODING 
• Faster than Linear Review 
• Recalls a Higher Percentage of the Relevant Documents 
• Higher Precision than a Human Document Reviewer 
• Cost Savings 
• Allows Law Firms to Do More with Less
CONS OF PREDICTIVE CODING 
• The “smoking gun” may be missed 
• Privileged documents may be produced 
• Savings may not be as good as advertised 
• Training Costs 
• Document Review Costs
CLUSTERING 
• The program analyzes the text of 
ESI and groups related documents 
together into clusters. 
• Clusters can be: 
• Concept-Based 
• Restricted to certain keywords 
• Duplicates or Near-Duplicates 
• E.g., A Cluster of Bob’s Emails 
to Sally
ADVANTAGES OF CLUSTERING 
•Quickly identify major topics and 
sub-topics. 
•Apply tags to a single document, a 
cluster of documents, or a group of 
clusters 
•Clustering helps to reveal document 
relationships and context. 
•Identify near-duplicates and process 
them as a unit or individually. 
•Automatic categorization.
CONSIDERATIONS 
• Implementation Considerations: 
• Software Costs 
• Training Costs 
• Mistakes Will Be Made 
• Risk that a computer can’t find something that a 
human could 
• How long will this technology last?
WHY ARE SOME 
LAWYERS STILL IN 
THE STONE AGE? 
• Business Model? 
• More Time Used = More Billable Hours 
= Greater Equity Share 
• Belief That Legal Services Should 
Require Human Input? 
• Fear of Change?
CONCLUSION 
• Lawyers Need to Adapt 
• Lawyers Need to Conquer Their 
Fear 
• Predictive Coding and Clustering 
allow you to deliver legal 
services 
• Better, Faster, and Cheaper 
Because Sooner or Later, 
Relying on Linear Review and Keyword 
Searches WILL BE CONSIDERED……………..

More Related Content

Viewers also liked

Diagnosa keperawatan keluarga
Diagnosa keperawatan keluarga Diagnosa keperawatan keluarga
Diagnosa keperawatan keluarga pjj_kemenkes
 
MoreS - Smart House for... Non-Smart People
MoreS - Smart House for... Non-Smart PeopleMoreS - Smart House for... Non-Smart People
MoreS - Smart House for... Non-Smart PeopleAlessio Reggiani
 
Why the Possum’s Tail is Bare
Why the Possum’s Tail is BareWhy the Possum’s Tail is Bare
Why the Possum’s Tail is Barecoolstuff
 
Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...
Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...
Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...laboratoridalbasso
 
Práctica sobre geometría de undécimo año 2015
Práctica sobre geometría de undécimo año 2015Práctica sobre geometría de undécimo año 2015
Práctica sobre geometría de undécimo año 2015MCMurray
 

Viewers also liked (9)

Diagnosa keperawatan keluarga
Diagnosa keperawatan keluarga Diagnosa keperawatan keluarga
Diagnosa keperawatan keluarga
 
O livro a Mulher V
O livro a Mulher VO livro a Mulher V
O livro a Mulher V
 
Marina nabil's CV
Marina nabil's CVMarina nabil's CV
Marina nabil's CV
 
Endocrine system
Endocrine systemEndocrine system
Endocrine system
 
MoreS - Smart House for... Non-Smart People
MoreS - Smart House for... Non-Smart PeopleMoreS - Smart House for... Non-Smart People
MoreS - Smart House for... Non-Smart People
 
Why the Possum’s Tail is Bare
Why the Possum’s Tail is BareWhy the Possum’s Tail is Bare
Why the Possum’s Tail is Bare
 
Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...
Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...
Ldb 25 strumenti gis e webgis_2014-05-26 vaira - architetture dei sistemi inf...
 
Práctica sobre geometría de undécimo año 2015
Práctica sobre geometría de undécimo año 2015Práctica sobre geometría de undécimo año 2015
Práctica sobre geometría de undécimo año 2015
 
DIE LINKE. [Mindestlohn]
DIE LINKE. [Mindestlohn]DIE LINKE. [Mindestlohn]
DIE LINKE. [Mindestlohn]
 

Similar to Ignite Presentation

Correlation does not mean causation
Correlation does not mean causationCorrelation does not mean causation
Correlation does not mean causationPeter Varhol
 
Industrial Data Science
Industrial Data ScienceIndustrial Data Science
Industrial Data ScienceNiko Vuokko
 
ACEDS-Zylab 4-3-15 Webcast
ACEDS-Zylab 4-3-15 Webcast ACEDS-Zylab 4-3-15 Webcast
ACEDS-Zylab 4-3-15 Webcast Logikcull.com
 
Data Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from OktaData Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from OktaTin Ho
 
KAI, the Information Specialist
KAI, the Information SpecialistKAI, the Information Specialist
KAI, the Information Specialistaik762
 
Community Action - Paper-Mess to Paper-Less
Community Action - Paper-Mess to Paper-LessCommunity Action - Paper-Mess to Paper-Less
Community Action - Paper-Mess to Paper-LessLarry Phelps
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkVivian S. Zhang
 
Systemising advice
Systemising adviceSystemising advice
Systemising adviceDavid Harvey
 
Kontextdrivna krav - effektiv kravställning för din organisation
Kontextdrivna krav - effektiv kravställning för din organisationKontextdrivna krav - effektiv kravställning för din organisation
Kontextdrivna krav - effektiv kravställning för din organisationADDQ
 
Law Practice Management - Organization, Cloud, Social Media and Ethics
Law Practice Management - Organization, Cloud, Social Media and EthicsLaw Practice Management - Organization, Cloud, Social Media and Ethics
Law Practice Management - Organization, Cloud, Social Media and EthicsJennifer Ellis, JD, LLC
 
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...Health IT Conference – iHT2
 
Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks Lucidworks
 
Designing fit for purpose data structures
Designing fit for purpose data structuresDesigning fit for purpose data structures
Designing fit for purpose data structuresStephen Bounds
 
PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...
PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...
PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...Adlib - The PDF Experts
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantLynne Thomas
 
Agility for big data
Agility for big data Agility for big data
Agility for big data Charlie Cheng
 
Internal Investigations and the Cloud
Internal Investigations and the CloudInternal Investigations and the Cloud
Internal Investigations and the CloudDan Michaluk
 
MSCPA Update 2017
MSCPA Update 2017MSCPA Update 2017
MSCPA Update 2017Allen Lloyd
 
Using Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps PracticesUsing Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps PracticesPeter Varhol
 

Similar to Ignite Presentation (20)

Correlation does not mean causation
Correlation does not mean causationCorrelation does not mean causation
Correlation does not mean causation
 
Industrial Data Science
Industrial Data ScienceIndustrial Data Science
Industrial Data Science
 
ACEDS-Zylab 4-3-15 Webcast
ACEDS-Zylab 4-3-15 Webcast ACEDS-Zylab 4-3-15 Webcast
ACEDS-Zylab 4-3-15 Webcast
 
Data Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from OktaData Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from Okta
 
KAI, the Information Specialist
KAI, the Information SpecialistKAI, the Information Specialist
KAI, the Information Specialist
 
Community Action - Paper-Mess to Paper-Less
Community Action - Paper-Mess to Paper-LessCommunity Action - Paper-Mess to Paper-Less
Community Action - Paper-Mess to Paper-Less
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
 
Systemising advice
Systemising adviceSystemising advice
Systemising advice
 
Kontextdrivna krav - effektiv kravställning för din organisation
Kontextdrivna krav - effektiv kravställning för din organisationKontextdrivna krav - effektiv kravställning för din organisation
Kontextdrivna krav - effektiv kravställning för din organisation
 
Law Practice Management - Organization, Cloud, Social Media and Ethics
Law Practice Management - Organization, Cloud, Social Media and EthicsLaw Practice Management - Organization, Cloud, Social Media and Ethics
Law Practice Management - Organization, Cloud, Social Media and Ethics
 
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
CHIME LEAD New York 2014 "Case Studies from the Field: Putting Cyber Security...
 
Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks Measuring ROI on Enterprise Search - John Lenker, Lucidworks
Measuring ROI on Enterprise Search - John Lenker, Lucidworks
 
Lsntap triage and expert systems slides
Lsntap triage and expert systems slidesLsntap triage and expert systems slides
Lsntap triage and expert systems slides
 
Designing fit for purpose data structures
Designing fit for purpose data structuresDesigning fit for purpose data structures
Designing fit for purpose data structures
 
PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...
PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...
PRESENTATION: Capture. Compliance. Centralization. How Advanced Rendering Del...
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
 
Agility for big data
Agility for big data Agility for big data
Agility for big data
 
Internal Investigations and the Cloud
Internal Investigations and the CloudInternal Investigations and the Cloud
Internal Investigations and the Cloud
 
MSCPA Update 2017
MSCPA Update 2017MSCPA Update 2017
MSCPA Update 2017
 
Using Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps PracticesUsing Machine Learning to Optimize DevOps Practices
Using Machine Learning to Optimize DevOps Practices
 

Ignite Presentation

  • 1. A FAMILIAR STORY HERE IS A MOUNTAIN OF PAPER DOCUMENTS. . . By: Brad Stauber, MSU College of Law
  • 2. WHAT DO WE SEE? • Long and Expensive Process • Humans get tired, upset, hungover, etc. • Mistakes can happen
  • 3. MAJOR SHIFTS IN • The Economy • The Customer • The Technology • The Data
  • 4. THE ECONOMY • THE GREAT RECESSION • Individuals, Corporations, and Law Firms • Hurt financially • Forced to institute belt-tightening measures • Tighter financing requirements
  • 5. THE CUSTOMER • No longer willing to hand out exorbitant amounts of money for e-discovery • Greater emphasis on: • Better, Faster, Cheaper • “Going Green” • More tech savvy
  • 6. THE TECHNOLOGY • Emergence of the “Cloud” • Increased Computing Power • Decrease in Data Storage Costs • Moore’s Law
  • 8. THE EFFECT OF THESE MAJOR SHIFTS • In The Past, • All You Needed was Keyword Searches and Manual Review • Now and In Future, • You Would Be Wise to Have Predictive Coding and Clustering In Your E-Discovery Arsenal
  • 9. PREDICTIVE CODING • Combines the Efficiencies of: • A Computerized Sampling System, and • A Human Expert • Components: • Data • Complex Algorithms • Software/Programs • Human Input • Samples and Tests
  • 10. HOW DOES THE PROCESS WORK? • THIS TRAINING PROCESS CAN BE REPEATED AND THE PROGRAM CAN CONTINUE TO LEARN. • THUS, IMPROVING THE ACCURACY OF ITS OUTPUT
  • 11. WHERE HAVE WE SEEN PREDICTIVE ANALYTICS BEFORE? • Google • Translate, Spell Check, Searches • Netflix • Movie Suggestions • Pandora • Predicts songs that the user should like …and the list goes on and on.
  • 12. PROS OF PREDICTIVE CODING • Faster than Linear Review • Recalls a Higher Percentage of the Relevant Documents • Higher Precision than a Human Document Reviewer • Cost Savings • Allows Law Firms to Do More with Less
  • 13. CONS OF PREDICTIVE CODING • The “smoking gun” may be missed • Privileged documents may be produced • Savings may not be as good as advertised • Training Costs • Document Review Costs
  • 14. CLUSTERING • The program analyzes the text of ESI and groups related documents together into clusters. • Clusters can be: • Concept-Based • Restricted to certain keywords • Duplicates or Near-Duplicates • E.g., A Cluster of Bob’s Emails to Sally
  • 15. ADVANTAGES OF CLUSTERING •Quickly identify major topics and sub-topics. •Apply tags to a single document, a cluster of documents, or a group of clusters •Clustering helps to reveal document relationships and context. •Identify near-duplicates and process them as a unit or individually. •Automatic categorization.
  • 16. CONSIDERATIONS • Implementation Considerations: • Software Costs • Training Costs • Mistakes Will Be Made • Risk that a computer can’t find something that a human could • How long will this technology last?
  • 17. WHY ARE SOME LAWYERS STILL IN THE STONE AGE? • Business Model? • More Time Used = More Billable Hours = Greater Equity Share • Belief That Legal Services Should Require Human Input? • Fear of Change?
  • 18. CONCLUSION • Lawyers Need to Adapt • Lawyers Need to Conquer Their Fear • Predictive Coding and Clustering allow you to deliver legal services • Better, Faster, and Cheaper Because Sooner or Later, Relying on Linear Review and Keyword Searches WILL BE CONSIDERED……………..

Editor's Notes

  1. Processor speeds, or overall processing power for computers will double every two years. 
  2. Predictive Coding combines the efficiencies of a computerized sampling system with a human “expert.” The human interacts with the system by making “yes/no” calls to a question against a series of controlled samples of 40 documents at a time. Questions can be “Is this document responsive?” or “Does it pertain to this specific issue?” or “Is this document privileged?”, etc.
  3. In my research, there was a pessimistic article that mentioned these concerns.
  4. To conclude,