SlideShare a Scribd company logo
1 of 35
Demys&fying	
  Technology	
  
Assisted	
  Review	
  
Sonya	
  L.	
  Sigler	
  
Part	
  4:	
  Ge@ng	
  Started	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Agenda	
  
  Overview	
  of	
  TAR	
  Spectrum	
  
  Your	
  Purpose	
  for	
  Using	
  TAR	
  
  What	
  the	
  TAR	
  Technology	
  Does	
  
  The	
  TAR	
  Balancing	
  Act	
  
  People	
  
  Process	
  
  Technology	
  	
  
  QuesBons	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Overview	
  of	
  TAR	
  Spectrum	
  
Linear	
  Review	
  
Culling	
  
IteraBve	
  search	
  
Review	
  
Accelerated	
  Review	
  	
  
Email	
  Threading	
  
Near	
  Duplicate	
  DetecBon	
  
RA	
  -­‐	
  Clustering	
  
CategorizaBon	
  (Supervised)	
  
Automated	
  Review	
  	
  
Relevance	
  Ranking	
  
Machine	
  Learning	
  
Latent	
  SemanBc	
  Indexing	
  
(staBsBcal	
  probability)	
  
PaPern	
  Analysis	
  
Sampling	
  Data	
  for	
  High	
  
Precision	
  and	
  Recall	
  Rates	
  
Per	
  	
  
Document	
  
Cost	
  
Organiza3on	
  Commitment	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Underlying	
  Technologies	
  
StaBsBcal	
  -­‐	
  #s	
  based	
  
LinguisBc	
  –	
  word	
  based	
  
Key	
  word	
  Search	
  
Ontologies	
  
Lucene	
  
dtSearch	
  
Other	
  Search	
  Engines	
  
Rules	
  Based	
  Systems	
  
Bayesian	
  ClassificaBon	
  
Latent	
  SemanBc	
  Indexing	
  
Support	
  Vector	
  Models	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Your	
  Purpose	
  for	
  Using	
  TAR	
  
  BePer,	
  Cheaper,	
  Faster	
  
  CombinaBon	
  
  All	
  3…	
  
Money	
  (Cheaper)	
  
Time	
  (Faster)	
  
Accuracy	
  (BePer)	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Purpose	
  of	
  the	
  TAR	
  Technology	
  Itself	
  	
  
  Learn	
  from	
  case	
  expert	
  
  Propagate	
  informaBon	
  to	
  enBre	
  data	
  set	
  
  Provide	
  Forum	
  	
  
  Quality	
  Control	
  of	
  the	
  Training	
  
  Quality	
  Control	
  of	
  the	
  PropagaBon.	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Balancing	
  Act	
  
  Technology	
  
  Process	
  
  People	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Technology/Data	
  
  TAR	
  Tools	
  
  Data	
  Types	
  
  Data	
  Size	
  
  Types	
  of	
  Cases	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
TAR	
  Tools	
  
  Seed	
  Set	
  
  Training	
  Set	
  
  Control	
  Set	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Data	
  Types	
  	
  
  Text	
  based	
  
  Meta	
  data	
  	
  
  Images	
  
  Excel	
  files	
  
  Power	
  point	
  
  Funky	
  file	
  types	
  –	
  normalize	
  
  Sun	
  
  Mac	
  
  Online	
  (IMs,	
  Facebook,	
  Sharepoint,	
  Wiki)	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Data	
  Size	
  
  #	
  of	
  GBs	
  
  #	
  of	
  Files	
  
  #	
  of	
  Each	
  File	
  Type	
  
  Families	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
When	
  to	
  Consider	
  TAR?	
  
  Timeline	
  Pressures	
  
  2nd	
  Requests	
  
  M&A	
  TransacBons	
  
  Understanding	
  Your	
  Data	
  
  InvesBgaBons	
  (Internal,	
  Government,	
  Regulatory)	
  
  Advanced	
  Analysis	
  
  AnBtrust	
  Cases,	
  Complex	
  LiBgaBon	
  
  ProducBons	
  
  Costs	
  
  Mere	
  Compliance	
  Trumps	
  
  Large	
  Data	
  Sets,	
  MulBple	
  Data	
  Sets	
  
  ProporBonal,	
  Managed	
  Costs	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Low	
  Risk	
  Case	
  Types	
  for	
  Gebng	
  Started	
  
  Measure	
  Linear	
  Review	
  Accuracy	
  
  Internal	
  InvesBgaBons	
  
  ProducBons	
  
  Opposing	
  ProducBons	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Case	
  Type:	
  InvesBgaBons	
  	
  
  People	
  Involved	
  
  Subject	
  MaPer	
  
  Bad	
  Behavior	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Case	
  Type:	
  ProducBons	
  
  Time	
  is	
  of	
  the	
  Essence	
  
  Volume	
  is	
  Ever	
  Expanding	
  
  2	
  Case	
  Studies	
  
  1.7	
  M	
  docs	
  –	
  narrow	
  review	
  set	
  
  3	
  weeks	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Case	
  Type:	
  ArbitraBon	
  -­‐	
  3	
  Weeks	
  
366,960	
  Total	
  documents	
  
108,750	
  docs	
  –	
  IteraBve	
  
Keyword	
  Culling	
  
92,067	
  to	
  Zoom	
  
13,844	
  
Reviewed	
  
Aner	
  Zoom	
  
258,210	
  sampled	
  
16,683	
  to	
  Linear	
  Review	
  
~1,000	
  +	
  families	
   ~2,000	
  +	
  families	
  
24,209	
  Docs	
  Reviewed	
  
2,026	
  Docs	
  Produced	
  
2,440	
  docs	
  trained	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Case	
  Type:	
  ArbitraBon	
  (1.7m	
  Docs)	
  
1,713,860	
  Total	
  documents	
  
1,646,062	
  docs	
  to	
  Zoom	
  
112,285	
  Junk	
  File	
  
56,949	
  Junk	
  
Domain	
  Analysis	
  
137,000	
  
Reviewed	
  
Aner	
  Zoom	
  
File	
  types,	
  size,	
  no	
  text	
  
1,600	
  Docs	
  Trained	
  
Ongoing	
  NegoBaBons	
  
Ltd	
  Custodians	
  
~1.1M	
  docs	
  
Sampling	
  Below	
  the	
  Cutoff	
  –	
  
500	
  docs	
  (95%	
  +/-­‐	
  4.38%)	
  
~600K	
  docs	
  
Above	
  the	
  Cutoff	
  –	
  Review	
  	
  
Priv	
  Screen	
  Docs	
  Only	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Case	
  Type:	
  Opposing	
  ProducBons	
  
  Time	
  is	
  of	
  the	
  Essence	
  
  What	
  is	
  There?	
  
  What	
  is	
  Missing?	
  
  Re-­‐use	
  Training	
  
  Your	
  ProducBon	
  Training	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
TAR	
  Workflow/Process	
  
  PrioriBzed	
  Review	
  
  Culling	
  (Responsive	
  v.	
  Not	
  Responsive)	
  
  Privilege	
  Screen	
  
  Quality	
  Control	
  
Privilege	
  
Screen	
  
Non-­‐
relevant	
  
Relevant	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Equivio	
  Zoom/Relevance	
  Workflow	
  
Machine	
  
Categoriza&on	
  
Random	
  Sample	
  
Itera&on	
  
Random	
  
Sample	
  
Expert	
  
Reviews	
  
Sample	
  
Responsive	
  =	
  1	
  
Non-­‐Responsive	
  =	
  0	
  
Relevancy	
  Ranking	
  
25-­‐50	
  Rounds	
  of	
  40	
  documents	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
How	
  to	
  Decide	
  What	
  to	
  Review,	
  Sample	
  or	
  Ignore?	
  
Privilege	
  
Screen	
  
Privilege	
  
Screen	
  
Privilege	
  
Screen	
  
Non-­‐
relevant	
  
Non-­‐
relevant	
  
Non-­‐
relevant	
  
Relevant	
   Relevant	
   Relevant	
  
Review	
  
Sample	
  
Accuracy	
  
MaPers	
   Cost	
  MaPers	
   Time	
  MaPers	
  Legend	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Process:	
  PrioriBzed	
  Review	
  
  Review	
  It	
  ALL	
  
  0	
  v	
  1	
  
  0	
  -­‐	
  >	
  1	
  
  But	
  How?	
  
  Priority	
  Batching	
  
  Ranking/Scores	
  
  Topics	
  
  Email	
  Threads	
  
  Deduplicated	
  Sets	
  
  Why?	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Process:	
  Culling	
  
Non-­‐responsive	
  v	
  Responsive	
  
Where	
  is	
  the	
  Cutoff	
  Point?	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
24
Ignoring,	
  Sampling,	
  or	
  Reviewing	
  
Documents	
  that	
  are	
  highly	
  
likely	
  to	
  NOT	
  be	
  relevant	
  
Documents	
  that	
  could	
  go	
  
either	
  way	
  
Documents	
  that	
  are	
  highly	
  
likely	
  to	
  be	
  relevant	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Process:	
  Privilege	
  Screen	
  
  How	
  much	
  data?	
  
  How	
  is	
  it	
  PrioriBzed	
  for	
  Review	
  
  Who	
  Reviews	
  this	
  Data?	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Process:	
  Quality	
  Control	
  
  What	
  was	
  Len	
  Behind?	
  
  Review	
  
  Sampling	
  
  Nothing	
  
  What	
  is	
  Moving	
  Forward?	
  
  Produce	
  
  Sample,	
  then	
  Produce	
  
  Review,	
  then	
  Produce	
  
Privilege	
  
Screen	
  
Non-­‐
relevant	
  
Relevant	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
People	
  
  Case	
  Expert	
  
  Project	
  Manager	
  
  Defensibility	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Who	
  Trains	
  the	
  TAR	
  Tool?	
  
v.	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
People:	
  Case	
  Expert	
  
  Who	
  
  Why	
  
  What	
  Do	
  They	
  Do?	
  
  How	
  Long	
  Does	
  It	
  Take?	
  
  Benefits	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Assessment	
  and	
  Training	
  View	
  in	
  Equivio	
  
Document	
  
List	
  
Document	
  
Contents	
  
Ranking	
  PalePe	
  for	
  ranking	
  
all	
  issues	
  at	
  once	
  
Progress	
  
status	
  
Ranking	
  PalePe	
  for	
  ranking	
  
issues	
  separately	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
People:	
  Project	
  Manager	
  
  Who	
  
  Why	
  
  How	
  Involved	
  
  Training	
  
  ExperBse	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
People:	
  Defensibility	
  
  Who	
  
  ExperBse	
  
  Why	
  
  What	
  Do	
  They	
  Do?	
  
  Reports	
  
  Affidavits	
  
  TesBmony	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Defensibility	
  Report 	
  	
  
  Document,	
  Document,	
  Document	
  
  Transparency	
  
  Workflow	
  
  What	
  Was	
  Considered,	
  By	
  Whom?	
  
  QC	
  Process	
  
  Metrics	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Delicate	
  Balance	
  
  People	
  
  Process	
  
  Technology	
  
Technology	
  
SophisBcaBon	
  Level	
  
Required	
  Subject	
  
MaPer	
  ExperBse	
  
Involvement	
  
Elapsed	
  Time	
  	
  
Document	
  Review	
  
Expenditure	
  
Required	
  Human	
  
Resources	
  
 	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Demys&fying	
  Technology	
  Assisted	
  Review	
  
Q&A - Thank you!	
  
Sonya	
  L.	
  Sigler	
  
Vice	
  President,	
  Product	
  Strategy	
  &	
  Consul&ng	
  
SFL	
  Data	
  
415-­‐321-­‐8385	
  
sonya@sfldata.com	
  	
  
www.sfldata.com	
  	
  
Post	
  your	
  ques&ons	
  to	
  the	
  
presenter	
  in	
  the	
  chat	
  secBon	
  

More Related Content

Similar to 2013 3 27 TAR Webinar Part 4 Getting Started Sigler

Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...Zia Consulting
 
ISTQB Foundation - Chapter 3
ISTQB Foundation - Chapter 3ISTQB Foundation - Chapter 3
ISTQB Foundation - Chapter 3Chandukar
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentationnirvdrum
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineSalford Systems
 
Iseb, ISTQB Static Testing
Iseb, ISTQB Static TestingIseb, ISTQB Static Testing
Iseb, ISTQB Static Testingonsoftwaretest
 
ISTQB / ISEB Foundation Exam Practice
ISTQB / ISEB Foundation Exam PracticeISTQB / ISEB Foundation Exam Practice
ISTQB / ISEB Foundation Exam PracticeYogindernath Gupta
 
Can we induce change with what we measure?
Can we induce change with what we measure?Can we induce change with what we measure?
Can we induce change with what we measure?Michaela Greiler
 
Continuous Performance Testing and Monitoring in Agile Development
Continuous Performance Testing and Monitoring in Agile DevelopmentContinuous Performance Testing and Monitoring in Agile Development
Continuous Performance Testing and Monitoring in Agile DevelopmentNeotys
 
Everything you Always Wanted to Know About Capture into Process
Everything you Always Wanted to Know About Capture into ProcessEverything you Always Wanted to Know About Capture into Process
Everything you Always Wanted to Know About Capture into ProcessAIIM International
 
Dare to Explore: Discover ET!
Dare to Explore: Discover ET!Dare to Explore: Discover ET!
Dare to Explore: Discover ET!Raj Indugula
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxElasticsearch
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayPhillip Law
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayPhillip Law
 
An Agile Approach to Machine Learning
An Agile Approach to Machine LearningAn Agile Approach to Machine Learning
An Agile Approach to Machine LearningRandy Shoup
 
Scaling cd at the enterprise velocity2015 taco_bakker_markheistek
Scaling cd at the enterprise velocity2015 taco_bakker_markheistekScaling cd at the enterprise velocity2015 taco_bakker_markheistek
Scaling cd at the enterprise velocity2015 taco_bakker_markheistekMark Heistek
 
Just Enough (Automated) Testing
Just Enough (Automated) TestingJust Enough (Automated) Testing
Just Enough (Automated) TestingSauce Labs
 
Security Consulting Methodology
Security Consulting MethodologySecurity Consulting Methodology
Security Consulting Methodologyciso_insights
 
Qiagram
QiagramQiagram
Qiagramjwppz
 

Similar to 2013 3 27 TAR Webinar Part 4 Getting Started Sigler (20)

Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
Migrating to Alfresco Part II: The “How” – Tools & Best Practices for Renovat...
 
ISTQB Foundation - Chapter 3
ISTQB Foundation - Chapter 3ISTQB Foundation - Chapter 3
ISTQB Foundation - Chapter 3
 
2014 toronto-torbug
2014 toronto-torbug2014 toronto-torbug
2014 toronto-torbug
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search Engine
 
Iseb, ISTQB Static Testing
Iseb, ISTQB Static TestingIseb, ISTQB Static Testing
Iseb, ISTQB Static Testing
 
ISTQB / ISEB Foundation Exam Practice
ISTQB / ISEB Foundation Exam PracticeISTQB / ISEB Foundation Exam Practice
ISTQB / ISEB Foundation Exam Practice
 
Can we induce change with what we measure?
Can we induce change with what we measure?Can we induce change with what we measure?
Can we induce change with what we measure?
 
Continuous Performance Testing and Monitoring in Agile Development
Continuous Performance Testing and Monitoring in Agile DevelopmentContinuous Performance Testing and Monitoring in Agile Development
Continuous Performance Testing and Monitoring in Agile Development
 
Bestpractices
BestpracticesBestpractices
Bestpractices
 
Everything you Always Wanted to Know About Capture into Process
Everything you Always Wanted to Know About Capture into ProcessEverything you Always Wanted to Know About Capture into Process
Everything you Always Wanted to Know About Capture into Process
 
Dare to Explore: Discover ET!
Dare to Explore: Discover ET!Dare to Explore: Discover ET!
Dare to Explore: Discover ET!
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolbox
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 Day
 
How to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 DayHow to Build an Attribution Solution in 1 Day
How to Build an Attribution Solution in 1 Day
 
An Agile Approach to Machine Learning
An Agile Approach to Machine LearningAn Agile Approach to Machine Learning
An Agile Approach to Machine Learning
 
Scaling cd at the enterprise velocity2015 taco_bakker_markheistek
Scaling cd at the enterprise velocity2015 taco_bakker_markheistekScaling cd at the enterprise velocity2015 taco_bakker_markheistek
Scaling cd at the enterprise velocity2015 taco_bakker_markheistek
 
Just Enough (Automated) Testing
Just Enough (Automated) TestingJust Enough (Automated) Testing
Just Enough (Automated) Testing
 
Security Consulting Methodology
Security Consulting MethodologySecurity Consulting Methodology
Security Consulting Methodology
 
Qiagram
QiagramQiagram
Qiagram
 

More from Sonya Sigler

Georgetown lecture 2012 6 2 full
Georgetown lecture 2012 6 2 fullGeorgetown lecture 2012 6 2 full
Georgetown lecture 2012 6 2 fullSonya Sigler
 
2012 6 27 TAR Webinar Part 1 Sigler
2012 6 27 TAR Webinar Part 1 Sigler2012 6 27 TAR Webinar Part 1 Sigler
2012 6 27 TAR Webinar Part 1 SiglerSonya Sigler
 
2012 11 7 TAR Webinar Part 3 Sigler
2012 11 7 TAR Webinar Part 3 Sigler2012 11 7 TAR Webinar Part 3 Sigler
2012 11 7 TAR Webinar Part 3 SiglerSonya Sigler
 
2012 8 29 TAR Webinar Part 2 Sigler
2012 8 29 TAR Webinar Part 2 Sigler2012 8 29 TAR Webinar Part 2 Sigler
2012 8 29 TAR Webinar Part 2 SiglerSonya Sigler
 
Georgetown Law Guest Lecture 2012 6 2
Georgetown Law Guest Lecture 2012 6 2Georgetown Law Guest Lecture 2012 6 2
Georgetown Law Guest Lecture 2012 6 2Sonya Sigler
 
SF Women in eDiscovery Sept 2011
SF Women in eDiscovery Sept 2011SF Women in eDiscovery Sept 2011
SF Women in eDiscovery Sept 2011Sonya Sigler
 

More from Sonya Sigler (6)

Georgetown lecture 2012 6 2 full
Georgetown lecture 2012 6 2 fullGeorgetown lecture 2012 6 2 full
Georgetown lecture 2012 6 2 full
 
2012 6 27 TAR Webinar Part 1 Sigler
2012 6 27 TAR Webinar Part 1 Sigler2012 6 27 TAR Webinar Part 1 Sigler
2012 6 27 TAR Webinar Part 1 Sigler
 
2012 11 7 TAR Webinar Part 3 Sigler
2012 11 7 TAR Webinar Part 3 Sigler2012 11 7 TAR Webinar Part 3 Sigler
2012 11 7 TAR Webinar Part 3 Sigler
 
2012 8 29 TAR Webinar Part 2 Sigler
2012 8 29 TAR Webinar Part 2 Sigler2012 8 29 TAR Webinar Part 2 Sigler
2012 8 29 TAR Webinar Part 2 Sigler
 
Georgetown Law Guest Lecture 2012 6 2
Georgetown Law Guest Lecture 2012 6 2Georgetown Law Guest Lecture 2012 6 2
Georgetown Law Guest Lecture 2012 6 2
 
SF Women in eDiscovery Sept 2011
SF Women in eDiscovery Sept 2011SF Women in eDiscovery Sept 2011
SF Women in eDiscovery Sept 2011
 

Recently uploaded

JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuidePixlogix Infotech
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaWSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityVictorSzoltysek
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfdanishmna97
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxMarkSteadman7
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....rightmanforbloodline
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMKumar Satyam
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2
 

Recently uploaded (20)

JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 

2013 3 27 TAR Webinar Part 4 Getting Started Sigler

  • 1. Demys&fying  Technology   Assisted  Review   Sonya  L.  Sigler   Part  4:  Ge@ng  Started  
  • 2.                      Demys&fying  Technology  Assisted  Review   Agenda     Overview  of  TAR  Spectrum     Your  Purpose  for  Using  TAR     What  the  TAR  Technology  Does     The  TAR  Balancing  Act     People     Process     Technology       QuesBons  
  • 3.                      Demys&fying  Technology  Assisted  Review   Overview  of  TAR  Spectrum   Linear  Review   Culling   IteraBve  search   Review   Accelerated  Review     Email  Threading   Near  Duplicate  DetecBon   RA  -­‐  Clustering   CategorizaBon  (Supervised)   Automated  Review     Relevance  Ranking   Machine  Learning   Latent  SemanBc  Indexing   (staBsBcal  probability)   PaPern  Analysis   Sampling  Data  for  High   Precision  and  Recall  Rates   Per     Document   Cost   Organiza3on  Commitment  
  • 4.                      Demys&fying  Technology  Assisted  Review   Underlying  Technologies   StaBsBcal  -­‐  #s  based   LinguisBc  –  word  based   Key  word  Search   Ontologies   Lucene   dtSearch   Other  Search  Engines   Rules  Based  Systems   Bayesian  ClassificaBon   Latent  SemanBc  Indexing   Support  Vector  Models  
  • 5.                      Demys&fying  Technology  Assisted  Review   Your  Purpose  for  Using  TAR     BePer,  Cheaper,  Faster     CombinaBon     All  3…   Money  (Cheaper)   Time  (Faster)   Accuracy  (BePer)  
  • 6.                      Demys&fying  Technology  Assisted  Review   Purpose  of  the  TAR  Technology  Itself       Learn  from  case  expert     Propagate  informaBon  to  enBre  data  set     Provide  Forum       Quality  Control  of  the  Training     Quality  Control  of  the  PropagaBon.  
  • 7.                      Demys&fying  Technology  Assisted  Review   Balancing  Act     Technology     Process     People  
  • 8.                      Demys&fying  Technology  Assisted  Review   Technology/Data     TAR  Tools     Data  Types     Data  Size     Types  of  Cases  
  • 9.                      Demys&fying  Technology  Assisted  Review   TAR  Tools     Seed  Set     Training  Set     Control  Set  
  • 10.                      Demys&fying  Technology  Assisted  Review   Data  Types       Text  based     Meta  data       Images     Excel  files     Power  point     Funky  file  types  –  normalize     Sun     Mac     Online  (IMs,  Facebook,  Sharepoint,  Wiki)  
  • 11.                      Demys&fying  Technology  Assisted  Review   Data  Size     #  of  GBs     #  of  Files     #  of  Each  File  Type     Families  
  • 12.                      Demys&fying  Technology  Assisted  Review   When  to  Consider  TAR?     Timeline  Pressures     2nd  Requests     M&A  TransacBons     Understanding  Your  Data     InvesBgaBons  (Internal,  Government,  Regulatory)     Advanced  Analysis     AnBtrust  Cases,  Complex  LiBgaBon     ProducBons     Costs     Mere  Compliance  Trumps     Large  Data  Sets,  MulBple  Data  Sets     ProporBonal,  Managed  Costs  
  • 13.                      Demys&fying  Technology  Assisted  Review   Low  Risk  Case  Types  for  Gebng  Started     Measure  Linear  Review  Accuracy     Internal  InvesBgaBons     ProducBons     Opposing  ProducBons  
  • 14.                      Demys&fying  Technology  Assisted  Review   Case  Type:  InvesBgaBons       People  Involved     Subject  MaPer     Bad  Behavior  
  • 15.                      Demys&fying  Technology  Assisted  Review   Case  Type:  ProducBons     Time  is  of  the  Essence     Volume  is  Ever  Expanding     2  Case  Studies     1.7  M  docs  –  narrow  review  set     3  weeks  
  • 16.                      Demys&fying  Technology  Assisted  Review   Case  Type:  ArbitraBon  -­‐  3  Weeks   366,960  Total  documents   108,750  docs  –  IteraBve   Keyword  Culling   92,067  to  Zoom   13,844   Reviewed   Aner  Zoom   258,210  sampled   16,683  to  Linear  Review   ~1,000  +  families   ~2,000  +  families   24,209  Docs  Reviewed   2,026  Docs  Produced   2,440  docs  trained  
  • 17.                      Demys&fying  Technology  Assisted  Review   Case  Type:  ArbitraBon  (1.7m  Docs)   1,713,860  Total  documents   1,646,062  docs  to  Zoom   112,285  Junk  File   56,949  Junk   Domain  Analysis   137,000   Reviewed   Aner  Zoom   File  types,  size,  no  text   1,600  Docs  Trained   Ongoing  NegoBaBons   Ltd  Custodians   ~1.1M  docs   Sampling  Below  the  Cutoff  –   500  docs  (95%  +/-­‐  4.38%)   ~600K  docs   Above  the  Cutoff  –  Review     Priv  Screen  Docs  Only  
  • 18.                      Demys&fying  Technology  Assisted  Review   Case  Type:  Opposing  ProducBons     Time  is  of  the  Essence     What  is  There?     What  is  Missing?     Re-­‐use  Training     Your  ProducBon  Training  
  • 19.                      Demys&fying  Technology  Assisted  Review   TAR  Workflow/Process     PrioriBzed  Review     Culling  (Responsive  v.  Not  Responsive)     Privilege  Screen     Quality  Control   Privilege   Screen   Non-­‐ relevant   Relevant  
  • 20.                      Demys&fying  Technology  Assisted  Review   Equivio  Zoom/Relevance  Workflow   Machine   Categoriza&on   Random  Sample   Itera&on   Random   Sample   Expert   Reviews   Sample   Responsive  =  1   Non-­‐Responsive  =  0   Relevancy  Ranking   25-­‐50  Rounds  of  40  documents  
  • 21.                      Demys&fying  Technology  Assisted  Review   How  to  Decide  What  to  Review,  Sample  or  Ignore?   Privilege   Screen   Privilege   Screen   Privilege   Screen   Non-­‐ relevant   Non-­‐ relevant   Non-­‐ relevant   Relevant   Relevant   Relevant   Review   Sample   Accuracy   MaPers   Cost  MaPers   Time  MaPers  Legend  
  • 22.                      Demys&fying  Technology  Assisted  Review   Process:  PrioriBzed  Review     Review  It  ALL     0  v  1     0  -­‐  >  1     But  How?     Priority  Batching     Ranking/Scores     Topics     Email  Threads     Deduplicated  Sets     Why?  
  • 23.                      Demys&fying  Technology  Assisted  Review   Process:  Culling   Non-­‐responsive  v  Responsive   Where  is  the  Cutoff  Point?  
  • 24.                      Demys&fying  Technology  Assisted  Review   24 Ignoring,  Sampling,  or  Reviewing   Documents  that  are  highly   likely  to  NOT  be  relevant   Documents  that  could  go   either  way   Documents  that  are  highly   likely  to  be  relevant  
  • 25.                      Demys&fying  Technology  Assisted  Review   Process:  Privilege  Screen     How  much  data?     How  is  it  PrioriBzed  for  Review     Who  Reviews  this  Data?  
  • 26.                      Demys&fying  Technology  Assisted  Review   Process:  Quality  Control     What  was  Len  Behind?     Review     Sampling     Nothing     What  is  Moving  Forward?     Produce     Sample,  then  Produce     Review,  then  Produce   Privilege   Screen   Non-­‐ relevant   Relevant  
  • 27.                      Demys&fying  Technology  Assisted  Review   People     Case  Expert     Project  Manager     Defensibility  
  • 28.                      Demys&fying  Technology  Assisted  Review   Who  Trains  the  TAR  Tool?   v.  
  • 29.                      Demys&fying  Technology  Assisted  Review   People:  Case  Expert     Who     Why     What  Do  They  Do?     How  Long  Does  It  Take?     Benefits  
  • 30.                      Demys&fying  Technology  Assisted  Review   Assessment  and  Training  View  in  Equivio   Document   List   Document   Contents   Ranking  PalePe  for  ranking   all  issues  at  once   Progress   status   Ranking  PalePe  for  ranking   issues  separately  
  • 31.                      Demys&fying  Technology  Assisted  Review   People:  Project  Manager     Who     Why     How  Involved     Training     ExperBse  
  • 32.                      Demys&fying  Technology  Assisted  Review   People:  Defensibility     Who     ExperBse     Why     What  Do  They  Do?     Reports     Affidavits     TesBmony  
  • 33.                      Demys&fying  Technology  Assisted  Review   Defensibility  Report       Document,  Document,  Document     Transparency     Workflow     What  Was  Considered,  By  Whom?     QC  Process     Metrics  
  • 34.                      Demys&fying  Technology  Assisted  Review   Delicate  Balance     People     Process     Technology   Technology   SophisBcaBon  Level   Required  Subject   MaPer  ExperBse   Involvement   Elapsed  Time     Document  Review   Expenditure   Required  Human   Resources  
  • 35.                      Demys&fying  Technology  Assisted  Review   Q&A - Thank you!   Sonya  L.  Sigler   Vice  President,  Product  Strategy  &  Consul&ng   SFL  Data   415-­‐321-­‐8385   sonya@sfldata.com     www.sfldata.com     Post  your  ques&ons  to  the   presenter  in  the  chat  secBon