SlideShare a Scribd company logo
© Relativity. All rights reserved.
7 Days of Playing Minesweeper, or
How to Shut Down Whistleblower Defense with Analytics
Elise Tropiano, Senior Technical Product Manager
© Relativity. All rights reserved.
Elise Tropiano
Senior Technical Product Manager, Analytics
© Relativity. All rights reserved.
Agenda
Who We Are
The
Problems
We Solve
How We
Solve These
Problems
Case Study
Who we are
• Fast-growing legal tech company
• Unstructured big data platform
enhanced with advanced
analytics, machine learning, and
powerful visualizations
• 800+ employees worldwide
• Headquartered in Chicago, with
offices in London, Kraków, Hong
Kong and Melbourne
Witamy w naszym
Krakowskim biurze
• Product Innovation Center
• Opened in September, 2015
• Focus on data transfer solutions
• Growing team up to 100 this year
© Relativity. All rights reserved.
Relativity helps manage and analyze data
relevant to litigation and investigations
© Relativity. All rights reserved.
organize data
discover the truth
act on it
© Relativity. All rights reserved.
data problems we solve
© Relativity. All rights reserved.
We live in a world where there is an electronic trail of evidence in every potential
litigation. Every sent email or document created could be relevant in a trial.
© Relativity. All rights reserved.
= 45,000,000 emails
Scale of a Hypothetical Large Case
500 people potentially involved
x 100 each sending emails per day
x 180 working days a year
x 5 years
© Relativity. All rights reserved.
It all could be potentially relevant in litigation.
And it’s not just emails that are relevant…
© Relativity. All rights reserved.
750M+ files
in the largest case in Relativity
© Relativity. All rights reserved.
our solution
© Relativity. All rights reserved.
SaaS platform
© Relativity. All rights reserved.
our platform
search & search workflow
machine learning
email analytics
workflow & applications
repository
data analytics & reporting
© Relativity. All rights reserved.
whistleblower or extortionist?
© Relativity. All rights reserved.
Drinker Biddle is a full-service law firm providing
litigation, regulatory and business solutions to public
and private corporations, multinational Fortune 100
companies and start-ups.
© Relativity. All rights reserved.
• When a senior executive for a publicly
traded company was fired for
underperformance, he made a serious
allegation on his way out the door.
• He claimed he was laid off because of
his repeated attempts to inform
officials that the company was
falsifying quarterly financial reports to
the public.
© Relativity. All rights reserved.
Kick off a long list of tasks for the
company, including waiting for a lawyer
to send them a demand letter, gearing up
for defense, coaxing out the facts as
knowledge evolves, and possibly settling
the case before even getting to the truth.
Two Options to Handle This Case
Traditional Approach
Start an internal investigation and figure
out exactly what happened and decide
how to handle it.
Analytical Approach
Cost: $1,500,000+ Cost: $80,000
© Relativity. All rights reserved.
Investigation Timeline
Days 1-3
• Collected multi-sourced data
– One million emails + thousands of complex financial reports
• Relativity Analytics: Email threading
– Grouped email conversations and found initial set of relevant
meeting notes and presentations
• Relativity Analytics: Keyword expansion
– Ran terms round in these meeting notes through keyword
expansion to find additional relevant terms
• Relativity Analytics: Clustering
– Groups documents into conceptually similar groups
– Prioritized the review of clusters containing relevant documents
to find additional sources of intelligence
DAY 1 DAY 2 DAY 3
© Relativity. All rights reserved.
What ACTUALLY Happened?
Former employee had been emailing
with his wife on his personal
account and forwarding those
emails to his work email.
The emails contained keywords and
phrases relevant to the investigation,
so Drinker Biddle were able to make a
case to collect from former employee’s
personal email, and load that into
Relativity for further review.
Drinker Biddle found evidence of the
former employee working with his wife,
an employment attorney, to develop a
case against the company, and they were
able to prove he was drafting emails
about fraudulent accounting before the
quarterly numbers were recorded.
© Relativity. All rights reserved.
Relativity Analytics helped shut down
a whistleblower defense in 7 days,
saving an estimated $1.5M+
© Relativity. All rights reserved.
How We Did It
Email Threading
DIVIDE emails into segments and understand each
segment’s metadata.
1
COMBINE emails into conversation threads.
2
IDENTIFY inclusive emails for optimal efficiency.
3
© Relativity. All rights reserved.
From: Brandon Gauthier
Sent: Tuesday, November 24, 2015 7:50 AM
To: Michael Di Salvo
Subject: Demo Email
I need some email data for a demo, can you reply
back to this email. Thanks!
From: Michael Di Salvo
Sent: Tuesday, November 24, 2015 9:52 AM
To: Brandon Gauthier
Subject: Re: Demo Email
This is me, replying to your e-mail.
You sir, are very welcome.
From: Brandon Gauthier
Sent: Tuesday, November 24, 2015 7:53 AM
To: Michael Di Salvo
Subject: Re: Demo Email
One more time! This way we can show email
segments better.
From: Michael Di Salvo
Sent: Tuesday, November 24, 2015 9:54 AM
To: Brandon Gauthier
Subject: Re: Demo Email
This is me, creating an additional segment.
From: Brandon Gauthier
Sent: Tuesday, November 24, 2015 7:50 AM
To: Michael Di Salvo
Subject: Demo Email
I need some email data for a demo, can you reply
back to this email. Thanks!
From: Michael Di Salvo
Sent: Tuesday, November 24, 2015 9:52 AM
To: Brandon Gauthier
Subject: Re: Demo Email
This is me, replying to your e-mail.
You sir, are very welcome.
© Relativity. All rights reserved.
How We Did It
Keyword
Expansion
CREATE multi-dimensional space from terms in document text.
1
IDENTIFY terms that are conceptually similar to a
user-provided query.
2
RETURN the terms for augmented searching.
3
© Relativity. All rights reserved.
“Personally-held”
© Relativity. All rights reserved.
How We Did It
Clustering
INDEX documents into multi-dimensional space.
1
HIERARCHICALLY GROUP documents into conceptually similar
groups using the document text.
2
VISUALIZE clusters for promoting corpus understanding
and searching.
3
© Relativity. All rights reserved.
© Relativity. All rights reserved.
“It’s incredibly unjust for companies to pay
a settlement if they’re unsure that a claim
has merit simply because they don’t have
the money or the resources to investigate
it properly.”
Chief Data Scientist and Partner, Drinker Biddle
Bennett Borden
© Relativity. All rights reserved.
any questions?
© Relativity. All rights reserved.
thank you

More Related Content

Similar to 7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with Analytics - Elise Tropiano, Relativity

LoanResolve Brief Presentation
LoanResolve Brief PresentationLoanResolve Brief Presentation
LoanResolve Brief Presentation
jimmymac935
 
Investigation and discovery tools in law firms
Investigation and discovery tools in law firmsInvestigation and discovery tools in law firms
Investigation and discovery tools in law firms
Clio - Cloud-Based Legal Technology
 
eTapestry Webinar
eTapestry WebinareTapestry Webinar
eTapestry Webinarmikekierce
 
Case Organization, Analysis & Presentation in the Age of eDiscovery
Case Organization, Analysis & Presentation in the Age of eDiscoveryCase Organization, Analysis & Presentation in the Age of eDiscovery
Case Organization, Analysis & Presentation in the Age of eDiscovery
LexisNexis Software Division
 
Email Marketing and Digital Copywriting
Email Marketing and Digital CopywritingEmail Marketing and Digital Copywriting
Email Marketing and Digital Copywriting
Spotler
 
Streamline Your Court Interactions With Technology
Streamline Your Court Interactions With TechnologyStreamline Your Court Interactions With Technology
Streamline Your Court Interactions With Technology
Clio - Cloud-Based Legal Technology
 
Legal Tech Innovators Showcase @ ABA TECHSHOW
Legal Tech Innovators Showcase @ ABA TECHSHOWLegal Tech Innovators Showcase @ ABA TECHSHOW
Legal Tech Innovators Showcase @ ABA TECHSHOW
Evolve Law
 
2014 ota databreach3
2014 ota databreach32014 ota databreach3
2014 ota databreach3Meg Weber
 
eTapestry webinar
eTapestry webinareTapestry webinar
eTapestry webinar
rmmcnult
 
Catelas Legal - Intelligent Discoveryor Slideshare
Catelas Legal - Intelligent Discoveryor SlideshareCatelas Legal - Intelligent Discoveryor Slideshare
Catelas Legal - Intelligent Discoveryor Slideshare
Rob Levey
 
CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...
CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...
CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...
Financial Poise
 
Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...
Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...
Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...
Julia Grosman
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and Workflows
kCura_Relativity
 
Iapp cipmExact IAPP CIPM Questions And Answers
Iapp cipmExact IAPP CIPM Questions And AnswersIapp cipmExact IAPP CIPM Questions And Answers
Iapp cipmExact IAPP CIPM Questions And Answers
Armstrongsmith
 
Tale of two law firms utah bar - january 25 2016 - final
Tale of two law firms   utah bar - january 25 2016 - finalTale of two law firms   utah bar - january 25 2016 - final
Tale of two law firms utah bar - january 25 2016 - final
Gary Allen
 
Minimize Your Client's Risk: From IP to Cash Flow
Minimize Your Client's Risk: From IP to Cash FlowMinimize Your Client's Risk: From IP to Cash Flow
Minimize Your Client's Risk: From IP to Cash Flow
Traklight.com
 
eDiscovery Perspective
eDiscovery PerspectiveeDiscovery Perspective
eDiscovery Perspective
Russ Gould
 
2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...
2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...
2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...Raj Goel
 
Data Breach Response: Before and After the Breach
Data Breach Response: Before and After the BreachData Breach Response: Before and After the Breach
Data Breach Response: Before and After the Breach
Financial Poise
 
SNW Fall 2009
SNW Fall 2009SNW Fall 2009
SNW Fall 2009
Jeff Kubacki
 

Similar to 7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with Analytics - Elise Tropiano, Relativity (20)

LoanResolve Brief Presentation
LoanResolve Brief PresentationLoanResolve Brief Presentation
LoanResolve Brief Presentation
 
Investigation and discovery tools in law firms
Investigation and discovery tools in law firmsInvestigation and discovery tools in law firms
Investigation and discovery tools in law firms
 
eTapestry Webinar
eTapestry WebinareTapestry Webinar
eTapestry Webinar
 
Case Organization, Analysis & Presentation in the Age of eDiscovery
Case Organization, Analysis & Presentation in the Age of eDiscoveryCase Organization, Analysis & Presentation in the Age of eDiscovery
Case Organization, Analysis & Presentation in the Age of eDiscovery
 
Email Marketing and Digital Copywriting
Email Marketing and Digital CopywritingEmail Marketing and Digital Copywriting
Email Marketing and Digital Copywriting
 
Streamline Your Court Interactions With Technology
Streamline Your Court Interactions With TechnologyStreamline Your Court Interactions With Technology
Streamline Your Court Interactions With Technology
 
Legal Tech Innovators Showcase @ ABA TECHSHOW
Legal Tech Innovators Showcase @ ABA TECHSHOWLegal Tech Innovators Showcase @ ABA TECHSHOW
Legal Tech Innovators Showcase @ ABA TECHSHOW
 
2014 ota databreach3
2014 ota databreach32014 ota databreach3
2014 ota databreach3
 
eTapestry webinar
eTapestry webinareTapestry webinar
eTapestry webinar
 
Catelas Legal - Intelligent Discoveryor Slideshare
Catelas Legal - Intelligent Discoveryor SlideshareCatelas Legal - Intelligent Discoveryor Slideshare
Catelas Legal - Intelligent Discoveryor Slideshare
 
CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...
CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...
CYBER SECURITY and DATA PRIVACY 2022: Data Breach Response - Before and After...
 
Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...
Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...
Michael Barber - Precon- Make Email Great Again — With an Actual Plan On How ...
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and Workflows
 
Iapp cipmExact IAPP CIPM Questions And Answers
Iapp cipmExact IAPP CIPM Questions And AnswersIapp cipmExact IAPP CIPM Questions And Answers
Iapp cipmExact IAPP CIPM Questions And Answers
 
Tale of two law firms utah bar - january 25 2016 - final
Tale of two law firms   utah bar - january 25 2016 - finalTale of two law firms   utah bar - january 25 2016 - final
Tale of two law firms utah bar - january 25 2016 - final
 
Minimize Your Client's Risk: From IP to Cash Flow
Minimize Your Client's Risk: From IP to Cash FlowMinimize Your Client's Risk: From IP to Cash Flow
Minimize Your Client's Risk: From IP to Cash Flow
 
eDiscovery Perspective
eDiscovery PerspectiveeDiscovery Perspective
eDiscovery Perspective
 
2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...
2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...
2017-01-23-Regulatory Compliance Watch - 6 Cybersecurity for Financial Servic...
 
Data Breach Response: Before and After the Breach
Data Breach Response: Before and After the BreachData Breach Response: Before and After the Breach
Data Breach Response: Before and After the Breach
 
SNW Fall 2009
SNW Fall 2009SNW Fall 2009
SNW Fall 2009
 

More from Evention

The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...
Evention
 
A/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.comA/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.com
Evention
 
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Evention
 
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Evention
 
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Evention
 
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, AdformBuilding a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Evention
 
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data ArtisansApache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Evention
 
Privacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, MapflatPrivacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, Mapflat
Evention
 
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Evention
 
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Evention
 
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Evention
 
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Evention
 
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data ArtisansStream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Evention
 
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell SpotifyScaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Evention
 
Big Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz ŚliwaBig Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz Śliwa
Evention
 
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz KołpućElastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Evention
 
H2 o deep water making deep learning accessible to everyone -jo-fai chow
H2 o deep water   making deep learning accessible to everyone -jo-fai chowH2 o deep water   making deep learning accessible to everyone -jo-fai chow
H2 o deep water making deep learning accessible to everyone -jo-fai chow
Evention
 
That won’t fit into RAM - Michał Brzezicki
That won’t fit into RAM -  Michał  BrzezickiThat won’t fit into RAM -  Michał  Brzezicki
That won’t fit into RAM - Michał Brzezicki
Evention
 
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian HueskeStream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Evention
 
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Evention
 

More from Evention (20)

The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...The Factorization Machines algorithm for building recommendation system - Paw...
The Factorization Machines algorithm for building recommendation system - Paw...
 
A/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.comA/B testing powered by Big data - Saurabh Goyal, Booking.com
A/B testing powered by Big data - Saurabh Goyal, Booking.com
 
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
Near Real-Time Fraud Detection in Telecommunication Industry - Burak Işıklı, ...
 
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
Assisting millions of active users in real-time - Alexey Brodovshuk, Kcell; K...
 
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
Machine learning security - Pawel Zawistowski, Warsaw University of Technolog...
 
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, AdformBuilding a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
Building a Modern Data Pipeline: Lessons Learned - Saulius Valatka, Adform
 
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data ArtisansApache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
Apache Flink: Better, Faster & Uncut - Piotr Nowojski, data Artisans
 
Privacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, MapflatPrivacy by Design - Lars Albertsson, Mapflat
Privacy by Design - Lars Albertsson, Mapflat
 
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
Elephants in the cloud or how to become cloud ready - Krzysztof Adamski, GetI...
 
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
Deriving Actionable Insights from High Volume Media Streams - Jörn Kottmann, ...
 
Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...Enhancing Spark - increase streaming capabilities of your applications - Kami...
Enhancing Spark - increase streaming capabilities of your applications - Kami...
 
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
Big Data Journey at a Big Corp - Tomasz Burzyński, Maciej Czyżowicz, Orange P...
 
Stream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data ArtisansStream processing with Apache Flink - Maximilian Michels Data Artisans
Stream processing with Apache Flink - Maximilian Michels Data Artisans
 
Scaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell SpotifyScaling Cassandra in all directions - Jimmy Mardell Spotify
Scaling Cassandra in all directions - Jimmy Mardell Spotify
 
Big Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz ŚliwaBig Data for unstructured data Dariusz Śliwa
Big Data for unstructured data Dariusz Śliwa
 
Elastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz KołpućElastic development. Implementing Big Data search Grzegorz Kołpuć
Elastic development. Implementing Big Data search Grzegorz Kołpuć
 
H2 o deep water making deep learning accessible to everyone -jo-fai chow
H2 o deep water   making deep learning accessible to everyone -jo-fai chowH2 o deep water   making deep learning accessible to everyone -jo-fai chow
H2 o deep water making deep learning accessible to everyone -jo-fai chow
 
That won’t fit into RAM - Michał Brzezicki
That won’t fit into RAM -  Michał  BrzezickiThat won’t fit into RAM -  Michał  Brzezicki
That won’t fit into RAM - Michał Brzezicki
 
Stream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian HueskeStream Analytics with SQL on Apache Flink - Fabian Hueske
Stream Analytics with SQL on Apache Flink - Fabian Hueske
 
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
Hopsworks Secure Streaming as-a-service with Kafka Flinkspark - Theofilos Kak...
 

Recently uploaded

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 

Recently uploaded (20)

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 

7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with Analytics - Elise Tropiano, Relativity

  • 1. © Relativity. All rights reserved. 7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with Analytics Elise Tropiano, Senior Technical Product Manager
  • 2. © Relativity. All rights reserved. Elise Tropiano Senior Technical Product Manager, Analytics
  • 3. © Relativity. All rights reserved. Agenda Who We Are The Problems We Solve How We Solve These Problems Case Study
  • 4. Who we are • Fast-growing legal tech company • Unstructured big data platform enhanced with advanced analytics, machine learning, and powerful visualizations • 800+ employees worldwide • Headquartered in Chicago, with offices in London, Kraków, Hong Kong and Melbourne
  • 5. Witamy w naszym Krakowskim biurze • Product Innovation Center • Opened in September, 2015 • Focus on data transfer solutions • Growing team up to 100 this year
  • 6. © Relativity. All rights reserved. Relativity helps manage and analyze data relevant to litigation and investigations
  • 7. © Relativity. All rights reserved. organize data discover the truth act on it
  • 8. © Relativity. All rights reserved. data problems we solve
  • 9. © Relativity. All rights reserved. We live in a world where there is an electronic trail of evidence in every potential litigation. Every sent email or document created could be relevant in a trial.
  • 10. © Relativity. All rights reserved. = 45,000,000 emails Scale of a Hypothetical Large Case 500 people potentially involved x 100 each sending emails per day x 180 working days a year x 5 years
  • 11. © Relativity. All rights reserved. It all could be potentially relevant in litigation. And it’s not just emails that are relevant…
  • 12. © Relativity. All rights reserved. 750M+ files in the largest case in Relativity
  • 13. © Relativity. All rights reserved. our solution
  • 14. © Relativity. All rights reserved. SaaS platform
  • 15. © Relativity. All rights reserved. our platform search & search workflow machine learning email analytics workflow & applications repository data analytics & reporting
  • 16. © Relativity. All rights reserved. whistleblower or extortionist?
  • 17. © Relativity. All rights reserved. Drinker Biddle is a full-service law firm providing litigation, regulatory and business solutions to public and private corporations, multinational Fortune 100 companies and start-ups.
  • 18. © Relativity. All rights reserved. • When a senior executive for a publicly traded company was fired for underperformance, he made a serious allegation on his way out the door. • He claimed he was laid off because of his repeated attempts to inform officials that the company was falsifying quarterly financial reports to the public.
  • 19. © Relativity. All rights reserved. Kick off a long list of tasks for the company, including waiting for a lawyer to send them a demand letter, gearing up for defense, coaxing out the facts as knowledge evolves, and possibly settling the case before even getting to the truth. Two Options to Handle This Case Traditional Approach Start an internal investigation and figure out exactly what happened and decide how to handle it. Analytical Approach Cost: $1,500,000+ Cost: $80,000
  • 20. © Relativity. All rights reserved. Investigation Timeline Days 1-3 • Collected multi-sourced data – One million emails + thousands of complex financial reports • Relativity Analytics: Email threading – Grouped email conversations and found initial set of relevant meeting notes and presentations • Relativity Analytics: Keyword expansion – Ran terms round in these meeting notes through keyword expansion to find additional relevant terms • Relativity Analytics: Clustering – Groups documents into conceptually similar groups – Prioritized the review of clusters containing relevant documents to find additional sources of intelligence DAY 1 DAY 2 DAY 3
  • 21. © Relativity. All rights reserved. What ACTUALLY Happened? Former employee had been emailing with his wife on his personal account and forwarding those emails to his work email. The emails contained keywords and phrases relevant to the investigation, so Drinker Biddle were able to make a case to collect from former employee’s personal email, and load that into Relativity for further review. Drinker Biddle found evidence of the former employee working with his wife, an employment attorney, to develop a case against the company, and they were able to prove he was drafting emails about fraudulent accounting before the quarterly numbers were recorded.
  • 22. © Relativity. All rights reserved. Relativity Analytics helped shut down a whistleblower defense in 7 days, saving an estimated $1.5M+
  • 23. © Relativity. All rights reserved. How We Did It Email Threading DIVIDE emails into segments and understand each segment’s metadata. 1 COMBINE emails into conversation threads. 2 IDENTIFY inclusive emails for optimal efficiency. 3
  • 24. © Relativity. All rights reserved. From: Brandon Gauthier Sent: Tuesday, November 24, 2015 7:50 AM To: Michael Di Salvo Subject: Demo Email I need some email data for a demo, can you reply back to this email. Thanks! From: Michael Di Salvo Sent: Tuesday, November 24, 2015 9:52 AM To: Brandon Gauthier Subject: Re: Demo Email This is me, replying to your e-mail. You sir, are very welcome. From: Brandon Gauthier Sent: Tuesday, November 24, 2015 7:53 AM To: Michael Di Salvo Subject: Re: Demo Email One more time! This way we can show email segments better. From: Michael Di Salvo Sent: Tuesday, November 24, 2015 9:54 AM To: Brandon Gauthier Subject: Re: Demo Email This is me, creating an additional segment. From: Brandon Gauthier Sent: Tuesday, November 24, 2015 7:50 AM To: Michael Di Salvo Subject: Demo Email I need some email data for a demo, can you reply back to this email. Thanks! From: Michael Di Salvo Sent: Tuesday, November 24, 2015 9:52 AM To: Brandon Gauthier Subject: Re: Demo Email This is me, replying to your e-mail. You sir, are very welcome.
  • 25.
  • 26.
  • 27. © Relativity. All rights reserved. How We Did It Keyword Expansion CREATE multi-dimensional space from terms in document text. 1 IDENTIFY terms that are conceptually similar to a user-provided query. 2 RETURN the terms for augmented searching. 3
  • 28. © Relativity. All rights reserved. “Personally-held”
  • 29.
  • 30. © Relativity. All rights reserved. How We Did It Clustering INDEX documents into multi-dimensional space. 1 HIERARCHICALLY GROUP documents into conceptually similar groups using the document text. 2 VISUALIZE clusters for promoting corpus understanding and searching. 3
  • 31. © Relativity. All rights reserved.
  • 32.
  • 33. © Relativity. All rights reserved. “It’s incredibly unjust for companies to pay a settlement if they’re unsure that a claim has merit simply because they don’t have the money or the resources to investigate it properly.” Chief Data Scientist and Partner, Drinker Biddle Bennett Borden
  • 34. © Relativity. All rights reserved. any questions?
  • 35. © Relativity. All rights reserved. thank you