Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Leaks, journalism & graphs
How ICIJ Used Neo4j to
Unravel the Panama Papers
Mar Cabra
Editor, Data & Research Unit
The Int...
Almost 200 journalists
Based in 65 countries
“Our aim is to bring journalists from different countries together
in teams -...
You may remember us from...
+370 journalists
+100 media organizations
76 countries
Nearly one in 10 of the 31,000 tax haven companies that own
British property are linked to Mossack Fonseca
#panamapapers
INSIDE THE 2.6 TB
Redis queue
35 x g2.xlarge Amazon instances with Ubuntu + Tesseract + Extract
Lucene syntax queries with
proximity matching!
400 users
INSIDE THE 2.6 TB
offshoreleaks.icij.org
MAGIC!!
● 950,000 nodes, 1.2 million edges (4GB)
Small, I know!!
● Find shortest path
Wow!
● Fuzzy searching
● Public widg...
May 9, 7pm London
Save the date…
...and download!! ;)
HELP US OUT SOME?
THE END
…or is it?
Mar Cabra
mcabra@icij.org | @cabralens
icij.org/support
bit.ly/icijgraphconnect2016
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra
Upcoming SlideShare
Loading in …5
×

GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra

5,085 views

Published on

GraphConnect Europe 2016
ICIJ

Published in: Technology
  • Be the first to comment

  • Be the first to like this

GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra

  1. 1. Leaks, journalism & graphs How ICIJ Used Neo4j to Unravel the Panama Papers Mar Cabra Editor, Data & Research Unit The International Consortium of Investigative Journalists (ICIJ) @cabralens | @ICIJorg icij.org
  2. 2. Almost 200 journalists Based in 65 countries “Our aim is to bring journalists from different countries together in teams - eliminating rivalry and promoting collaboration. Together, we aim to be the world’s best cross-border investigative team.” icij.org/about
  3. 3. You may remember us from...
  4. 4. +370 journalists +100 media organizations 76 countries
  5. 5. Nearly one in 10 of the 31,000 tax haven companies that own British property are linked to Mossack Fonseca
  6. 6. #panamapapers
  7. 7. INSIDE THE 2.6 TB
  8. 8. Redis queue 35 x g2.xlarge Amazon instances with Ubuntu + Tesseract + Extract
  9. 9. Lucene syntax queries with proximity matching! 400 users
  10. 10. INSIDE THE 2.6 TB
  11. 11. offshoreleaks.icij.org
  12. 12. MAGIC!! ● 950,000 nodes, 1.2 million edges (4GB) Small, I know!! ● Find shortest path Wow! ● Fuzzy searching ● Public widgets ● API
  13. 13. May 9, 7pm London Save the date… ...and download!! ;)
  14. 14. HELP US OUT SOME?
  15. 15. THE END …or is it? Mar Cabra mcabra@icij.org | @cabralens icij.org/support bit.ly/icijgraphconnect2016

×