GraphConnect Europe 2016 - How the ICIJ Used Neo4j to Unravel the Panama Papers - Mar Cabra

GraphConnect Europe 2016
ICIJ

Published in: Technology

  1. Leaks, journalism & graphs: How ICIJ Used Neo4j to Unravel the Panama Papers. Mar Cabra, Editor, Data & Research Unit, The International Consortium of Investigative Journalists (ICIJ). @cabralens | @ICIJorg | icij.org
  2. Almost 200 journalists, based in 65 countries. “Our aim is to bring journalists from different countries together in teams - eliminating rivalry and promoting collaboration. Together, we aim to be the world’s best cross-border investigative team.” icij.org/about
  3. You may remember us from...
  4. +370 journalists, +100 media organizations, 76 countries
  5. Nearly one in 10 of the 31,000 tax haven companies that own British property are linked to Mossack Fonseca
  6. #panamapapers
  7. INSIDE THE 2.6 TB
  8. Redis queue; 35 x g2.xlarge Amazon instances with Ubuntu + Tesseract + Extract
  9. Lucene syntax queries with proximity matching! 400 users
  10. INSIDE THE 2.6 TB
  11. offshoreleaks.icij.org
  12. MAGIC!!
      ● 950,000 nodes, 1.2 million edges (4GB). Small, I know!!
      ● Find shortest path. Wow!
      ● Fuzzy searching
      ● Public widgets
      ● API
  13. May 9, 7pm London. Save the date… ...and download!! ;)
  14. HELP US OUT SOME?
  15. THE END …or is it? Mar Cabra, mcabra@icij.org | @cabralens | icij.org/support | bit.ly/icijgraphconnect2016
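
Slide 12 lists "Find shortest path" among the features of the graph database. As a rough illustration of what such a query computes, the following is a minimal Python sketch of breadth-first search over a tiny in-memory graph. The node names, the `EDGES` list, and the `build_adjacency` and `shortest_path` helpers are all hypothetical and do not come from the talk or the leaked data. The ICIJ database runs on Neo4j, which has its own shortest-path support, so this is not their implementation.

```python
from collections import deque

# Hypothetical toy graph: officers, intermediaries and offshore companies
# as nodes, undirected relationships as edges. Names are invented.
EDGES = [
    ("Officer A", "Company X"),
    ("Company X", "Intermediary M"),
    ("Intermediary M", "Company Y"),
    ("Company Y", "Officer B"),
    ("Officer A", "Address 1"),
]


def build_adjacency(edges):
    """Turn a list of (a, b) pairs into an undirected adjacency map."""
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def shortest_path(adjacency, start, goal):
    """Return one path with the fewest hops from start to goal, or None."""
    if start not in adjacency or goal not in adjacency:
        return None
    parents = {start: None}  # also serves as the visited set
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk the parent links back to the start, then reverse.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbour in sorted(adjacency[node]):
            if neighbour not in parents:
                parents[neighbour] = node
                queue.append(neighbour)
    return None


if __name__ == "__main__":
    graph = build_adjacency(EDGES)
    print(shortest_path(graph, "Officer A", "Officer B"))
```

Breadth-first search visits nodes in order of hop count, so the first time it reaches the goal the path it has found is one with the fewest relationships between the two entities.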
