NoSQL War Stories preso: Hadoop and Neo4j for networks
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
2,286
On Slideshare
2,284
From Embeds
2
Number of Embeds
2

Actions

Shares
Downloads
17
Comments
0
Likes
0

Embeds 2

https://si0.twimg.com 1
https://twitter.com 1

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. TABLE_DUMP2|1332345590|B|195.66.224.97|1299|1.11.64.0/21| 1299 6461 9318 38091|EGP|195.66.224.97|0|0||NAG|| AS1299 AS9318 AS6461 AS38091
  • 2. http://www.cascading.org/
  • 3. [head] GlobHfs[/Users/friso/Downloads/bview/alltxt.txt] [{14}:proto, time, type, peerip, peeras, prefix, path, origin, nexthop, localpref, MED, community, AAGG, aggregator] [{14}:proto, [{14}:proto, time, type, peerip, peeras, prefix, path, origin, nexthop, localpref, MED, community, AAGG, aggregator] [{14}:proto,Each(nodes)[PathToNodes[decl:id, name]] [{2}:id, name] Each(edges)[PathToEdges[decl:from, to, updatecount]] [{2}:id, name] [{3}:from, to, updatecount] Each(nodes)[FilterPartialDuplicates[decl:id, name]] [{3}:from, to, updatecount] [{2}:id, name] GroupBy(edges)[by:[from, to]] [{2}:id, name] edges[{2}:from, to] GroupBy(nodes)[by:[id]] [{3}:from, to, updatecount] nodes[{1}:id] Every(edges)[Sum[decl:updatecount][args:1]] [{2}:id, name] [{3}:from, to, updatecount] Every(nodes)[First[decl:id, name]] [{3}:from, to, updatecount] [{2}:id, name] Hfs[TextDelimited[[from, to, updatecount]]][/tmp/edges]] [{2}:id, name] [{3}:from, to, updatecount] Hfs[TextDelimited[[id, name]]][/tmp/nodes]] [{3}:from, to, updatecount] [{2}:id, name] [{2}:id, name] [tail]
  • 4. http://gephi.org/https://gephi.org/plugins/openord-layout/
  • 5. http://bit.ly/IzWvcT and http://bit.ly/HHNNIb
  • 6. http://thejit.org/http://neo4j.org/
  • 7. org.neo4j.kernel.impl.batchinsert.BatchInserterorg.neo4j.graphdb.index.BatchInserterIndexProvider 30M nodes + 250M edges, < 20 minutes
  • 8. • No SQL was used throughout the entire codebase• (Even though it was tempting to use Hive at one point)• You can find code here: https://github.com/friso/graphs• You can find me on Twitter here: @fzk• You can find me on e-mail here: fvanvollenhoven@xebia.com