Graph Databases, The Web of Data Storage Engines
Upcoming SlideShare
Loading in...5
×
 

Graph Databases, The Web of Data Storage Engines

on

  • 6,616 views

 

Statistics

Views

Total Views
6,616
Views on SlideShare
6,400
Embed Views
216

Actions

Likes
1
Downloads
141
Comments
0

8 Embeds 216

http://www.graph-database.org 205
http://static.slidesharecdn.com 3
http://pinterest.com 2
http://blog.newitfarmer.com 2
http://graph-databases.org 1
http://www.twylah.com 1
http://translate.googleusercontent.com 1
https://www.linkedin.com 1
More...

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Graph Databases, The Web of Data Storage Engines Graph Databases, The Web of Data Storage Engines Presentation Transcript

  • Graph databases, the Web of Data storage engines Pere Urbón Bayes Senior Software Engineer Independent purbon@purbon.com purbon.com in/purbon February of 2010 @purbon
  • Graph databases, the Web of Data storage engines● We are going to talk about – Graph databases, facts and definitions. – Graph database vendors. – Use cases and applications, graph theory. The web of data storage engines - DataDevRoom - Fosdem 2011 2
  • Graph databases, the Web of Data storage engines“A graph database is a database that uses graph structures with nodes, edges, and properties to represent and store information. General graph databases that can store any graph are distinct from specialized graph databases such as triple stores and network databases.” Wikipedia The web of data storage engines - DataDevRoom - Fosdem 2011 3
  • Graph Database Property graph● Abstractions – Nodes – Relationships – Properties on both. John smith liked http://www.example.com at 01/10/11 The web of data storage engines - DataDevRoom - Fosdem 2011 4
  • Graph databases FactsConnectivity Everything connected RDF Ontologies Linked Data Tagging Blogs Folksonomies Social Networks Text files 1990s 2010s 2020s Decades The web of data storage engines - DataDevRoom - Fosdem 2011 5
  • Graph databases FactsSize of 1990s 2010s 2020s Decades http://www.guardian.co.uk/business/2009/may/18/digital-content-expansion The web of data storage engines - DataDevRoom - Fosdem 2011 6
  • Graph databases FactsPerformance Lists Graph like structures Semantic web Semantic reasoning Linked data Performance slowdown Unstructured The web of data storage engines - DataDevRoom - Fosdem 2011 7
  • Graph databases Performance comparison Query RDBMS OIM GraphDB Q1: count 20.38 17.35 0 Q2: projection 17.34 43.7 33.19 Q3: scan 32.76 174.64 3.14 Q4: values 12.28 20.77 0.01 Q5: select 7.34 5.43 0.84 Q6: hubs >3hours >3hours 624.68 RDBMS OIM GraphDB data 27.36 GB 54 GB 9.69 GBoverhead 10.9 21.51 3.86 load 52891 s 17543 s 95579 s The web of data storage engines - DataDevRoom - Fosdem 2011 8
  • Graph databases Vendors● Neo4J (neo4j.org)● Embedded, disk-based, fully transactional Java persistence engine that stores data structured in graphs rather than in tables.● Dual-Licensed AGPL and Commercial.● High Availability, scalability, concurrent,etc. The web of data storage engines - DataDevRoom - Fosdem 2011 9
  • Graph databases Vendors● InfiniteGraph● A java distributed, scalable, with high performance results commercial graph database, provided with the experience of Objectivity Inc.● More info: http://www.infinitegraph.com/ The web of data storage engines - DataDevRoom - Fosdem 2011 10
  • Graph databases Vendors● OrientDB● An embedded pure java fast, transactional, scalable document-graph storage engine.● Schema free, ACID, suport for SQL and JSON.● Apache License 2.0● More info: http://www.orientechnologies.com/ The web of data storage engines - DataDevRoom - Fosdem 2011 11
  • Graph databases More Vendors● Dex: The high performance graph database.● HyperGraphDB: An IA and semantic web graph database.● Infogrid: The Internet graph database.● Sones: SaaS dot Net graph database.● AllegroGraph: The semantic graph database.● VertexDB: High performance database server. The web of data storage engines - DataDevRoom - Fosdem 2011 12
  • Graph Theory analytics● Clustering ● Task planning (Communities) ● Scheduling● Social connexions ● Process assignation● Hubs ● Routing● Graph Mining ● Logistics● Centrality measures ● League planning The web of data storage engines - DataDevRoom - Fosdem 2011 13
  • Graph Theory Applications● Pattern Recognition● Dependency analysis● Impact analysis● Network flow – Traffic analysis and optimization – Delivery optimization● Optimization of tasks The web of data storage engines - DataDevRoom - Fosdem 2011 14
  • Graph Like Applications● Recommendations – Heuristics (PageRank) – Local ● Shortest Paths ● Hammock Functions ● Walks ● Search algorithms ● Shooting stars ● K-nearest neighbours The web of data storage engines - DataDevRoom - Fosdem 2011 15
  • Graph Like Applications● Location based services● Hubs● Spatial databases● Logical (multi-)index construction The web of data storage engines - DataDevRoom - Fosdem 2011 16
  • Web Trending Topics● Semantic web – RDF (OWL) Store – RDF-Sail – SPARQL● Linked data (Open Data)● Link analysis● Structure mining The web of data storage engines - DataDevRoom - Fosdem 2011 17
  • Graph databases Performance HPC Scalable Graph Analysis Benchmark IWGD 2010Kernel DEX Neo4j Jena HyperGraphDBScale 15Load(s) 7,44 697 141 +24hScan (s) 0,0010 2,71 0,6892-Hops(s) 0,0120 0,0260 0,443BC (s) 14,8 8,24 138Size (MB) 30 17 207 Kernel DEX Neo4j Jena HyperGraph Scale 20 DB Load(s) 317 32.094 4.560 +24h Scan (s) 0,005 751 18,6 2-Hops(s) 0,033 0,0230 0,4580 BC (s) 617 7027 59512 Size (MB) 893 539 6656 The web of data storage engines - DataDevRoom - Fosdem 2011 18
  • Graph databases XI FOSDEM DinnerInterested in Graph Databases and NoSQL, attending this year FOSDEM. Meeting point: 20:00 PM In front of Le Roy dEspagne Grand Place 1 Brussels The web of data storage engines - DataDevRoom - Fosdem 2011 19
  • Graph databases Moviepilot is hiringInterested in movies, data analytics, ruby, git, opensource. Join us!.Moviepilot is a leading provider and discovery services for movies and TV series, based in Berlin. Interested, talk with @jannis or @purbon The web of data storage engines - DataDevRoom - Fosdem 2011 20
  • Graph databases, the Web of Data storage engines Questions? Pere Urbón Bayes Senior Software Engineer Independent purbon@purbon.com February of 2010 The web of data storage engines - DataDevRoom - Fosdem 2011 21