Graph (Theory and Databases)
Pere Urbón Bayes
Senior Software Engineer, Independent
purbon@purbon.com | purbon.com | in/purbon | @purbon
December 2010

Graph (Theory and Databases) (slide 2)
● Graph Theory
  – Definitions
  – Applications
  – Analytics
● Graph Databases
  – Definitions
  – Facts
  – Performance
  – Vendors

Graph Definitions (slide 3)
● Graph G(V,E) where V = {v1, v2, ..., vN} and E = {e1, e2, ..., eM}
  – Directed / Undirected
  – Mixed
  – Multigraph
  – Weighted
  – ...
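The definition G(V,E) can be made concrete with an adjacency list. The following is a minimal sketch, not something taken from the deck: the class name `Graph` and its methods are illustrative, and it covers only the directed, undirected, and weighted variants from the list above.

```python
from collections import defaultdict


class Graph:
    """Adjacency-list graph G(V, E); all names here are illustrative."""

    def __init__(self, directed=False):
        self.directed = directed
        self.adj = defaultdict(dict)  # vertex -> {neighbour: weight}

    def add_vertex(self, v):
        self.adj[v]  # touching the key creates an empty neighbour map

    def add_edge(self, u, v, weight=1.0):
        self.adj[u][v] = weight
        self.add_vertex(v)
        if not self.directed:
            self.adj[v][u] = weight  # undirected: store both directions

    def vertices(self):
        return set(self.adj)

    def edges(self):
        """Return (u, v, weight) tuples, one per edge."""
        seen, result = set(), []
        for u, nbrs in self.adj.items():
            for v, w in nbrs.items():
                key = (u, v) if self.directed else frozenset((u, v))
                if key not in seen:
                    seen.add(key)
                    result.append((u, v, w))
        return result


g = Graph(directed=True)
g.add_edge("v1", "v2", weight=2.5)
g.add_edge("v2", "v3")

u = Graph()  # undirected by default
u.add_edge("a", "b")
```

Storing one weight per neighbour keeps this a simple graph; a multigraph needs a different representation because it allows several edges between the same pair of vertices.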

Graph Definitions (slide 4)
● Directed graphs
● Vertex
● Edges
● From V(N) to V(M)

Graph Definitions (slide 5)
Multigraph
● More than one edge between two nodes.
● Loops: edges that start and end at the same node.
Labelling
● The process of assigning a label to vertices and edges.
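A multigraph with labels can be sketched by storing edges in a list rather than a neighbour map, so that parallel edges and loops are kept. The class `MultiGraph`, its methods, and the sample labels below are assumptions made for illustration.

```python
from collections import Counter


class MultiGraph:
    """Labelled multigraph: parallel edges and loops are allowed."""

    def __init__(self):
        self.vertex_labels = {}  # vertex -> label (or None)
        self.edges = []          # (source, target, label); duplicates allowed

    def add_vertex(self, v, label=None):
        self.vertex_labels[v] = label

    def add_edge(self, u, v, label=None):
        for x in (u, v):
            self.vertex_labels.setdefault(x, None)  # keep existing labels
        self.edges.append((u, v, label))

    def parallel_edges(self):
        """Unordered vertex pairs joined by more than one edge."""
        counts = Counter(frozenset((u, v)) for u, v, _ in self.edges if u != v)
        return {pair for pair, n in counts.items() if n > 1}

    def loops(self):
        """Edges whose source and target are the same vertex."""
        return [(u, label) for u, v, label in self.edges if u == v]


m = MultiGraph()
m.add_vertex("a", label="person")
m.add_edge("a", "b", label="knows")
m.add_edge("a", "b", label="works_with")  # second edge between a and b
m.add_edge("b", "b", label="self")        # loop
```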

Graph Theory Applications (slide 6)
● Task planning
● Scheduling
● Process assignment
● Routing
● Logistics
● League planning

Graph Theory Applications (slide 7)
● Pattern Recognition
● Dependency analysis
● Impact analysis
● Network flow
  – Traffic analysis and optimization
  – Delivery optimization
● Optimization of tasks

Graph Theory Analytics (slide 8)
● Clustering (Communities)
● Social connexions
● Hubs
● Graph Mining
● Centrality measures
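As one example of a centrality measure, degree centrality scores each vertex by the fraction of other vertices it is directly connected to, and the highest-scoring vertices can be read as hubs. This is a minimal sketch; the function names and the sample `social` network are made up for illustration, and the deck does not say which centrality measures it has in mind.

```python
def degree_centrality(adj):
    """adj: dict vertex -> set of neighbours (undirected graph)."""
    n = len(adj)
    if n < 2:
        return {v: 0.0 for v in adj}
    return {v: len(nbrs) / (n - 1) for v, nbrs in adj.items()}


def hubs(adj, top=1):
    """Return the `top` vertices with the highest degree centrality."""
    scores = degree_centrality(adj)
    return sorted(scores, key=scores.get, reverse=True)[:top]


# Hypothetical social network.
social = {
    "ana": {"bob", "cat", "dan"},
    "bob": {"ana"},
    "cat": {"ana", "dan"},
    "dan": {"ana", "cat"},
}

scores = degree_centrality(social)
top_hub = hubs(social)
```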

Graph Like Applications (slide 9)
● Recommendations
  – Heuristics (PageRank)
  – Local
    ● Shortest Paths
    ● Hammock Functions
    ● Walks
    ● Search algorithms
    ● Shooting stars
    ● K-nearest neighbours
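PageRank, listed above as a recommendation heuristic, can be sketched with plain power iteration. The damping factor of 0.85, the fixed iteration count, the treatment of dangling pages, and the sample `web` graph are all assumptions of this sketch rather than details from the deck.

```python
def pagerank(out_links, damping=0.85, iterations=50):
    """out_links: dict page -> list of pages it links to."""
    nodes = set(out_links)
    for targets in out_links.values():
        nodes.update(targets)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}  # start from a uniform distribution
    for _ in range(iterations):
        new_rank = {v: (1.0 - damping) / n for v in nodes}
        for v in nodes:
            targets = out_links.get(v, [])
            if targets:
                # Split this page's rank evenly across its out-links.
                share = damping * rank[v] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[v] / n
                for t in nodes:
                    new_rank[t] += share
        rank = new_rank
    return rank


# Hypothetical link graph.
web = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(web)
best = max(ranks, key=ranks.get)
```

In a recommendation setting the same iteration would run over a graph of users and items, with the highest-ranked items offered first.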

Graph Like Applications (slide 10)
● Location based services
● Hubs
● Spatial databases
● Logical (multi-)index construction

Web Trending Topics (slide 11)
● Semantic web
  – RDF (OWL) Store
  – RDF-Sail
  – SPARQL
● Linked data (Open Data)
● Link analysis
● Structure mining
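RDF describes data as subject, predicate, object triples, and a SPARQL query is built from triple patterns in which some positions are variables. The sketch below imitates a single triple pattern in plain Python. It is not an RDF store or a SPARQL engine, and the `ex:` identifiers and sample triples are hypothetical.

```python
def match(triples, pattern):
    """Yield triples matching (subject, predicate, object).

    None in the pattern acts as a wildcard, like a SPARQL variable.
    """
    for triple in triples:
        if all(want is None or want == got
               for want, got in zip(pattern, triple)):
            yield triple


# Hypothetical data set.
triples = [
    ("ex:john", "foaf:knows", "ex:mary"),
    ("ex:john", "ex:likes", "http://www.example.com"),
    ("ex:mary", "foaf:knows", "ex:pete"),
]

# Roughly the pattern of: SELECT ?o WHERE { ex:john foaf:knows ?o }
friends = [o for _, _, o in match(triples, ("ex:john", "foaf:knows", None))]

# All "knows" statements, whatever the subject and object.
knows = list(match(triples, (None, "foaf:knows", None)))
```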

Graph databases (slide 12)
“A graph database is a database that uses graph structures with nodes, edges, and properties to represent and store information. General graph databases that can store any graph are distinct from specialized graph databases such as triple stores and network databases.”
Wikipedia

Graph databases: Property graph (slide 13)
● Abstractions
  – Nodes
  – Relationships
  – Properties on both.
Example: John Smith liked http://www.example.com at 01/10/11
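The property graph model can be sketched with two small record types, one for nodes and one for relationships, each carrying its own property map. The class and field names below are illustrative and do not correspond to any particular vendor's API; the data is the slide's own example.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    id: str
    properties: dict = field(default_factory=dict)


@dataclass
class Relationship:
    start: Node
    type: str
    end: Node
    properties: dict = field(default_factory=dict)


# "John Smith liked http://www.example.com at 01/10/11"
john = Node("john", {"name": "John Smith"})
page = Node("page", {"url": "http://www.example.com"})
liked = Relationship(john, "LIKED", page, {"at": "01/10/11"})
```

The date lives on the relationship rather than on either node, which is what "properties on both" makes possible.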

Graph databases Facts (slide 14)
[Figure: Connectivity by decade (1990s, 2010s, 2020s). Box labels: Text files, Social Networks, Folksonomies, Blogs, Tagging, Linked Data, Ontologies, RDF, Everything connected.]

Graph databases Facts (slide 15)
[Figure: "Size of" (axis label truncated in the source) by decade (1990s, 2010s, 2020s).]
Source: http://www.guardian.co.uk/business/2009/may/18/digital-content-expansion

Graph databases Facts (slide 16)
[Figure: Performance. Labels: Lists, Graph like structures, Semantic web, Semantic reasoning, Linked data, Performance slowdown, Unstructured.]

Graph databases Performance (slide 17)

Scale 15
Kernel       DEX      Neo4j    Jena     HyperGraphDB
Load (s)     7,44     697      141      +24h
Scan (s)     0,0010   2,71     0,689
2-Hops (s)   0,0120   0,0260   0,443
BC (s)       14,8     8,24     138
Size (MB)    30       17       207

Scale 20
Kernel       DEX      Neo4j    Jena     HyperGraphDB
Load (s)     317      32.094   4.560    +24h
Scan (s)     0,005    751      18,6
2-Hops (s)   0,033    0,0230   0,4580
BC (s)       617      7027     59512
Size (MB)    893      539      6656

Values use a decimal comma; in the Scale 20 Load row the period appears to be a thousands separator.
Source: HPC Scalable Graph Analysis Benchmark, IWGD 2010
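The slide gives only the kernel names. Assuming the 2-Hops row measures collecting every vertex within two edges of a start vertex, that operation can be sketched as a depth-limited breadth-first search. This is a reading of the row label, not the benchmark's reference code, and the function name and sample graph are made up.

```python
from collections import deque


def k_hops(adj, source, k=2):
    """Vertices reachable from source in at most k edges (source excluded)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if dist[u] == k:
            continue  # do not expand beyond k hops
        for v in adj.get(u, ()):
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return {v for v in dist if v != source}


# Hypothetical directed graph as adjacency lists.
adj = {"a": ["b"], "b": ["c", "d"], "c": ["e"], "d": [], "e": []}
two_hops = k_hops(adj, "a")
```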

Graph databases Vendors (slide 18)
● Neo4j: Open source NoSQL graph database.
● DEX: The high performance graph database.
● HyperGraphDB: An AI and semantic web graph database.
● InfoGrid: The Internet Graph database.
● Sones: SaaS .NET graph database.
● VertexDB: High performance database server.

Thanks! (slide 19)
purbon@purbon.com
December 2010

Pere Urbón-Bayes, Software Engineer at Universitat Politècnica de Catalunya:
Sorry, it's a typo, thanks for the notice, -_-!. OrientDB is currently a must-see graph database; I haven't included it because I recycled an old slide. I'll update it as soon as possible.

Luca Garulli, CEO at Orient Technologies LTD:
Hi Pere, it's a shame you haven't included OrientDB in the slides.

Lvc@

Pere Urbón-Bayes, Software Engineer at Universitat Politècnica de Catalunya:
Yes, I got an email from SlideShare saying that my slides were hot on Twitter, xD!! Thanks for the notice.

Patrick Durusau, Independent Consultant/Standards Editor at Home Office:
Great slide deck! One small quibble. Slide 14 has the obligatory progress of connectivity slide.

Actually, everything is already connected. Each of the labeled boxes represents efforts that resulted in partial representations of that connectedness.

To put it differently, the world is a lot larger and more nuanced than any of the representation strategies listed on this slide.

Jonathan Boutelle, Director of Technology at SlideShare:
Awesome ... found this 'cause it was hot on Twitter. BTW, did you get an email about that?
