Neo Technology, Inc Confidential
GraphConnect 2013
graphs are everywhere
Emil Eifrem
@emileifrem
#graphconnect
Neo Technology, Inc Confidential
Neo Technology, Inc Confidential
“Five richest big data sources on theWeb
include social graph, intent graph,
consumption graph, interest graph and
mobile graph.”
http://www.forbes.com/sites/ericsavitz/2012/10/23/gartner-top-10-strategic-technology-trends-for-2013/
- Gartner:“Top 10 Strategic Trends For 2013,” Oct 2012
Neo Technology, Inc Confidential
“[I]t is arguable that graph databases will have a
bigger impact on the database landscape than
Hadoop or its competitors.”
- Bloor Research, May 2012
http://www.bloorresearch.com/blog/IM-Blog/2012/5/graph-databases-nosql.html
Neo Technology, Inc Confidential
“Graph analysis is the true killer app for Big Data.”
- Forrester Research, Dec 2011
http://blogs.forrester.com/james_kobielus/11-12-19-the_year_ahead_in_big_data_big_cool_new_stuff_looms_large
Neo Technology, Inc Confidential
http://gigaom.com/2013/05/14/were-witnessing-the-rise-of-the-graph-in-big-data/
GigaOm, May 2013
Neo Technology, Inc Confidential
FastCompany, March 2013
http://www.fastcompany.com/magazine/174/exposing-yahoos-strategy-marissa-mayer
Neo Technology, Inc Confidential
FastCompany, March 2013
http://www.fastcompany.com/magazine/174/exposing-yahoos-strategy-marissa-mayer
Neo Technology, Inc Confidential
Ian Robinson,
Jim Webber & Emil Eifrem
Graph
Databases
h
Com
plim
ents
ofNeo
Technology
Neo Technology, Inc Confidential
?
Neo Technology, Inc Confidential
Neo Technology, Inc Confidential
Neo Technology, Inc Confidential
Neo Technology, Inc Confidential
NASDAQ:ORCL
Market Cap Today: ~$150B
Neo Technology, Inc Confidential
More recent examples?
Neo Technology, Inc Confidential
More recent examples?
Neo Technology, Inc Confidential
More recent examples?
Neo Technology, Inc Confidential
More recent examples?
Neo Technology, Inc Confidential
Social Graph
More recent examples?
Neo Technology, Inc Confidential
Social Graph
Link Graph
Knowledge Graph
More recent examples?
Neo Technology, Inc Confidential
Social Graph
Interest Graph
Link Graph
Knowledge Graph
More recent examples?
Neo Technology, Inc Confidential
Neo Technology, Inc Confidential
Connected Data.
Neo Technology, Inc Confidential
Neo Technology, Inc Confidential
Wow.
But it looks like graph = social.
Right?
Neo Technology, Inc Confidential
Core Industries
& Use Cases:
Software
Financial
Services
Telecomm-
unications
Network & Data
Center Management
MDM
Social
Geo
Early Adopter Segments
Neo Technology, Inc Confidential
Neo4j Adoption Snapshot
Core Industries
& Use Cases:
Software
Financial
Services
Telecomm-
unications
Network & Data
Center Management
MDM
Social
Geo
Select Commercial Customers (Community Users Not Included)
Neo Technology, Inc Confidential
Core Industries
& Use Cases:
Web / ISV
Finance &
Insurance
Telecomm-
unications
Network & Data
Center Management
MDM
Social
Geo
Neo4j Adoption Snapshot
Select Commercial Customers (Community Users Not Included)
Neo Technology, Inc Confidential
Neo4j Adoption Snapshot
Select Commercial Customers (Community Users Not Included)
Core Industries
& Use Cases:
Software
Financial
Services
Telecomm
unications
Web Social, HR
& Recruiting
Health Care &
Life Sciences
Media &
Publishing
Energy, Services,
Automotive, Gov’t,
Logistics, Education,
Gaming, Other
Network & Data
Center
Management
MDM / System of
Record
Social
Geo
Identity &
Access Mgmt
Content
Management
Recommend-
ations
BI, CRM, Impact
Analysis, Fraud
Detection, Resource
Optimization, etc.
Accenture
Neo Technology, Inc Confidential
So what’s this product
they’re using?
Neo Technology, Inc Confidential
LIVES WITH
LOVES
OWNS
DRIVES
LOVES
name:“James”
age: 32
twitter:“@spam”
name:“Mary”
age: 35
brand:“Volvo”
model:“V70”
property type:“car”
Graph data model
Neo Technology, Inc Confidential
Image credits:Tobias Ivarsson
“Whiteboard friendliness”
Neo Technology, Inc Confidential
thobe
Wardrobe Strength
Joe project blog
Hello Joe
Neo4j performance analysis
Modularizing Jython
Image credits:Tobias Ivarsson
“Whiteboard friendliness”
Neo Technology, Inc Confidential
Database # persons query time
MySQL
Neo4j
Neo4j
๏a sample social graph
•with ~1,000 persons
๏average 50 friends per person
๏pathExists(a,b) limited to depth 4
๏caches warmed up to eliminate disk I/O
Graph db performance
Neo Technology, Inc Confidential
Database # persons query time
MySQL
Neo4j
Neo4j
1,000
๏a sample social graph
•with ~1,000 persons
๏average 50 friends per person
๏pathExists(a,b) limited to depth 4
๏caches warmed up to eliminate disk I/O
Graph db performance
Neo Technology, Inc Confidential
Database # persons query time
MySQL
Neo4j
Neo4j
1,000 2,000 ms
๏a sample social graph
•with ~1,000 persons
๏average 50 friends per person
๏pathExists(a,b) limited to depth 4
๏caches warmed up to eliminate disk I/O
Graph db performance
Neo Technology, Inc Confidential
Database # persons query time
MySQL
Neo4j
Neo4j
1,000 2,000 ms
1,000 2 ms
๏a sample social graph
•with ~1,000 persons
๏average 50 friends per person
๏pathExists(a,b) limited to depth 4
๏caches warmed up to eliminate disk I/O
Graph db performance
Neo Technology, Inc Confidential
Database # persons query time
MySQL
Neo4j
Neo4j
1,000 2,000 ms
1,000 2 ms
1,000,000
๏a sample social graph
•with ~1,000 persons
๏average 50 friends per person
๏pathExists(a,b) limited to depth 4
๏caches warmed up to eliminate disk I/O
Graph db performance
Neo Technology, Inc Confidential
Database # persons query time
MySQL
Neo4j
Neo4j
1,000 2,000 ms
1,000 2 ms
1,000,000 2 ms
๏a sample social graph
•with ~1,000 persons
๏average 50 friends per person
๏pathExists(a,b) limited to depth 4
๏caches warmed up to eliminate disk I/O
Graph db performance
Neo Technology, Inc Confidential
Drivers of Graph Adoption
“Why did you use a graph database for your application?”
Neo Technology, Inc Confidential
Drivers of Graph Adoption
Naturally Graphy Data
Complex Graph Queries
Query Performance
0% 20% 40% 60% 80%
Neo Technology, Inc Confidential
Great product.
What’s up next?
Neo Technology, Inc Confidential
Top 12 Month Product Themes
Neo Technology, Inc Confidential
Top 12 Month Product Themes
Ease of Use
Neo Technology, Inc Confidential
Top 12 Month Product Themes
Big DataEase of Use
Neo Technology, Inc Confidential
Top 12 Month Product Themes
Big Data CloudEase of Use
Neo Technology, Inc Confidential
2.0
Q2 Q3 Q4Q1
2013
Theme: Ease of Use
Neo4j 2.0
Neo Technology, Inc Confidential
2.0
• Labels. First expansion of the Property Graph model
since its inception. Nodes can have one or more labels.
Significantly improve power & ease of use.
Q2 Q3 Q4Q1
2013
Theme: Ease of Use
Neo4j 2.0
Neo Technology, Inc Confidential
2.0
• Labels. First expansion of the Property Graph model
since its inception. Nodes can have one or more labels.
Significantly improve power & ease of use.
• Index automation.
Improve indexing ease of use, leveraging new “Label” construct
Enable indexing operations through Cypher
Q2 Q3 Q4Q1
2013
Theme: Ease of Use
Neo4j 2.0
Neo Technology, Inc Confidential
2.0
• Labels. First expansion of the Property Graph model
since its inception. Nodes can have one or more labels.
Significantly improve power & ease of use.
• Index automation.
Improve indexing ease of use, leveraging new “Label” construct
Enable indexing operations through Cypher
• REST Improvements.
Improved Transactionality & Robustness
Q2 Q3 Q4Q1
2013
Theme: Ease of Use
Neo4j 2.0
Neo Technology, Inc Confidential
2.0
• Labels. First expansion of the Property Graph model
since its inception. Nodes can have one or more labels.
Significantly improve power & ease of use.
• Index automation.
Improve indexing ease of use, leveraging new “Label” construct
Enable indexing operations through Cypher
• REST Improvements.
Improved Transactionality & Robustness
• Cypher performance improvements.
Q2 Q3 Q4Q1
2013
Theme: Ease of Use
Neo4j 2.0
Neo Technology, Inc Confidential
Theme: Big Data
2.1
Q2 Q3 Q4Q12013
Neo4j 2.1
2014
Neo Technology, Inc Confidential
Theme: Big Data
2.1
• Increase upper size limits of single-machine graph
Q2 Q3 Q4Q12013
Neo4j 2.1
2014
Neo Technology, Inc Confidential
Theme: Big Data
2.1
• Increase upper size limits of single-machine graph
• Performance optimizations targeting densely-
connected nodes
Q2 Q3 Q4Q12013
Neo4j 2.1
2014
Neo Technology, Inc Confidential
Theme: Big Data
2.1
• Increase upper size limits of single-machine graph
• Performance optimizations targeting densely-
connected nodes
• Bulk data import improvements: easer & faster to
bring large amounts of data into Neo4j
Q2 Q3 Q4Q12013
Neo4j 2.1
2014
Neo Technology, Inc Confidential
Theme: Big Data
2.1
• Increase upper size limits of single-machine graph
• Performance optimizations targeting densely-
connected nodes
• Bulk data import improvements: easer & faster to
bring large amounts of data into Neo4j
• Cypher performance improvements
Q2 Q3 Q4Q12013
Neo4j 2.1
2014
Neo Technology, Inc Confidential
GraphConnect Boston
June 10-11, 2013 | Catalyst Restaurant
graphs are everywhere
Neo Technology, Inc Confidential
Your Mission:
Connect.
GraphConnect Boston
June 10-11, 2013 | Catalyst Restaurant
graphs are everywhere

New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

  • 1.
    Neo Technology, IncConfidential GraphConnect 2013 graphs are everywhere Emil Eifrem @emileifrem #graphconnect
  • 2.
    Neo Technology, IncConfidential
  • 3.
    Neo Technology, IncConfidential “Five richest big data sources on theWeb include social graph, intent graph, consumption graph, interest graph and mobile graph.” http://www.forbes.com/sites/ericsavitz/2012/10/23/gartner-top-10-strategic-technology-trends-for-2013/ - Gartner:“Top 10 Strategic Trends For 2013,” Oct 2012
  • 4.
    Neo Technology, IncConfidential “[I]t is arguable that graph databases will have a bigger impact on the database landscape than Hadoop or its competitors.” - Bloor Research, May 2012 http://www.bloorresearch.com/blog/IM-Blog/2012/5/graph-databases-nosql.html
  • 5.
    Neo Technology, IncConfidential “Graph analysis is the true killer app for Big Data.” - Forrester Research, Dec 2011 http://blogs.forrester.com/james_kobielus/11-12-19-the_year_ahead_in_big_data_big_cool_new_stuff_looms_large
  • 6.
    Neo Technology, IncConfidential http://gigaom.com/2013/05/14/were-witnessing-the-rise-of-the-graph-in-big-data/ GigaOm, May 2013
  • 7.
    Neo Technology, IncConfidential FastCompany, March 2013 http://www.fastcompany.com/magazine/174/exposing-yahoos-strategy-marissa-mayer
  • 8.
    Neo Technology, IncConfidential FastCompany, March 2013 http://www.fastcompany.com/magazine/174/exposing-yahoos-strategy-marissa-mayer
  • 9.
    Neo Technology, IncConfidential Ian Robinson, Jim Webber & Emil Eifrem Graph Databases h Com plim ents ofNeo Technology
  • 10.
    Neo Technology, IncConfidential ?
  • 11.
    Neo Technology, IncConfidential
  • 12.
    Neo Technology, IncConfidential
  • 13.
    Neo Technology, IncConfidential
  • 14.
    Neo Technology, IncConfidential NASDAQ:ORCL Market Cap Today: ~$150B
  • 15.
    Neo Technology, IncConfidential More recent examples?
  • 16.
    Neo Technology, IncConfidential More recent examples?
  • 17.
    Neo Technology, IncConfidential More recent examples?
  • 18.
    Neo Technology, IncConfidential More recent examples?
  • 19.
    Neo Technology, IncConfidential Social Graph More recent examples?
  • 20.
    Neo Technology, IncConfidential Social Graph Link Graph Knowledge Graph More recent examples?
  • 21.
    Neo Technology, IncConfidential Social Graph Interest Graph Link Graph Knowledge Graph More recent examples?
  • 22.
    Neo Technology, IncConfidential
  • 23.
    Neo Technology, IncConfidential Connected Data.
  • 24.
    Neo Technology, IncConfidential
  • 25.
    Neo Technology, IncConfidential Wow. But it looks like graph = social. Right?
  • 26.
    Neo Technology, IncConfidential Core Industries & Use Cases: Software Financial Services Telecomm- unications Network & Data Center Management MDM Social Geo Early Adopter Segments
  • 27.
    Neo Technology, IncConfidential Neo4j Adoption Snapshot Core Industries & Use Cases: Software Financial Services Telecomm- unications Network & Data Center Management MDM Social Geo Select Commercial Customers (Community Users Not Included)
  • 28.
    Neo Technology, IncConfidential Core Industries & Use Cases: Web / ISV Finance & Insurance Telecomm- unications Network & Data Center Management MDM Social Geo Neo4j Adoption Snapshot Select Commercial Customers (Community Users Not Included)
  • 29.
    Neo Technology, IncConfidential Neo4j Adoption Snapshot Select Commercial Customers (Community Users Not Included) Core Industries & Use Cases: Software Financial Services Telecomm unications Web Social, HR & Recruiting Health Care & Life Sciences Media & Publishing Energy, Services, Automotive, Gov’t, Logistics, Education, Gaming, Other Network & Data Center Management MDM / System of Record Social Geo Identity & Access Mgmt Content Management Recommend- ations BI, CRM, Impact Analysis, Fraud Detection, Resource Optimization, etc. Accenture
  • 30.
    Neo Technology, IncConfidential So what’s this product they’re using?
  • 31.
    Neo Technology, IncConfidential LIVES WITH LOVES OWNS DRIVES LOVES name:“James” age: 32 twitter:“@spam” name:“Mary” age: 35 brand:“Volvo” model:“V70” property type:“car” Graph data model
  • 32.
    Neo Technology, IncConfidential Image credits:Tobias Ivarsson “Whiteboard friendliness”
  • 33.
    Neo Technology, IncConfidential thobe Wardrobe Strength Joe project blog Hello Joe Neo4j performance analysis Modularizing Jython Image credits:Tobias Ivarsson “Whiteboard friendliness”
  • 34.
    Neo Technology, IncConfidential Database # persons query time MySQL Neo4j Neo4j ๏a sample social graph •with ~1,000 persons ๏average 50 friends per person ๏pathExists(a,b) limited to depth 4 ๏caches warmed up to eliminate disk I/O Graph db performance
  • 35.
    Neo Technology, IncConfidential Database # persons query time MySQL Neo4j Neo4j 1,000 ๏a sample social graph •with ~1,000 persons ๏average 50 friends per person ๏pathExists(a,b) limited to depth 4 ๏caches warmed up to eliminate disk I/O Graph db performance
  • 36.
    Neo Technology, IncConfidential Database # persons query time MySQL Neo4j Neo4j 1,000 2,000 ms ๏a sample social graph •with ~1,000 persons ๏average 50 friends per person ๏pathExists(a,b) limited to depth 4 ๏caches warmed up to eliminate disk I/O Graph db performance
  • 37.
    Neo Technology, IncConfidential Database # persons query time MySQL Neo4j Neo4j 1,000 2,000 ms 1,000 2 ms ๏a sample social graph •with ~1,000 persons ๏average 50 friends per person ๏pathExists(a,b) limited to depth 4 ๏caches warmed up to eliminate disk I/O Graph db performance
  • 38.
    Neo Technology, IncConfidential Database # persons query time MySQL Neo4j Neo4j 1,000 2,000 ms 1,000 2 ms 1,000,000 ๏a sample social graph •with ~1,000 persons ๏average 50 friends per person ๏pathExists(a,b) limited to depth 4 ๏caches warmed up to eliminate disk I/O Graph db performance
  • 39.
    Neo Technology, IncConfidential Database # persons query time MySQL Neo4j Neo4j 1,000 2,000 ms 1,000 2 ms 1,000,000 2 ms ๏a sample social graph •with ~1,000 persons ๏average 50 friends per person ๏pathExists(a,b) limited to depth 4 ๏caches warmed up to eliminate disk I/O Graph db performance
  • 40.
    Neo Technology, IncConfidential Drivers of Graph Adoption “Why did you use a graph database for your application?”
  • 41.
    Neo Technology, IncConfidential Drivers of Graph Adoption Naturally Graphy Data Complex Graph Queries Query Performance 0% 20% 40% 60% 80%
  • 42.
    Neo Technology, IncConfidential Great product. What’s up next?
  • 43.
    Neo Technology, IncConfidential Top 12 Month Product Themes
  • 44.
    Neo Technology, IncConfidential Top 12 Month Product Themes Ease of Use
  • 45.
    Neo Technology, IncConfidential Top 12 Month Product Themes Big DataEase of Use
  • 46.
    Neo Technology, IncConfidential Top 12 Month Product Themes Big Data CloudEase of Use
  • 47.
    Neo Technology, IncConfidential 2.0 Q2 Q3 Q4Q1 2013 Theme: Ease of Use Neo4j 2.0
  • 48.
    Neo Technology, IncConfidential 2.0 • Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use. Q2 Q3 Q4Q1 2013 Theme: Ease of Use Neo4j 2.0
  • 49.
    Neo Technology, IncConfidential 2.0 • Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use. • Index automation. Improve indexing ease of use, leveraging new “Label” construct Enable indexing operations through Cypher Q2 Q3 Q4Q1 2013 Theme: Ease of Use Neo4j 2.0
  • 50.
    Neo Technology, IncConfidential 2.0 • Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use. • Index automation. Improve indexing ease of use, leveraging new “Label” construct Enable indexing operations through Cypher • REST Improvements. Improved Transactionality & Robustness Q2 Q3 Q4Q1 2013 Theme: Ease of Use Neo4j 2.0
  • 51.
    Neo Technology, IncConfidential 2.0 • Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use. • Index automation. Improve indexing ease of use, leveraging new “Label” construct Enable indexing operations through Cypher • REST Improvements. Improved Transactionality & Robustness • Cypher performance improvements. Q2 Q3 Q4Q1 2013 Theme: Ease of Use Neo4j 2.0
  • 52.
    Neo Technology, IncConfidential Theme: Big Data 2.1 Q2 Q3 Q4Q12013 Neo4j 2.1 2014
  • 53.
    Neo Technology, IncConfidential Theme: Big Data 2.1 • Increase upper size limits of single-machine graph Q2 Q3 Q4Q12013 Neo4j 2.1 2014
  • 54.
    Neo Technology, IncConfidential Theme: Big Data 2.1 • Increase upper size limits of single-machine graph • Performance optimizations targeting densely- connected nodes Q2 Q3 Q4Q12013 Neo4j 2.1 2014
  • 55.
    Neo Technology, IncConfidential Theme: Big Data 2.1 • Increase upper size limits of single-machine graph • Performance optimizations targeting densely- connected nodes • Bulk data import improvements: easer & faster to bring large amounts of data into Neo4j Q2 Q3 Q4Q12013 Neo4j 2.1 2014
  • 56.
    Neo Technology, IncConfidential Theme: Big Data 2.1 • Increase upper size limits of single-machine graph • Performance optimizations targeting densely- connected nodes • Bulk data import improvements: easer & faster to bring large amounts of data into Neo4j • Cypher performance improvements Q2 Q3 Q4Q12013 Neo4j 2.1 2014
  • 57.
    Neo Technology, IncConfidential GraphConnect Boston June 10-11, 2013 | Catalyst Restaurant graphs are everywhere
  • 58.
    Neo Technology, IncConfidential Your Mission: Connect. GraphConnect Boston June 10-11, 2013 | Catalyst Restaurant graphs are everywhere