OpenStack Community Network Mining with Neo4j

996 views
890 views

Published on

vBrownBag talk at OpenStack Summit Atlanta 2014
Presentation's live recording here: http://www.youtube.com/watch?v=yS6E2N8_A0Y

Published in: Technology
1 Comment
2 Likes
Statistics
Notes
No Downloads
Views
Total views
996
On SlideShare
0
From Embeds
0
Number of Embeds
5
Actions
Shares
0
Downloads
0
Comments
1
Likes
2
Embeds 0
No embeds

No notes for slide

OpenStack Community Network Mining with Neo4j

  1. 1. Mining OpenStack Community Network with Neo4j OpenStack Summit Atlanta vBrownBag | May 13, 2014 Kamesh Raghavendra kamesh@netapp.com
  2. 2. 2 What? Property graph representation of the global OpenStack community network including: • People • Developers, Customers, Service Operators • Interactions • Mailing lists, Blueprints, Code check-ins • Contexts • Location, Parent Organization, Project Opened to the community in the form of canned and adhoc graph queries
  3. 3. 3 Who? Joint initiative between: Where? http://www.ccbd-pes.in/openstack-tool.html http://www.ccbd-pes.in/openstack-graph-query.html When? Project active since March 2014, v1.0 by Paris summit Collaborators welcome!
  4. 4. 4 Motivations • Product strategy & management • Discover patterns of OpenStack consumption & deployment • Demographic trends across organizations, industry verticals & geography • Segment consumers by demography • Analyze multi-faceted roles • Community members playing roles of consumers, developers & service operators – more often multiple at the same time
  5. 5. 5 OpenStack Data Sources Integrated • Mail Archives [58,702: http://openstack.markmail.org/] • Support Form [10,344: https://ask.openstack.org/en/questions/] • Bug Tracker [6,520: https://bugs.launchpad.net/openstack] • Blueprints [6,311: https://blueprints.launchpad.net/openstack] • More sources being integrated
  6. 6. 6 OpenStack Network Graph Data Model Hosted on Neo4j 2.0.3 Community Edition Server Demographic Context • Parent organization • Country • Industry Vertical Interaction Context • Project • Sentiment Person
  7. 7. 7 OpenStack Network Graph Query Demo https://www.youtube.com/watch?v=8N-D1IRcUo4
  8. 8. 8 Sample Graph Queries Which are most popular OpenStack projects in Japan? MATCH (C:COUNTRY)--()--()--()--(Q)--(N:Project) WHERE C.NAME=’Japan' WITH C,COUNT(Q) AS Count,N ORDER BY Count DESC RETURN N.TAGNAME,Count Which are the most popular industries in UK adopting OpenStack? MATCH (I:INDUSTRY)--(O:ORGANIZATION)--(C:COUNTRY) WHERE C.NAME='Uk' WITH COUNT(I) AS S,I ORDER BY S DESC RETURN I.NAME,S LIMIT 5
  9. 9. 9 More Sample Graph Queries Who are the top 5 weekend contributors? MATCH (O:ORGANIZATION)--(D:DOMAIN)--(P:PERSON)--(Q) WHERE Q.TIMESTAMP=~".*Sat.*" OR Q.TIMESTAMP=~".*Sun.*" WITH COUNT(Q) AS N, P, O ORDER BY N DESC RETURN P.FULL_NAME AS Name, O.NAME AS Organization, N AS Weekend_Contributions LIMIT 5
  10. 10. 10 Road Ahead • Enhance & automate data ETL • Integrate more data sources • Extract more contexts – sentiment, expertise, role • Enhance query user experience – schema, syntax • Bring popular queries as canned reports Seeking early users & collaborators to accelerate development
  11. 11. Thank You

×