Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Visualising Research Graph using Neo4j and Gephi

261 views

Published on

Presentation of Research Graph Gephi Toolkit in Open Repository 2017

Published in: Internet
  • Be the first to comment

  • Be the first to like this

Visualising Research Graph using Neo4j and Gephi

  1. 1. Visualising Research Graph using Neo4j and Gephi Amir Aryani - orcid.org/0000-0002-4259-9774 Hao Zhang - orcid.org/0000-0002-1382-0505 Australian National University (ANU) This presentation on "Creating a Distributed Graph using RD-Switchboard" by Dr. Amir Aryani is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License (http://creativecommons.org/licenses/by-sa/4.0/).
  2. 2. Agenda • What do we try to achive (abstract view) and why? • Background: What is Research Graph? • Live demo of Neo4j • Data pipeline • Live demo of Gephi • Gephi filters
  3. 3. What & Why? What do we try to achive and why?
  4. 4. • Connect DSpace records to • ORCID profile records • Grants • Visualise the graph in Gephi
  5. 5. Background Challenge of cross-platform discovery
  6. 6. {All started here!} Research Data Australia Suggested Links March 2014, Version 12
  7. 7. Data Description Registry Interoperability (DDRI) Working Group Research Data Alliance Goal: enabling cross-platform discovery between research data infrastructures
  8. 8. DDRI WG Approach Connecting datasets on the basis of co-authorship or other collaboration models such as joint funding and grants. Research Data Alliance
  9. 9. https://researchdata.ands.org.au/idmm-immunome- database-for-marsupials-and-monotremes/11139
  10. 10. Show 105 more publications
  11. 11. http://dx.doi.org/10.1371/journal.pone.0079092 One of the 105 articles …
  12. 12. doi:10.5061/dryad.4qq0v Authors: Wong ESW, Nichol S, Warren WC, Belov K Dryad Dataset http://datadryad.org/resource/doi:10.5061/dryad.4qq0v
  13. 13. We have found another dataset from the same author… Dataset Researcher Publication Dataset
  14. 14. Toward an
 Automated Solution
  15. 15. Research Data Switchboard Toward automating the process
  16. 16. Using machines…
  17. 17. http://researchgraph.org/schema/
  18. 18. Connecting Datasets by Three Degrees of Separation
  19. 19. Live demo Neo4j
  20. 20. Exploring the graph…
  21. 21. Find links to ORCID match (n:dataset:dryad)- -(o:orcid) return count(n) match (n:dataset:ands)- -(o:orcid) where n.ands_group='The University of Sydney' return n limit 10
  22. 22. Find Highly Connected Datasets match (n:ands:dataset)--(x) return n.key, n.title, count(x) order by count (x) DESC limit 25
  23. 23. Find Shortest Path Between Two Datasets MATCH p=shortestPath(
 (d1:dryad:dataset {doi: '10.5061/dryad.4qq0v'})-[*]- (d2:ands:dataset {doi:'10.1186/1471-2172-12-48'})
 ) RETURN p
  24. 24. Data Pipeline
  25. 25. 1. DSpace to Neo4j 1.1. Harvest 1.2. Import-XML 2. Synthesis 3. Neo4j to Gephi 3.1. Neo4j Shell 3.2. Gephi Filter program
  26. 26. Harvest https://github.com/rd-switchboard
  27. 27. Import-XML
  28. 28. Neo4j Result
  29. 29. Synthesis
  30. 30. Synthesis
  31. 31. Synthesis Configuration • Sync.Source: S3 Bucket • Sync.Target: S3 Bucket • Sync.Level • Sync.Keys
  32. 32. Neo4j to Gephi • Neo4j Shell • export-graphml -t -r -o out.graphml match (n)-[r]->() return n,r • Gephi Filter • java -jar graph-filter.1.0.jar gephi.conf
  33. 33. General track 10: Machine friendly repositories Time: Thursday, 29/Jun/2017: 9:00am - 10:30am Research Graph: Building a Distributed Graph of Scholarly Works using Research Data Switchboard
  34. 34. Further information • Collaborators: http://researchgraph.org/collaborators/ • Schema: http://researchgraph.org/schema/ • GitHub: https://github.com/researchgraph • Facebook: facebook.com/researchgraph To get involved in this project please contact amir.aryani@ands.org.au

×