SlideShare a Scribd company logo
1 of 28
Download to read offline
Property graphs with time
Julia Stoyanovich, joint work with Vera Moffitt
Drexel University
Philadelphia, PA USA
stoyanovich.org
openCypher MeetupOctober 25, 2017
openCypher MeetupOctober 25, 2017 2
2008 20092007
20112010
openCypher MeetupOctober 25, 2017 3
https://www.kenedict.com/apples-internal-innovation-network-unraveled-part-1-evolving-networks/
openCypher MeetupOctober 25, 2017 4
https://arxiv.org/abs/1709.06176
openCypher MeetupOctober 25, 2017
Exploratory analysis of evolving graphs
• Which nodes are showing an increasing popularity trend?
• Have any changes in network connectivity been
observed?
• At what time scale can interesting trends be observed?
• How can multiple data sources be used jointly to
complement or corroborate information about network
evolution?
5
openCypher MeetupOctober 25, 2017
Goal
6
Principled and systematics support for usable,
scalable and extensible analysis of evolving graphs
openCypher MeetupOctober 25, 2017
Are Alice and Bill connected?
7
TNGP
… by a path?
openCypher MeetupOctober 25, 2017
Snapshot reducibility
8
openCypher MeetupOctober 25, 2017
Are Alice and Bill connected?
extended snapshot reducibility
9
… by a journey?
… by a path that persists over >2 time instants
openCypher MeetupOctober 25, 2017
TGraph: an evolving property graph
10
openCypher MeetupOctober 25, 2017
TGA: Temporal Graph Algebra
• Temporal variants of standard graph operators + novel time-
specific operators
• Compositional: TGraph (or a pair of TGraphs) as input -
TGraph as output
• Operations maintain model integrity
- graph integrity at each time instant: no dangling edges, a
node/edge appears at most once
- temporal integrity: semantics of temporal operations are
automatically enforced (formally: point semantics)
11
openCypher MeetupOctober 25, 2017
TGA operations
• trim
• temporal versions of
- vertex-map, edge-map
- subgraph, path
- aggregate messages
- union, intersection, difference - binary
• snapshot analytics
- PageRank, connected components,… - Pregel
12
openCypher MeetupOctober 25, 2017
TGA operations
• node creation
• based on temporal window: temporal zoom
• attribute-based: structural zoom
• edge creation
13
openCypher MeetupOctober 25, 2017
Structural zoom
14
add university nodes Drexel and CMU,
and edges between students and these universities
openCypher MeetupOctober 25, 2017
Structural zoom
15
openCypher MeetupOctober 25, 2017
Temporal zoom
16
coarsen taxi trip start-times into 10-min intervals
openCypher MeetupOctober 25, 2017
System architecture
17
Portal
Interactive	Shell
Query	Parser
Spark	
Runtime
GraphX
Data	Structures
Worker
Spark	Runtime
HDFS
Worker
Spark	Runtime
HDFS
…
System	
Catalog
SparkSQL
Portal	Runtime	
(optimizer,	operators,	etc)
Spark 2.0, interoperable with SparkSQL and
with BigDatalog
openCypher MeetupOctober 25, 2017
Physical data representation
• On-disk: Apache Parquet
- vertex / edge files
- broken down into snapshot groups
- each file sorted on start time followed by node /edge id
• In-memory:
- nested relational (Vertex-Edge RDDs)
- GraphX-based: RepresentativeGraphs (RG), One
Graph (OG), HybridGraph (HG)
18
1 2 3
BitSet(p1,p2,p3,p4) BitSet(p2,p3,p4,p5)
BitSet(p5)
BitSet(p1,p2,p3,p4,p5)
BitSet(p2,p3)
openCypher MeetupOctober 25, 2017
Performance highlights
• 16-node Open Stack cluster
• Apache Spark 2.0
• 4 cores, 16GB / RAM per node
19
openCypher MeetupOctober 25, 2017
PageRank on wiki-talk
20
openCypher MeetupOctober 25, 2017
PageRank on nGrams
21
openCypher MeetupOctober 25, 2017
PageRank on Twitter
22
openCypher MeetupOctober 25, 2017
Aggregate messages on wiki-talk
23
openCypher MeetupOctober 25, 2017
Vertex-subgraph on wiki-talk
24
openCypher MeetupOctober 25, 2017
Portal vs. G*
25
average node degree, wiki-talk
openCypher MeetupOctober 25, 2017
Take-aways
• TGraph: a logical model of property graphs with time
• TGA: a compositional temporal graph algebra under
point semantics
• Portal: a library on top of Apache Spark, inter-
operable with SparkSQL
• Ongoing work on a declarative language, multi-
operator query optimization, benchmarking
• Planned open source release this Fall
26
openCypher MeetupOctober 25, 2017
References
• Temporal Graph Algebra, Moffitt & Stoyanovich, DBPL
2017.
• Zooming in on NYC taxi data with Portal, Stoyanovich,
Gilbride and Moffitt, DSSG 2017 (arXiv).
• Towards sequenced semantics for evolving graphs,
Moffitt & Stoyanovich, EDBT 2017.
• Towards a distributed infrastructure for evolving graph
analytics, Moffitt & Stoyanovich, TempWeb 2016.
• Vera Moffitt’s Ph.D. thesis.
27
openCypher MeetupOctober 25, 2017
Thank you!

More Related Content

What's hot

Team 10 geo dcat ap for earth observation data
Team 10 geo dcat ap for earth observation dataTeam 10 geo dcat ap for earth observation data
Team 10 geo dcat ap for earth observation dataplan4all
 
Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!Matt Stubbs
 
Team 09 open land use and smart points of interest visualisation using web g ...
Team 09 open land use and smart points of interest visualisation using web g ...Team 09 open land use and smart points of interest visualisation using web g ...
Team 09 open land use and smart points of interest visualisation using web g ...plan4all
 
Seventh openCypher Implementers Group Meeting: Status Update
Seventh openCypher Implementers Group Meeting: Status UpdateSeventh openCypher Implementers Group Meeting: Status Update
Seventh openCypher Implementers Group Meeting: Status UpdateopenCypher
 
Team 02 metadata catalogue for the open land use map
Team 02 metadata catalogue for the open land use mapTeam 02 metadata catalogue for the open land use map
Team 02 metadata catalogue for the open land use mapplan4all
 
Neo4j - Rik Van Bruggen
Neo4j - Rik Van BruggenNeo4j - Rik Van Bruggen
Neo4j - Rik Van Bruggenbigdatalondon
 
Third openCypher Implementers Group Meeting: Status Update
Third openCypher Implementers Group Meeting: Status UpdateThird openCypher Implementers Group Meeting: Status Update
Third openCypher Implementers Group Meeting: Status UpdateopenCypher
 
BDE SC4 Hangout - Hajira Jabeen, general architecture
BDE SC4 Hangout - Hajira Jabeen, general architectureBDE SC4 Hangout - Hajira Jabeen, general architecture
BDE SC4 Hangout - Hajira Jabeen, general architectureBigData_Europe
 
Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)
Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)
Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)STAIR Lab, Chiba Institute of Technology
 
Producing Linked Open Data with a Content Management System
Producing Linked Open Data with a Content Management SystemProducing Linked Open Data with a Content Management System
Producing Linked Open Data with a Content Management SystemOpen Knowledge Belgium
 
Publishing metadata provenance
Publishing metadata provenancePublishing metadata provenance
Publishing metadata provenanceJana Hentschke
 
Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...
Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...
Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...CUBCCE Conference
 
This week in Neo4j - 28th October 2017
This week in Neo4j - 28th October 2017This week in Neo4j - 28th October 2017
This week in Neo4j - 28th October 2017Neo4j
 
Semantic MediaWiki and Open Data
Semantic MediaWiki and Open DataSemantic MediaWiki and Open Data
Semantic MediaWiki and Open DataBernhard Krabina
 
03 20170905 inspire workshop_cwa_mondon
03 20170905 inspire workshop_cwa_mondon03 20170905 inspire workshop_cwa_mondon
03 20170905 inspire workshop_cwa_mondonplan4all
 
GPU Computation and the Next Gen Cloud
GPU Computation and the Next Gen CloudGPU Computation and the Next Gen Cloud
GPU Computation and the Next Gen CloudJayprasad Hegde
 
Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...
Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...
Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...Miguel Pérez Colino
 
Presto Summit 2018 - 08 - FINRA
Presto Summit 2018  - 08 - FINRAPresto Summit 2018  - 08 - FINRA
Presto Summit 2018 - 08 - FINRAkbajda
 

What's hot (19)

Team 10 geo dcat ap for earth observation data
Team 10 geo dcat ap for earth observation dataTeam 10 geo dcat ap for earth observation data
Team 10 geo dcat ap for earth observation data
 
Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!
 
Team 09 open land use and smart points of interest visualisation using web g ...
Team 09 open land use and smart points of interest visualisation using web g ...Team 09 open land use and smart points of interest visualisation using web g ...
Team 09 open land use and smart points of interest visualisation using web g ...
 
Seventh openCypher Implementers Group Meeting: Status Update
Seventh openCypher Implementers Group Meeting: Status UpdateSeventh openCypher Implementers Group Meeting: Status Update
Seventh openCypher Implementers Group Meeting: Status Update
 
Team 02 metadata catalogue for the open land use map
Team 02 metadata catalogue for the open land use mapTeam 02 metadata catalogue for the open land use map
Team 02 metadata catalogue for the open land use map
 
Neo4j - Rik Van Bruggen
Neo4j - Rik Van BruggenNeo4j - Rik Van Bruggen
Neo4j - Rik Van Bruggen
 
Third openCypher Implementers Group Meeting: Status Update
Third openCypher Implementers Group Meeting: Status UpdateThird openCypher Implementers Group Meeting: Status Update
Third openCypher Implementers Group Meeting: Status Update
 
Plotter
PlotterPlotter
Plotter
 
BDE SC4 Hangout - Hajira Jabeen, general architecture
BDE SC4 Hangout - Hajira Jabeen, general architectureBDE SC4 Hangout - Hajira Jabeen, general architecture
BDE SC4 Hangout - Hajira Jabeen, general architecture
 
Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)
Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)
Stair Captions and Stair Actions(ステアラボ人工知能シンポジウム2017)
 
Producing Linked Open Data with a Content Management System
Producing Linked Open Data with a Content Management SystemProducing Linked Open Data with a Content Management System
Producing Linked Open Data with a Content Management System
 
Publishing metadata provenance
Publishing metadata provenancePublishing metadata provenance
Publishing metadata provenance
 
Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...
Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...
Károly Kazi: Theory and Practice: BHEs cooperation with educational organizat...
 
This week in Neo4j - 28th October 2017
This week in Neo4j - 28th October 2017This week in Neo4j - 28th October 2017
This week in Neo4j - 28th October 2017
 
Semantic MediaWiki and Open Data
Semantic MediaWiki and Open DataSemantic MediaWiki and Open Data
Semantic MediaWiki and Open Data
 
03 20170905 inspire workshop_cwa_mondon
03 20170905 inspire workshop_cwa_mondon03 20170905 inspire workshop_cwa_mondon
03 20170905 inspire workshop_cwa_mondon
 
GPU Computation and the Next Gen Cloud
GPU Computation and the Next Gen CloudGPU Computation and the Next Gen Cloud
GPU Computation and the Next Gen Cloud
 
Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...
Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...
Red Hat Summit 2017 - LT107508 - Better Managing your Red Hat footprint with ...
 
Presto Summit 2018 - 08 - FINRA
Presto Summit 2018  - 08 - FINRAPresto Summit 2018  - 08 - FINRA
Presto Summit 2018 - 08 - FINRA
 

Similar to Property Graphs with Time

HPC I/O for Computational Scientists
HPC I/O for Computational ScientistsHPC I/O for Computational Scientists
HPC I/O for Computational Scientistsinside-BigData.com
 
DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...
DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...
DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...Luca Mazzola
 
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...4Science
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهWeb Standards School
 
What to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformWhat to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformMario Juric
 
CPaaS.io Y1 Review Meeting - Holistic Data Management
CPaaS.io Y1 Review Meeting - Holistic Data ManagementCPaaS.io Y1 Review Meeting - Holistic Data Management
CPaaS.io Y1 Review Meeting - Holistic Data ManagementStephan Haller
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021Gérard Dupont
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
Inspire hack 2017-linked-data
Inspire hack 2017-linked-dataInspire hack 2017-linked-data
Inspire hack 2017-linked-dataRaul Palma
 
Team 05 linked data generation
Team 05 linked data generationTeam 05 linked data generation
Team 05 linked data generationplan4all
 
Delivering Agile Data Science on Openshift - Red Hat Summit 2019
Delivering Agile Data Science on Openshift  - Red Hat Summit 2019Delivering Agile Data Science on Openshift  - Red Hat Summit 2019
Delivering Agile Data Science on Openshift - Red Hat Summit 2019John Archer
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataMarin Dimitrov
 
ArrayUDF: User-Defined Scientific Data Analysis on Arrays
ArrayUDF: User-Defined Scientific Data Analysis on ArraysArrayUDF: User-Defined Scientific Data Analysis on Arrays
ArrayUDF: User-Defined Scientific Data Analysis on ArraysGoon83
 
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sfSparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sfHarsh Thakkar
 
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral ConsortiumEnabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral ConsortiumHenrique O. Santos
 
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...4Science
 
How to Create the Google for Earth Data (XLDB 2015, Stanford)
How to Create the Google for Earth Data (XLDB 2015, Stanford)How to Create the Google for Earth Data (XLDB 2015, Stanford)
How to Create the Google for Earth Data (XLDB 2015, Stanford)Rainer Sternfeld
 
Leveraging Data Driven Research Through Microsoft Azure
Leveraging Data Driven Research Through Microsoft AzureLeveraging Data Driven Research Through Microsoft Azure
Leveraging Data Driven Research Through Microsoft AzureMiguel González-Fierro
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021hala Skaf
 

Similar to Property Graphs with Time (20)

HPC I/O for Computational Scientists
HPC I/O for Computational ScientistsHPC I/O for Computational Scientists
HPC I/O for Computational Scientists
 
DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...
DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...
DLP: a Web-based Facility for Exploration and Basic Modification of Ontologie...
 
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
Extending DSpace 7: DSpace-CRIS and DSpace-GLAM for empowered repositories an...
 
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان دادهمعرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
معرفی کاربردهای یادگیری عمیق و چالش های آن در کلان داده
 
What to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformWhat to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science Platform
 
CPaaS.io Y1 Review Meeting - Holistic Data Management
CPaaS.io Y1 Review Meeting - Holistic Data ManagementCPaaS.io Y1 Review Meeting - Holistic Data Management
CPaaS.io Y1 Review Meeting - Holistic Data Management
 
Tds — big science dec 2021
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Inspire hack 2017-linked-data
Inspire hack 2017-linked-dataInspire hack 2017-linked-data
Inspire hack 2017-linked-data
 
Team 05 linked data generation
Team 05 linked data generationTeam 05 linked data generation
Team 05 linked data generation
 
Delivering Agile Data Science on Openshift - Red Hat Summit 2019
Delivering Agile Data Science on Openshift  - Red Hat Summit 2019Delivering Agile Data Science on Openshift  - Red Hat Summit 2019
Delivering Agile Data Science on Openshift - Red Hat Summit 2019
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
ArrayUDF: User-Defined Scientific Data Analysis on Arrays
ArrayUDF: User-Defined Scientific Data Analysis on ArraysArrayUDF: User-Defined Scientific Data Analysis on Arrays
ArrayUDF: User-Defined Scientific Data Analysis on Arrays
 
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sfSparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
 
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral ConsortiumEnabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium
 
H2o tutorial
H2o tutorialH2o tutorial
H2o tutorial
 
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
 
How to Create the Google for Earth Data (XLDB 2015, Stanford)
How to Create the Google for Earth Data (XLDB 2015, Stanford)How to Create the Google for Earth Data (XLDB 2015, Stanford)
How to Create the Google for Earth Data (XLDB 2015, Stanford)
 
Leveraging Data Driven Research Through Microsoft Azure
Leveraging Data Driven Research Through Microsoft AzureLeveraging Data Driven Research Through Microsoft Azure
Leveraging Data Driven Research Through Microsoft Azure
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021
 

More from openCypher

Learning Timed Automata with Cypher
Learning Timed Automata with CypherLearning Timed Automata with Cypher
Learning Timed Automata with CypheropenCypher
 
Incremental View Maintenance for openCypher Queries
Incremental View Maintenance for openCypher QueriesIncremental View Maintenance for openCypher Queries
Incremental View Maintenance for openCypher QueriesopenCypher
 
Formal semantics for Cypher queries and updates
Formal semantics for Cypher queries and updatesFormal semantics for Cypher queries and updates
Formal semantics for Cypher queries and updatesopenCypher
 
Cypher.PL: an executable specification of Cypher semantics
Cypher.PL: an executable specification of Cypher semanticsCypher.PL: an executable specification of Cypher semantics
Cypher.PL: an executable specification of Cypher semanticsopenCypher
 
Multiple Graphs: Updatable Views
Multiple Graphs: Updatable ViewsMultiple Graphs: Updatable Views
Multiple Graphs: Updatable ViewsopenCypher
 
Micro-Servicing Linked Data
Micro-Servicing Linked DataMicro-Servicing Linked Data
Micro-Servicing Linked DataopenCypher
 
Graph abstraction
Graph abstractionGraph abstraction
Graph abstractionopenCypher
 
From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...
From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...
From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...openCypher
 
Cypher for Gremlin
Cypher for GremlinCypher for Gremlin
Cypher for GremlinopenCypher
 
Comparing PGQL, G-Core and Cypher
Comparing PGQL, G-Core and CypherComparing PGQL, G-Core and Cypher
Comparing PGQL, G-Core and CypheropenCypher
 
Multiple graphs in openCypher
Multiple graphs in openCypherMultiple graphs in openCypher
Multiple graphs in openCypheropenCypher
 
Eighth openCypher Implementers Group Meeting: Status Update
Eighth openCypher Implementers Group Meeting: Status UpdateEighth openCypher Implementers Group Meeting: Status Update
Eighth openCypher Implementers Group Meeting: Status UpdateopenCypher
 
Cypher for Gremlin
Cypher for GremlinCypher for Gremlin
Cypher for GremlinopenCypher
 
Supporting dates and times in Cypher
Supporting dates and times in CypherSupporting dates and times in Cypher
Supporting dates and times in CypheropenCypher
 
Academic research on graph processing: connecting recent findings to industri...
Academic research on graph processing: connecting recent findings to industri...Academic research on graph processing: connecting recent findings to industri...
Academic research on graph processing: connecting recent findings to industri...openCypher
 
Cypher.PL: Executable Specification of Cypher written in Prolog
Cypher.PL: Executable Specification of Cypher written in PrologCypher.PL: Executable Specification of Cypher written in Prolog
Cypher.PL: Executable Specification of Cypher written in PrologopenCypher
 
Use case: processing multiple graphs
Use case: processing multiple graphsUse case: processing multiple graphs
Use case: processing multiple graphsopenCypher
 
openCypher Technology Compatibility Kit (TCK)
openCypher Technology Compatibility Kit (TCK)openCypher Technology Compatibility Kit (TCK)
openCypher Technology Compatibility Kit (TCK)openCypher
 
Cypher Editor in the Web
Cypher Editor in the WebCypher Editor in the Web
Cypher Editor in the WebopenCypher
 
The inGraph project and incremental evaluation of Cypher queries
The inGraph project and incremental evaluation of Cypher queriesThe inGraph project and incremental evaluation of Cypher queries
The inGraph project and incremental evaluation of Cypher queriesopenCypher
 

More from openCypher (20)

Learning Timed Automata with Cypher
Learning Timed Automata with CypherLearning Timed Automata with Cypher
Learning Timed Automata with Cypher
 
Incremental View Maintenance for openCypher Queries
Incremental View Maintenance for openCypher QueriesIncremental View Maintenance for openCypher Queries
Incremental View Maintenance for openCypher Queries
 
Formal semantics for Cypher queries and updates
Formal semantics for Cypher queries and updatesFormal semantics for Cypher queries and updates
Formal semantics for Cypher queries and updates
 
Cypher.PL: an executable specification of Cypher semantics
Cypher.PL: an executable specification of Cypher semanticsCypher.PL: an executable specification of Cypher semantics
Cypher.PL: an executable specification of Cypher semantics
 
Multiple Graphs: Updatable Views
Multiple Graphs: Updatable ViewsMultiple Graphs: Updatable Views
Multiple Graphs: Updatable Views
 
Micro-Servicing Linked Data
Micro-Servicing Linked DataMicro-Servicing Linked Data
Micro-Servicing Linked Data
 
Graph abstraction
Graph abstractionGraph abstraction
Graph abstraction
 
From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...
From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...
From Cypher 9 to GQL: Conceptual overview of multiple named graphs and compos...
 
Cypher for Gremlin
Cypher for GremlinCypher for Gremlin
Cypher for Gremlin
 
Comparing PGQL, G-Core and Cypher
Comparing PGQL, G-Core and CypherComparing PGQL, G-Core and Cypher
Comparing PGQL, G-Core and Cypher
 
Multiple graphs in openCypher
Multiple graphs in openCypherMultiple graphs in openCypher
Multiple graphs in openCypher
 
Eighth openCypher Implementers Group Meeting: Status Update
Eighth openCypher Implementers Group Meeting: Status UpdateEighth openCypher Implementers Group Meeting: Status Update
Eighth openCypher Implementers Group Meeting: Status Update
 
Cypher for Gremlin
Cypher for GremlinCypher for Gremlin
Cypher for Gremlin
 
Supporting dates and times in Cypher
Supporting dates and times in CypherSupporting dates and times in Cypher
Supporting dates and times in Cypher
 
Academic research on graph processing: connecting recent findings to industri...
Academic research on graph processing: connecting recent findings to industri...Academic research on graph processing: connecting recent findings to industri...
Academic research on graph processing: connecting recent findings to industri...
 
Cypher.PL: Executable Specification of Cypher written in Prolog
Cypher.PL: Executable Specification of Cypher written in PrologCypher.PL: Executable Specification of Cypher written in Prolog
Cypher.PL: Executable Specification of Cypher written in Prolog
 
Use case: processing multiple graphs
Use case: processing multiple graphsUse case: processing multiple graphs
Use case: processing multiple graphs
 
openCypher Technology Compatibility Kit (TCK)
openCypher Technology Compatibility Kit (TCK)openCypher Technology Compatibility Kit (TCK)
openCypher Technology Compatibility Kit (TCK)
 
Cypher Editor in the Web
Cypher Editor in the WebCypher Editor in the Web
Cypher Editor in the Web
 
The inGraph project and incremental evaluation of Cypher queries
The inGraph project and incremental evaluation of Cypher queriesThe inGraph project and incremental evaluation of Cypher queries
The inGraph project and incremental evaluation of Cypher queries
 

Recently uploaded

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 

Recently uploaded (20)

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 

Property Graphs with Time

  • 1. Property graphs with time Julia Stoyanovich, joint work with Vera Moffitt Drexel University Philadelphia, PA USA stoyanovich.org openCypher MeetupOctober 25, 2017
  • 2. openCypher MeetupOctober 25, 2017 2 2008 20092007 20112010
  • 3. openCypher MeetupOctober 25, 2017 3 https://www.kenedict.com/apples-internal-innovation-network-unraveled-part-1-evolving-networks/
  • 4. openCypher MeetupOctober 25, 2017 4 https://arxiv.org/abs/1709.06176
  • 5. openCypher MeetupOctober 25, 2017 Exploratory analysis of evolving graphs • Which nodes are showing an increasing popularity trend? • Have any changes in network connectivity been observed? • At what time scale can interesting trends be observed? • How can multiple data sources be used jointly to complement or corroborate information about network evolution? 5
  • 6. openCypher MeetupOctober 25, 2017 Goal 6 Principled and systematics support for usable, scalable and extensible analysis of evolving graphs
  • 7. openCypher MeetupOctober 25, 2017 Are Alice and Bill connected? 7 TNGP … by a path?
  • 8. openCypher MeetupOctober 25, 2017 Snapshot reducibility 8
  • 9. openCypher MeetupOctober 25, 2017 Are Alice and Bill connected? extended snapshot reducibility 9 … by a journey? … by a path that persists over >2 time instants
  • 10. openCypher MeetupOctober 25, 2017 TGraph: an evolving property graph 10
  • 11. openCypher MeetupOctober 25, 2017 TGA: Temporal Graph Algebra • Temporal variants of standard graph operators + novel time- specific operators • Compositional: TGraph (or a pair of TGraphs) as input - TGraph as output • Operations maintain model integrity - graph integrity at each time instant: no dangling edges, a node/edge appears at most once - temporal integrity: semantics of temporal operations are automatically enforced (formally: point semantics) 11
  • 12. openCypher MeetupOctober 25, 2017 TGA operations • trim • temporal versions of - vertex-map, edge-map - subgraph, path - aggregate messages - union, intersection, difference - binary • snapshot analytics - PageRank, connected components,… - Pregel 12
  • 13. openCypher MeetupOctober 25, 2017 TGA operations • node creation • based on temporal window: temporal zoom • attribute-based: structural zoom • edge creation 13
  • 14. openCypher MeetupOctober 25, 2017 Structural zoom 14 add university nodes Drexel and CMU, and edges between students and these universities
  • 15. openCypher MeetupOctober 25, 2017 Structural zoom 15
  • 16. openCypher MeetupOctober 25, 2017 Temporal zoom 16 coarsen taxi trip start-times into 10-min intervals
  • 17. openCypher MeetupOctober 25, 2017 System architecture 17 Portal Interactive Shell Query Parser Spark Runtime GraphX Data Structures Worker Spark Runtime HDFS Worker Spark Runtime HDFS … System Catalog SparkSQL Portal Runtime (optimizer, operators, etc) Spark 2.0, interoperable with SparkSQL and with BigDatalog
  • 18. openCypher MeetupOctober 25, 2017 Physical data representation • On-disk: Apache Parquet - vertex / edge files - broken down into snapshot groups - each file sorted on start time followed by node /edge id • In-memory: - nested relational (Vertex-Edge RDDs) - GraphX-based: RepresentativeGraphs (RG), One Graph (OG), HybridGraph (HG) 18 1 2 3 BitSet(p1,p2,p3,p4) BitSet(p2,p3,p4,p5) BitSet(p5) BitSet(p1,p2,p3,p4,p5) BitSet(p2,p3)
  • 19. openCypher MeetupOctober 25, 2017 Performance highlights • 16-node Open Stack cluster • Apache Spark 2.0 • 4 cores, 16GB / RAM per node 19
  • 20. openCypher MeetupOctober 25, 2017 PageRank on wiki-talk 20
  • 21. openCypher MeetupOctober 25, 2017 PageRank on nGrams 21
  • 22. openCypher MeetupOctober 25, 2017 PageRank on Twitter 22
  • 23. openCypher MeetupOctober 25, 2017 Aggregate messages on wiki-talk 23
  • 24. openCypher MeetupOctober 25, 2017 Vertex-subgraph on wiki-talk 24
  • 25. openCypher MeetupOctober 25, 2017 Portal vs. G* 25 average node degree, wiki-talk
  • 26. openCypher MeetupOctober 25, 2017 Take-aways • TGraph: a logical model of property graphs with time • TGA: a compositional temporal graph algebra under point semantics • Portal: a library on top of Apache Spark, inter- operable with SparkSQL • Ongoing work on a declarative language, multi- operator query optimization, benchmarking • Planned open source release this Fall 26
  • 27. openCypher MeetupOctober 25, 2017 References • Temporal Graph Algebra, Moffitt & Stoyanovich, DBPL 2017. • Zooming in on NYC taxi data with Portal, Stoyanovich, Gilbride and Moffitt, DSSG 2017 (arXiv). • Towards sequenced semantics for evolving graphs, Moffitt & Stoyanovich, EDBT 2017. • Towards a distributed infrastructure for evolving graph analytics, Moffitt & Stoyanovich, TempWeb 2016. • Vera Moffitt’s Ph.D. thesis. 27
  • 28. openCypher MeetupOctober 25, 2017 Thank you!