SlideShare a Scribd company logo
1 of 18
Download to read offline
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph Abstractions Matter
(a Personal View)
Dr. Ora Lassila
Principal Technologist
Amazon Neptune
© 2021, Amazon Web Services, Inc. or its Affiliates.
Game plan: 1. Some graph history
2. Graph abstractions, queries
and access patterns
3. Graph as an abstraction:
“One graph to rule them all”
4. The way forward
© 2021, Amazon Web Services, Inc. or its Affiliates.
A brief history of graphs and ontologies
1730s: Graph theory (Euler)
1870s: Library classification (Dewey)
1900: Semantics, ontology and logic (Husserl)
1950s and onwards: Graphs as the essential
underpinning of computer science
1960s-1970s: Network databases (CODASYL), semantic
networks (Quillian et al)
1970s-1990s: Predicate logic as the foundation of
Knowledge Representation (Hayes et al)
1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al)
Today: Modern knowledge graphs and graph databases
3rd Century BCE: Categories
& logic (Aristotle)
1730s: Taxonomical classification of plants and
animals (Linnaeus)
1960s: Social networks, “small-world experiment”,
Erdős number (Milgram et al)
© 2021, Amazon Web Services, Inc. or its Affiliates.
1730s: Graph theory (Euler)
1870s: Library classification (Dewey)
1900: Semantics, ontology and logic (Husserl)
1950s and onwards: Graphs as the essential
underpinning of computer science
1960s-1970s: Network databases (CODASYL), semantic
networks (Quillian et al)
1970s-1990s: Predicate logic as the foundation of
Knowledge Representation (Hayes et al)
Today: Modern knowledge graphs and graph databases
3rd Century BCE: Categories
& logic (Aristotle)
1730s: Taxonomical classification of plants and
animals (Linnaeus)
1960s: Social networks, “small-world experiment”,
Erdős number (Milgram et al)
A brief history of graphs and ontologies
1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al)
Despite all this history,
programming with graphs and
ontologies is still cumbersome
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF Labeled property graphs
Features:
• vertices and edge types: identifiers
• graphs decompose into “triples”
• scalar data types from XSD, values
are special vertices
• no composite data types; objects
use graph structure
Query languages:
• SPARQL (declarative)
Features:
• vertices and edges: objects
• no common identifier scheme
• scalar and composite data types
borrowed from implementation
languages
Query languages:
• Gremlin (imperative)
• Cypher (declarative)
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph abstractions
Triples are a very low-level abstraction of graphs
• akin to assembly language
Mapping graphs to trees does not necessarily work
• metamodel mismatch
• RDF/XML and JSON-LD as warning examples
• (also: syntax does not matter!)
“Object-graph mapping”…?
• how to determine what constitutes an object?
• in RDF, blank nodes can be difficult to handle
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph queries (or: “how do I get there from here?”)
Traversal is an excellent graph access paradigm
• “where can I go from here given this path pattern?”
• “are these two nodes connected, and how?”
• using path patterns for traversal is akin to pattern matching [Lassila 2007]
SPARQL is a ”very big hammer” for many access problems
We need simple APIs:
• search, faceted search (filtering)
• “give me everything you know about this node” (cf. object-graph mapping)
• traversal
© 2021, Amazon Web Services, Inc. or its Affiliates.
Models and ontologies
Problems:
1. LPGs (by and large) do not have schema languages
2. RDF has too many schema languages (RDFS, OWL, SHACL)
3. how to map models to code?
What does mapping your data to a shared ontology “buy” you?
Can we support ontologies with predefined libraries?
• ”ontology engines”
© 2021, Amazon Web Services, Inc. or its Affiliates.
What about reasoning?
Symbolic reasoning:
• generative reasoning can be easily hidden from the application/client
• restrictions as a means of data validation (SHACL rather than OWL)
Non-symbolic reasoning:
• ML techniques can be used much like symbolic reasoning
We need an expanded view of reasoning, and we need to rethink what an
“application” is:
Logic should be associated with data
and models, not with an application
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graphs as a logical abstraction
Modern enterprise data practice is messy
Data silos are commonplace
Data integration often happens via
bespoke and ad hoc solutions
We need a unifying logical view of data (a
lingua franca for data integration) or risk
ongoing and ever-increasing
fragmentation
Until some better option shows up, RDF
and OWL can provide this view
“Tower of Babel”, Pieter Brueghel the Elder, 1563; Kunsthistorisches Museum, Wien
We are here
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF vs. labeled property graphs…?
Confuses users (when given the choice)
Should not matter
The real issue may in fact be
Graph as a logical representation vs. graph as a data structure
The practical problem is still that we have two separate ecosystems
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF Labeled property graphs
Pros:
• W3C standard, well established
• easy to use external data sources
• schema language(s)
• formal semantics for graph merging
• supports reasoning
Cons:
• often considered “too academic”
• no easy “on-ramp”
Pros:
• intuitive for software developers
• integrates well with programming
languages (esp. Gremlin)
Cons:
• no standard, many divergent
(proprietary) implementations
• no schema language
• no formal semantics
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF Labeled property graphs
Pros:
• W3C standard, well established
• easy to use external data sources
• schema language(s)
• formal semantics for graph merging
• supports reasoning
Pros:
• intuitive for software developers
• integrates well with programming
languages (esp. Gremlin)
We want both!
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph as an abstraction
What we really want is a unifying metamodel that covers both RDF and LPGs
• helps adoption à important for the graph industry
• gives users more choices (e.g., Gremlin over RDF)
We are working on this (codename “One Graph”)
This is not an easy undertaking, however…
© 2021, Amazon Web Services, Inc. or its Affiliates.
Some challenges in unifying RDF and LPGs
RDF and RDF-star
semantics
• triples are unique, but LPG edges are not
• (“old style” reification does not have this problem)
Lack of formal
semantics for LPGs
• difficult to determine “correct” semantics
• also: diversity of scalar datatypes
Graph partitioning
• no named graphs in current LPGs
• in RDF vertices are just identifiers (no structure), but
in LPGs vertices are structured objects
Update semantics
• deleting non-unique triples: what happens?
• cascading delete for edge properties?
© 2021, Amazon Web Services, Inc. or its Affiliates.
The way forward (or: what are the real pain points?)
How do I write software that leverages a
knowledge graph?
• bridging the “graph ←→ code -gap”
• low-level interfaces (lots of boilerplate
code, etc.)
I mapped my data to an ontology… now what?
(or: what is “ontology support”…?)
• role of reasoning
• “hiding” reasoning works well
• transitive taxonomies & owl:sameAs
(including inverse functional properties)
New interfaces
and abstractions
Libraries, “ontology
engine(s)”…?
© 2021, Amazon Web Services, Inc. or its Affiliates.
The way forward (or: what are the real pain points?)
What about property graphs, Gremlin, …?
• choosing between RDF and property graph
models is hard and leads to confusion
• RDF-star to the rescue?
Is there a way out of the silos that does not
involve never-ending bespoke integrations?
• modern tools do not offer a single unified
view of all data
Graphs as a unifying
logical view
“One Graph”
unified metamodel
Lassila et al: “Graph? Yes! Which one?
Help!”, SCG2021 @ Semantics 2021
(arXiv:2110.13348)
© 2021, Amazon Web Services, Inc. or its Affiliates.
Questions?
Contact: ora@amazon.com
More info: https://aws.amazon.com/neptune/knowledge-graphs-on-aws/
https://arxiv.org/abs/2110.13348

More Related Content

What's hot

The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphCambridge Semantics
 
Introduction to Knowledge Graphs
Introduction to Knowledge GraphsIntroduction to Knowledge Graphs
Introduction to Knowledge Graphsmukuljoshi
 
Enterprise Knowledge Graph
Enterprise Knowledge GraphEnterprise Knowledge Graph
Enterprise Knowledge GraphLukas Masuch
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Chapter 1 semantic web
Chapter 1 semantic webChapter 1 semantic web
Chapter 1 semantic webR A Akerkar
 
DSpace 7 - The Angular UI from a user’s perspective
DSpace 7 - The Angular UI from a user’s perspectiveDSpace 7 - The Angular UI from a user’s perspective
DSpace 7 - The Angular UI from a user’s perspectiveAtmire
 
SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020andyseaborne
 
NoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and WhereNoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and WhereEugene Hanikblum
 
Debunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative FactsDebunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative FactsNeo4j
 
The Semantic Web: An Introduction
The Semantic Web: An IntroductionThe Semantic Web: An Introduction
The Semantic Web: An IntroductionElena Simperl
 
Tailwind CSS.11.pptx
Tailwind CSS.11.pptxTailwind CSS.11.pptx
Tailwind CSS.11.pptxHarish Verma
 
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...Jeff Z. Pan
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceCambridge Semantics
 
Html / CSS Presentation
Html / CSS PresentationHtml / CSS Presentation
Html / CSS PresentationShawn Calvert
 
Comparison and mapping between various cloud services 2019
Comparison and mapping between various cloud services 2019Comparison and mapping between various cloud services 2019
Comparison and mapping between various cloud services 2019jones4u
 

What's hot (20)

The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge Graph
 
Apache Atlas: Governance for your Data
Apache Atlas: Governance for your DataApache Atlas: Governance for your Data
Apache Atlas: Governance for your Data
 
Introduction to Knowledge Graphs
Introduction to Knowledge GraphsIntroduction to Knowledge Graphs
Introduction to Knowledge Graphs
 
Introduction to GraphQL
Introduction to GraphQLIntroduction to GraphQL
Introduction to GraphQL
 
Enterprise Knowledge Graph
Enterprise Knowledge GraphEnterprise Knowledge Graph
Enterprise Knowledge Graph
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Chapter 1 semantic web
Chapter 1 semantic webChapter 1 semantic web
Chapter 1 semantic web
 
DSpace 7 - The Angular UI from a user’s perspective
DSpace 7 - The Angular UI from a user’s perspectiveDSpace 7 - The Angular UI from a user’s perspective
DSpace 7 - The Angular UI from a user’s perspective
 
Introduction to Amazon DynamoDB
Introduction to Amazon DynamoDBIntroduction to Amazon DynamoDB
Introduction to Amazon DynamoDB
 
SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020
 
NoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and WhereNoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and Where
 
Debunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative FactsDebunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative Facts
 
The Semantic Web: An Introduction
The Semantic Web: An IntroductionThe Semantic Web: An Introduction
The Semantic Web: An Introduction
 
Tailwind CSS.11.pptx
Tailwind CSS.11.pptxTailwind CSS.11.pptx
Tailwind CSS.11.pptx
 
Mongo DB
Mongo DBMongo DB
Mongo DB
 
Introducing DynamoDB
Introducing DynamoDBIntroducing DynamoDB
Introducing DynamoDB
 
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
Linked Data and Knowledge Graphs -- Constructing and Understanding Knowledge ...
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
Html / CSS Presentation
Html / CSS PresentationHtml / CSS Presentation
Html / CSS Presentation
 
Comparison and mapping between various cloud services 2019
Comparison and mapping between various cloud services 2019Comparison and mapping between various cloud services 2019
Comparison and mapping between various cloud services 2019
 

Similar to Graph Abstractions Matter by Ora Lassila

Graph database in sv meetup
Graph database in sv meetupGraph database in sv meetup
Graph database in sv meetupJoshua Bae
 
Arches Getty Brownbag Talk
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talkbenosteen
 
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...Jean Ihm
 
Making the semantic web work
Making the semantic web workMaking the semantic web work
Making the semantic web workPaul Houle
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4jSina Khorami
 
Sharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsSharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsGeorge Stathis
 
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)Emil Eifrem
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?Samet KILICTAS
 
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)Emil Eifrem
 
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SFTed Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SFMLconf
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph SchemaJoshua Shinavier
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneDoug Needham
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data ModelingVital.AI
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialLeeFeigenbaum
 
Large Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraphLarge Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraphP. Taylor Goetz
 

Similar to Graph Abstractions Matter by Ora Lassila (20)

Graph database in sv meetup
Graph database in sv meetupGraph database in sv meetup
Graph database in sv meetup
 
Arches Getty Brownbag Talk
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talk
 
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
 
Making the semantic web work
Making the semantic web workMaking the semantic web work
Making the semantic web work
 
JSON-LD and SHACL for Knowledge Graphs
JSON-LD and SHACL for Knowledge GraphsJSON-LD and SHACL for Knowledge Graphs
JSON-LD and SHACL for Knowledge Graphs
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4j
 
Sharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsSharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data Lessons
 
Presentation1
Presentation1Presentation1
Presentation1
 
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?
 
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
 
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SFTed Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
 
Some NoSQL
Some NoSQLSome NoSQL
Some NoSQL
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph Schema
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZone
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web Tutorial
 
Large Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraphLarge Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraph
 

More from Connected Data World

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenConnected Data World
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Connected Data World
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine LearningConnected Data World
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is hereConnected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2Connected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3Connected Data World
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data ModelConnected Data World
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseConnected Data World
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Connected Data World
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Connected Data World
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleConnected Data World
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Connected Data World
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the WebConnected Data World
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsConnected Data World
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsConnected Data World
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...Connected Data World
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGOConnected Data World
 
What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?Connected Data World
 

More from Connected Data World (20)

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van Harmelen
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine Learning
 
Graphs in sustainable finance
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable finance
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the Web
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGO
 
What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?
 

Recently uploaded

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Recently uploaded (20)

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Graph Abstractions Matter by Ora Lassila

  • 1. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph Abstractions Matter (a Personal View) Dr. Ora Lassila Principal Technologist Amazon Neptune
  • 2. © 2021, Amazon Web Services, Inc. or its Affiliates. Game plan: 1. Some graph history 2. Graph abstractions, queries and access patterns 3. Graph as an abstraction: “One graph to rule them all” 4. The way forward
  • 3. © 2021, Amazon Web Services, Inc. or its Affiliates. A brief history of graphs and ontologies 1730s: Graph theory (Euler) 1870s: Library classification (Dewey) 1900: Semantics, ontology and logic (Husserl) 1950s and onwards: Graphs as the essential underpinning of computer science 1960s-1970s: Network databases (CODASYL), semantic networks (Quillian et al) 1970s-1990s: Predicate logic as the foundation of Knowledge Representation (Hayes et al) 1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al) Today: Modern knowledge graphs and graph databases 3rd Century BCE: Categories & logic (Aristotle) 1730s: Taxonomical classification of plants and animals (Linnaeus) 1960s: Social networks, “small-world experiment”, Erdős number (Milgram et al)
  • 4. © 2021, Amazon Web Services, Inc. or its Affiliates. 1730s: Graph theory (Euler) 1870s: Library classification (Dewey) 1900: Semantics, ontology and logic (Husserl) 1950s and onwards: Graphs as the essential underpinning of computer science 1960s-1970s: Network databases (CODASYL), semantic networks (Quillian et al) 1970s-1990s: Predicate logic as the foundation of Knowledge Representation (Hayes et al) Today: Modern knowledge graphs and graph databases 3rd Century BCE: Categories & logic (Aristotle) 1730s: Taxonomical classification of plants and animals (Linnaeus) 1960s: Social networks, “small-world experiment”, Erdős number (Milgram et al) A brief history of graphs and ontologies 1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al) Despite all this history, programming with graphs and ontologies is still cumbersome
  • 5. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF Labeled property graphs Features: • vertices and edge types: identifiers • graphs decompose into “triples” • scalar data types from XSD, values are special vertices • no composite data types; objects use graph structure Query languages: • SPARQL (declarative) Features: • vertices and edges: objects • no common identifier scheme • scalar and composite data types borrowed from implementation languages Query languages: • Gremlin (imperative) • Cypher (declarative)
  • 6. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph abstractions Triples are a very low-level abstraction of graphs • akin to assembly language Mapping graphs to trees does not necessarily work • metamodel mismatch • RDF/XML and JSON-LD as warning examples • (also: syntax does not matter!) “Object-graph mapping”…? • how to determine what constitutes an object? • in RDF, blank nodes can be difficult to handle
  • 7. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph queries (or: “how do I get there from here?”) Traversal is an excellent graph access paradigm • “where can I go from here given this path pattern?” • “are these two nodes connected, and how?” • using path patterns for traversal is akin to pattern matching [Lassila 2007] SPARQL is a ”very big hammer” for many access problems We need simple APIs: • search, faceted search (filtering) • “give me everything you know about this node” (cf. object-graph mapping) • traversal
  • 8. © 2021, Amazon Web Services, Inc. or its Affiliates. Models and ontologies Problems: 1. LPGs (by and large) do not have schema languages 2. RDF has too many schema languages (RDFS, OWL, SHACL) 3. how to map models to code? What does mapping your data to a shared ontology “buy” you? Can we support ontologies with predefined libraries? • ”ontology engines”
  • 9. © 2021, Amazon Web Services, Inc. or its Affiliates. What about reasoning? Symbolic reasoning: • generative reasoning can be easily hidden from the application/client • restrictions as a means of data validation (SHACL rather than OWL) Non-symbolic reasoning: • ML techniques can be used much like symbolic reasoning We need an expanded view of reasoning, and we need to rethink what an “application” is: Logic should be associated with data and models, not with an application
  • 10. © 2021, Amazon Web Services, Inc. or its Affiliates. Graphs as a logical abstraction Modern enterprise data practice is messy Data silos are commonplace Data integration often happens via bespoke and ad hoc solutions We need a unifying logical view of data (a lingua franca for data integration) or risk ongoing and ever-increasing fragmentation Until some better option shows up, RDF and OWL can provide this view “Tower of Babel”, Pieter Brueghel the Elder, 1563; Kunsthistorisches Museum, Wien We are here
  • 11. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF vs. labeled property graphs…? Confuses users (when given the choice) Should not matter The real issue may in fact be Graph as a logical representation vs. graph as a data structure The practical problem is still that we have two separate ecosystems
  • 12. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF Labeled property graphs Pros: • W3C standard, well established • easy to use external data sources • schema language(s) • formal semantics for graph merging • supports reasoning Cons: • often considered “too academic” • no easy “on-ramp” Pros: • intuitive for software developers • integrates well with programming languages (esp. Gremlin) Cons: • no standard, many divergent (proprietary) implementations • no schema language • no formal semantics
  • 13. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF Labeled property graphs Pros: • W3C standard, well established • easy to use external data sources • schema language(s) • formal semantics for graph merging • supports reasoning Pros: • intuitive for software developers • integrates well with programming languages (esp. Gremlin) We want both!
  • 14. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph as an abstraction What we really want is a unifying metamodel that covers both RDF and LPGs • helps adoption à important for the graph industry • gives users more choices (e.g., Gremlin over RDF) We are working on this (codename “One Graph”) This is not an easy undertaking, however…
  • 15. © 2021, Amazon Web Services, Inc. or its Affiliates. Some challenges in unifying RDF and LPGs RDF and RDF-star semantics • triples are unique, but LPG edges are not • (“old style” reification does not have this problem) Lack of formal semantics for LPGs • difficult to determine “correct” semantics • also: diversity of scalar datatypes Graph partitioning • no named graphs in current LPGs • in RDF vertices are just identifiers (no structure), but in LPGs vertices are structured objects Update semantics • deleting non-unique triples: what happens? • cascading delete for edge properties?
  • 16. © 2021, Amazon Web Services, Inc. or its Affiliates. The way forward (or: what are the real pain points?) How do I write software that leverages a knowledge graph? • bridging the “graph ←→ code -gap” • low-level interfaces (lots of boilerplate code, etc.) I mapped my data to an ontology… now what? (or: what is “ontology support”…?) • role of reasoning • “hiding” reasoning works well • transitive taxonomies & owl:sameAs (including inverse functional properties) New interfaces and abstractions Libraries, “ontology engine(s)”…?
  • 17. © 2021, Amazon Web Services, Inc. or its Affiliates. The way forward (or: what are the real pain points?) What about property graphs, Gremlin, …? • choosing between RDF and property graph models is hard and leads to confusion • RDF-star to the rescue? Is there a way out of the silos that does not involve never-ending bespoke integrations? • modern tools do not offer a single unified view of all data Graphs as a unifying logical view “One Graph” unified metamodel Lassila et al: “Graph? Yes! Which one? Help!”, SCG2021 @ Semantics 2021 (arXiv:2110.13348)
  • 18. © 2021, Amazon Web Services, Inc. or its Affiliates. Questions? Contact: ora@amazon.com More info: https://aws.amazon.com/neptune/knowledge-graphs-on-aws/ https://arxiv.org/abs/2110.13348