SlideShare a Scribd company logo
Selecting the right database for
your semantic storage needs
Taxonomy Boot Camp London 2017
Jim Sweeney, Senior Product Manager
Taxonomy and Ontology Solutions
Database Types
There are a number of different
database types available. We will look
at three key designs as they relate to
storing semantic information.
• Relational Database Management
Systems (RDBMS)
• Property Graph Databases
• RDF Databases (a specialized form
of Graph Database)
A Brief History
• The relational database did not appear until the late
1970’s replacing Navigational database designs of the
previous decade.
• The first commercial versions started to appear in the
early 1980’s with IBM’s DB2 and Oracle’s products
leading the way.
• Microsoft released their SQL Server platform in 1989.
• These three providers continue to be among the leaders
as far as market share across all database types.
A Brief History (Continued)
• Graph database systems begin
to appear in the late 1990’s,
coinciding with the adoption of
RDF as a W3C recommendation.
• Leading Property Graph and
RDF database providers such as
Neo4J, GraphDB, and Jena (to
name just a few) continue to
grow in market share but still
represent just a fraction of the
total market share.
Rank DBMS
Database
Model Score
Oct Oct Oct Sep Oct
2017 2016 2017 2017 2016
1 1 Oracle Relational DBMS 1348.8 -10.29 -68.3
2 2 MySQL Relational DBMS 1298.83 -13.78 -63.82
3 3 Microsoft SQL Server Relational DBMS 1210.32 -2.23 -3.86
4 5 PostgreSQL Relational DBMS 373.27 0.91 54.58
5 4 MongoDB Document store 329.4 -3.33 10.6
6 6 DB2 Relational DBMS 194.59 -3.75 14.03
7 8 Microsoft Access Relational DBMS 129.45 0.64 4.78
8 7 Cassandra Wide column store 124.79 -1.41 -10.27
9 9 Redis Key-value store 122.05 1.65 12.51
10 11 Elasticsearch Search engine 120.23 0.23 21.12
21 21 Neo4j Graph DBMS 37.95 -0.48 1.5
89 86 Jena RDF store 2.31 0.04 0.5
Relational Databases
Management Systems (RDBMS)
• RDBMS are designed using multiple
tables to store data.
• Each table is made up of headers,
rows, and columns, much like a
spreadsheet.
• Each row contains different
elements, including a primary key
and zero to many foreign keys.
• These keys are used to link
information in one table to another
table, which will contain distinct but
related information. Image from neo4j.com
RDBMS Design
• The use of a key in each
table means that there is
no “relationship” as we
would find in natural
language or a graph
structure.
• Table values themselves
are what link the data
together by creating a
join between data
elements using keys. Image from neo4j.com
Pros and Cons of
RDBMS
Pros:
• RDBMS are tried and tested over many
decades, and generally have more
features available than newer
platforms.
• RDBMS are very efficient at storing
indexable information that is described
well in “tabular” structures, such as
accounting or inventory data.
• Data is often indexed at the time of
entry, making keyword searches and
those with limited table joins relatively
fast.
Pros and Cons of
RDBMS
Cons:
• RDBMS require a foreknowledge of all
the data, and consequently data tables,
that you will be storing. (Graph
structures provide more ad hoc design
flexibility.)
• RDBMS don’t perform as well in pattern
matching searches as the number of
relationships to be traversed increases.
• SQL language, while widely known, is
slightly more complex than those used
with Property Graph or RDF systems.
RDBMS
Deconstructed
• As we have seen,
Relational Databases use
tables and key values to
link information
together.
• Let’s focus on just the
highlighted components
to compare this design
with that of a Graph
database.
Image from neo4j.com
The Graph
Model
• With a graph database, such as
Neo4j, each node is a standalone
item that describes a fully-formed
member of the database.
• To link the nodes together, one uses
relationships. These are referred to
as edges.
• The nodes and edges form uniform
structures which are repeated to
relate all of the data elements,
completely describing all necessary
aspects of the information in the
database. Image from neo4j.com
The Graph Model
(continued)
• If we look at our example with
Alice, we see that she belongs to
several departments. The nature of
the link from the person, Alice, to
the Departments is described by
the relationship itself, in this case
“:BELONGS_TO”.
• The node-edge-node combination
that is formed has Alice belonging
to the department “4FUTURE”.
• No longer are we linking items via
table joins. We are now describing
the links is a more human
understandable format.
Image from neo4j.com
RDF – Standardizing the Triple
• RDF or Resource Description
Framework provides standards that
we use to uniformly describe the
relationships between nodes, as well
as describe other attributes that we
want to assign to our data as
metadata.
• The standard establishes a subject-
predicate-object unit called a triple.
• One such RDF standard, FOAF
(Friend of a Friend), is a collection of
relationships and attributes
describing people, their relationships
to one another, and their activities.
• In this example, we see Alice is not
just identified as a member of
departments, but also as a family
member based near a location, and
with a descriptive home page.
RDF – Standardizing the Triple
• To extend the previous
example, Denver might be
linked to the state of Colorado,
in the United States, in North
America using RDF OWL.
• Using Graph database design,
and employing RDF descriptive
language collections, one can
create links between entities
that may virtually go on and on.
• What we create becomes a
dynamic, three dimensional
database model capable of
establishing true semantic
Ontologies.
Linked Data and
the Semantic Web
• In 1991 Tim Berners-Lee described what he
called the World Wide Web, which was
meant to, “allow links to be made to any
information anywhere.”1
• In the 2010’s we are seeing the concept of
Linked Data realizing this goal, utilizing RDF
standards and other technologies such as
SPARQL (Protocol and RDF Query Language)
and JSON (JavaScript Object Notation).
• With Linked Data protocols, one may send
queries across the web to retrieve specific
information from any source published in this
format.
• While this is possible using RDBMS
technologies as well, Graph databases speak
the native language of Linked Data and start
to introduce some of the most exciting
potential of the World Wide Web.
Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar,
Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/
1 https://www.w3.org/People/Berners-Lee/1991/08/art-6484.txt
Pros and Cons of Graph
and RDF Databases
Pros:
• Graph databases are very well
suited to storing information that
contains many relationships and
data points, leading to fast
performance for pattern based
searches.
• SPARQL, Cypher, and other Graph
query languages are generally
simple and powerful.
• RDF data stores can connect easily
and natively to Linked Data sources.
Pros and Cons of Graph
and RDF Databases
Cons:
• Because they are not indexed as
RDBMS are, they are not as effective
in handing very large numbers of
transactions or queries that need to
capture linked information across an
entire database at once.
• Newer technology, less tested, not
as many providers, and not as well
supported, yet.
• May not integrate well with legacy
systems that will be consuming
semantic information.
Conclusions
• RDMBS offer a tried and true technology that is more
than capable of providing the database layer for
most semantic technology applications and services.
• Their wide adoption in nearly all data management
environments make them easy to support and
integrate with external systems.
• When properly indexed and designed, they are very
good at handling many, simultaneous transactions.
• And, they are able to handle some advanced
functionality enabled by RDF and Linked Data with a
little design expertise.
Conclusions
• Graph and RDF databases are newer to the market and have
yet to come close to the saturation levels that RDBMS enjoy,
but they offer a design that is intrinsically effective at
managing semantic information and the technologies around
RDF and Linked Data.
• Graph and RDF databases don’t require foreknowledge or pre-
definition necessary for RDMBS design, making them able to
evolve rapidly and organically.
• Because Graph and RDF databases use basic, semantic
structure as a key design characteristic they are very good at
answering questions posed by real-life queries that we find in
everyday language.
Resources
• Explanation of Graph Structure: http://www.zdnet.com/pictures/a-
graph-is-a-graph-is-graph-rdf-vs-lpg/
• Data Modeling with Graph Databases: https://youtu.be/r4IGWNovzes
• Designing and Building Semantic Applications with Linked Data:
https://youtu.be/ToALLwM7KIw
Thank You!
Jim Sweeney, Senior Product Manager
jim.sweeney@synaptica.com
www.synaptica.com
Twitter: @synaptica

More Related Content

What's hot

NoSQL Basics - a quick tour
NoSQL Basics - a quick tourNoSQL Basics - a quick tour
NoSQL Basics - a quick tour
Bikram Sinha. MBA, PMP
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapubeswcsummerschool
 
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortalsChapter 4 terminolgy of keyvalue databses from nosql for mere mortals
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
nehabsairam
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Anja Jentzsch
 
NoSQL Basics and MongDB
NoSQL Basics and  MongDBNoSQL Basics and  MongDB
NoSQL Basics and MongDB
Shamima Yeasmin Mukta
 
Providing Linked Data
Providing Linked DataProviding Linked Data
Providing Linked Data
EUCLID project
 
Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edge
Bhavya Gulati
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Database
nehabsairam
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection?
Debra Shapiro
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
EUCLID project
 
The Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open DataThe Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open Data
Ontotext
 
Type of Database Management System
Type of Database Management SystemType of Database Management System
Type of Database Management System
Param Radadiya
 
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Build Narratives, Connect Artifacts: Linked Open Data for Cultural HeritageBuild Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Ontotext
 
Semantic Web (Web 3.0)
Semantic Web (Web 3.0)Semantic Web (Web 3.0)
Semantic Web (Web 3.0)
John Dougherty
 
Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...
Aksw Group
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdf
Heiko Paulheim
 
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
andimou
 
Oslo baksia2014
Oslo baksia2014Oslo baksia2014
Oslo baksia2014
Max Neunhöffer
 

What's hot (20)

NoSQL Basics - a quick tour
NoSQL Basics - a quick tourNoSQL Basics - a quick tour
NoSQL Basics - a quick tour
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
 
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortalsChapter 4 terminolgy of keyvalue databses from nosql for mere mortals
Chapter 4 terminolgy of keyvalue databses from nosql for mere mortals
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)
 
NoSQL Basics and MongDB
NoSQL Basics and  MongDBNoSQL Basics and  MongDB
NoSQL Basics and MongDB
 
Providing Linked Data
Providing Linked DataProviding Linked Data
Providing Linked Data
 
Big data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edgeBig data analytics: Technology's bleeding edge
Big data analytics: Technology's bleeding edge
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Database
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection?
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
The Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open DataThe Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open Data
 
Graph db
Graph dbGraph db
Graph db
 
Type of Database Management System
Type of Database Management SystemType of Database Management System
Type of Database Management System
 
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Build Narratives, Connect Artifacts: Linked Open Data for Cultural HeritageBuild Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
Build Narratives, Connect Artifacts: Linked Open Data for Cultural Heritage
 
Semantic Web (Web 3.0)
Semantic Web (Web 3.0)Semantic Web (Web 3.0)
Semantic Web (Web 3.0)
 
Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...Linked Data Notifications Distributed Update Notification and Propagation on ...
Linked Data Notifications Distributed Update Notification and Propagation on ...
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdf
 
Presentation1
Presentation1Presentation1
Presentation1
 
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
 
Oslo baksia2014
Oslo baksia2014Oslo baksia2014
Oslo baksia2014
 

Similar to Selecting the right database type for your knowledge management needs.

NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and Uses
Suvradeep Rudra
 
NoSQL Databases
NoSQL DatabasesNoSQL Databases
NoSQL Databases
Carlos Alberto Benitez
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
Ramakant Soni
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
ajajkhan16
 
Modern database
Modern databaseModern database
Modern database
Rashid Ansari
 
UNIT-2.pptx
UNIT-2.pptxUNIT-2.pptx
UNIT-2.pptx
SIVAKUMARM603675
 
No SQL- The Future Of Data Storage
No SQL- The Future Of Data StorageNo SQL- The Future Of Data Storage
No SQL- The Future Of Data Storage
Bethmi Gunasekara
 
data base system to new data science lerne
data base system to new data science lernedata base system to new data science lerne
data base system to new data science lerne
tarunprajapati0t
 
Presentation On NoSQL Databases
Presentation On NoSQL DatabasesPresentation On NoSQL Databases
Presentation On NoSQL Databases
Abiral Gautam
 
NoSQL and MongoDB
NoSQL and MongoDBNoSQL and MongoDB
NoSQL and MongoDB
Rajesh Menon
 
No sql – rise of the clusters
No sql – rise of the clustersNo sql – rise of the clusters
No sql – rise of the clusters
responseteam
 
Unit-10.pptx
Unit-10.pptxUnit-10.pptx
Unit-10.pptx
GhanashyamBK1
 
Choosing your NoSQL storage
Choosing your NoSQL storageChoosing your NoSQL storage
Choosing your NoSQL storage
Imteyaz Khan
 
Unit II -BIG DATA ANALYTICS.docx
Unit II -BIG DATA ANALYTICS.docxUnit II -BIG DATA ANALYTICS.docx
Unit II -BIG DATA ANALYTICS.docx
vvpadhu
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
pinstechwork
 
Database Technologies
Database TechnologiesDatabase Technologies
Database Technologies
Michel de Goede
 
2.Introduction to NOSQL (Core concepts).pptx
2.Introduction to NOSQL (Core concepts).pptx2.Introduction to NOSQL (Core concepts).pptx
2.Introduction to NOSQL (Core concepts).pptx
RushikeshChikane2
 
Report 1.0.docx
Report 1.0.docxReport 1.0.docx
Report 1.0.docx
pinstechwork
 
NOsql Presentation.pdf
NOsql Presentation.pdfNOsql Presentation.pdf
NOsql Presentation.pdf
AkshayDwivedi31
 

Similar to Selecting the right database type for your knowledge management needs. (20)

NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and Uses
 
NoSQL Databases
NoSQL DatabasesNoSQL Databases
NoSQL Databases
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
Modern database
Modern databaseModern database
Modern database
 
UNIT-2.pptx
UNIT-2.pptxUNIT-2.pptx
UNIT-2.pptx
 
No SQL- The Future Of Data Storage
No SQL- The Future Of Data StorageNo SQL- The Future Of Data Storage
No SQL- The Future Of Data Storage
 
data base system to new data science lerne
data base system to new data science lernedata base system to new data science lerne
data base system to new data science lerne
 
Presentation On NoSQL Databases
Presentation On NoSQL DatabasesPresentation On NoSQL Databases
Presentation On NoSQL Databases
 
NoSQL and MongoDB
NoSQL and MongoDBNoSQL and MongoDB
NoSQL and MongoDB
 
No sql – rise of the clusters
No sql – rise of the clustersNo sql – rise of the clusters
No sql – rise of the clusters
 
Unit-10.pptx
Unit-10.pptxUnit-10.pptx
Unit-10.pptx
 
Choosing your NoSQL storage
Choosing your NoSQL storageChoosing your NoSQL storage
Choosing your NoSQL storage
 
NoSQL
NoSQLNoSQL
NoSQL
 
Unit II -BIG DATA ANALYTICS.docx
Unit II -BIG DATA ANALYTICS.docxUnit II -BIG DATA ANALYTICS.docx
Unit II -BIG DATA ANALYTICS.docx
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
Database Technologies
Database TechnologiesDatabase Technologies
Database Technologies
 
2.Introduction to NOSQL (Core concepts).pptx
2.Introduction to NOSQL (Core concepts).pptx2.Introduction to NOSQL (Core concepts).pptx
2.Introduction to NOSQL (Core concepts).pptx
 
Report 1.0.docx
Report 1.0.docxReport 1.0.docx
Report 1.0.docx
 
NOsql Presentation.pdf
NOsql Presentation.pdfNOsql Presentation.pdf
NOsql Presentation.pdf
 

More from Synaptica, LLC

Using ontologies for more than information categorization
Using ontologies for more than information categorizationUsing ontologies for more than information categorization
Using ontologies for more than information categorization
Synaptica, LLC
 
Text Analytics for Non-Experts
Text Analytics for Non-ExpertsText Analytics for Non-Experts
Text Analytics for Non-Experts
Synaptica, LLC
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
Synaptica, LLC
 
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use CasesTaxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
Synaptica, LLC
 
SKOS-XL vs. Traditional Term Based Taxonomy Management
SKOS-XL vs. Traditional Term Based Taxonomy ManagementSKOS-XL vs. Traditional Term Based Taxonomy Management
SKOS-XL vs. Traditional Term Based Taxonomy Management
Synaptica, LLC
 
Successfully Managing Multilingual Taxonomies: 3 Methods
Successfully Managing Multilingual Taxonomies: 3 MethodsSuccessfully Managing Multilingual Taxonomies: 3 Methods
Successfully Managing Multilingual Taxonomies: 3 Methods
Synaptica, LLC
 
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
Synaptica, LLC
 
ProQuest Taxonomy Boot Camp Presentation 2008
ProQuest Taxonomy Boot Camp Presentation 2008ProQuest Taxonomy Boot Camp Presentation 2008
ProQuest Taxonomy Boot Camp Presentation 2008
Synaptica, LLC
 
Re-engineering Taxonomy Warehouse as an Ontology
Re-engineering Taxonomy Warehouse as an OntologyRe-engineering Taxonomy Warehouse as an Ontology
Re-engineering Taxonomy Warehouse as an Ontology
Synaptica, LLC
 
Synaptica Proquest Talk Taxonomy Boot Camp 2009
Synaptica Proquest Talk Taxonomy Boot Camp 2009Synaptica Proquest Talk Taxonomy Boot Camp 2009
Synaptica Proquest Talk Taxonomy Boot Camp 2009
Synaptica, LLC
 

More from Synaptica, LLC (10)

Using ontologies for more than information categorization
Using ontologies for more than information categorizationUsing ontologies for more than information categorization
Using ontologies for more than information categorization
 
Text Analytics for Non-Experts
Text Analytics for Non-ExpertsText Analytics for Non-Experts
Text Analytics for Non-Experts
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use CasesTaxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
Taxonomies Crossing Boundaries: Thomson Reuters Life Sciences Taxonomy Use Cases
 
SKOS-XL vs. Traditional Term Based Taxonomy Management
SKOS-XL vs. Traditional Term Based Taxonomy ManagementSKOS-XL vs. Traditional Term Based Taxonomy Management
SKOS-XL vs. Traditional Term Based Taxonomy Management
 
Successfully Managing Multilingual Taxonomies: 3 Methods
Successfully Managing Multilingual Taxonomies: 3 MethodsSuccessfully Managing Multilingual Taxonomies: 3 Methods
Successfully Managing Multilingual Taxonomies: 3 Methods
 
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
Enterprise vs. Federated Taxonomy Management - Taxonomy Boot Camp 2012
 
ProQuest Taxonomy Boot Camp Presentation 2008
ProQuest Taxonomy Boot Camp Presentation 2008ProQuest Taxonomy Boot Camp Presentation 2008
ProQuest Taxonomy Boot Camp Presentation 2008
 
Re-engineering Taxonomy Warehouse as an Ontology
Re-engineering Taxonomy Warehouse as an OntologyRe-engineering Taxonomy Warehouse as an Ontology
Re-engineering Taxonomy Warehouse as an Ontology
 
Synaptica Proquest Talk Taxonomy Boot Camp 2009
Synaptica Proquest Talk Taxonomy Boot Camp 2009Synaptica Proquest Talk Taxonomy Boot Camp 2009
Synaptica Proquest Talk Taxonomy Boot Camp 2009
 

Recently uploaded

E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
Hornet Dynamics
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Neo4j
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
Google
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
Roshan Dwivedi
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
Google
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
Donna Lenk
 
Empowering Growth with Best Software Development Company in Noida - Deuglo
Empowering Growth with Best Software  Development Company in Noida - DeugloEmpowering Growth with Best Software  Development Company in Noida - Deuglo
Empowering Growth with Best Software Development Company in Noida - Deuglo
Deuglo Infosystem Pvt Ltd
 
Mobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona InfotechMobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona Infotech
Drona Infotech
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
TheSMSPoint
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
Boni García
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
Octavian Nadolu
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
Max Andersen
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOMLORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
lorraineandreiamcidl
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
Aftab Hussain
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
timtebeek1
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Mind IT Systems
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
Shane Coughlan
 

Recently uploaded (20)

E-commerce Application Development Company.pdf
E-commerce Application Development Company.pdfE-commerce Application Development Company.pdf
E-commerce Application Development Company.pdf
 
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissancesAtelier - Innover avec l’IA Générative et les graphes de connaissances
Atelier - Innover avec l’IA Générative et les graphes de connaissances
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI AppAI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
AI Fusion Buddy Review: Brand New, Groundbreaking Gemini-Powered AI App
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
 
Empowering Growth with Best Software Development Company in Noida - Deuglo
Empowering Growth with Best Software  Development Company in Noida - DeugloEmpowering Growth with Best Software  Development Company in Noida - Deuglo
Empowering Growth with Best Software Development Company in Noida - Deuglo
 
Mobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona InfotechMobile App Development Company In Noida | Drona Infotech
Mobile App Development Company In Noida | Drona Infotech
 
Transform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR SolutionsTransform Your Communication with Cloud-Based IVR Solutions
Transform Your Communication with Cloud-Based IVR Solutions
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
 
Artificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension FunctionsArtificia Intellicence and XPath Extension Functions
Artificia Intellicence and XPath Extension Functions
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOMLORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE ZOOM
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 5th June 2024
 
Graspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code AnalysisGraspan: A Big Data System for Big Code Analysis
Graspan: A Big Data System for Big Code Analysis
 
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdfAutomated software refactoring with OpenRewrite and Generative AI.pptx.pdf
Automated software refactoring with OpenRewrite and Generative AI.pptx.pdf
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
 
openEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain SecurityopenEuler Case Study - The Journey to Supply Chain Security
openEuler Case Study - The Journey to Supply Chain Security
 

Selecting the right database type for your knowledge management needs.

  • 1. Selecting the right database for your semantic storage needs Taxonomy Boot Camp London 2017 Jim Sweeney, Senior Product Manager Taxonomy and Ontology Solutions
  • 2. Database Types There are a number of different database types available. We will look at three key designs as they relate to storing semantic information. • Relational Database Management Systems (RDBMS) • Property Graph Databases • RDF Databases (a specialized form of Graph Database)
  • 3. A Brief History • The relational database did not appear until the late 1970’s replacing Navigational database designs of the previous decade. • The first commercial versions started to appear in the early 1980’s with IBM’s DB2 and Oracle’s products leading the way. • Microsoft released their SQL Server platform in 1989. • These three providers continue to be among the leaders as far as market share across all database types.
  • 4. A Brief History (Continued) • Graph database systems begin to appear in the late 1990’s, coinciding with the adoption of RDF as a W3C recommendation. • Leading Property Graph and RDF database providers such as Neo4J, GraphDB, and Jena (to name just a few) continue to grow in market share but still represent just a fraction of the total market share. Rank DBMS Database Model Score Oct Oct Oct Sep Oct 2017 2016 2017 2017 2016 1 1 Oracle Relational DBMS 1348.8 -10.29 -68.3 2 2 MySQL Relational DBMS 1298.83 -13.78 -63.82 3 3 Microsoft SQL Server Relational DBMS 1210.32 -2.23 -3.86 4 5 PostgreSQL Relational DBMS 373.27 0.91 54.58 5 4 MongoDB Document store 329.4 -3.33 10.6 6 6 DB2 Relational DBMS 194.59 -3.75 14.03 7 8 Microsoft Access Relational DBMS 129.45 0.64 4.78 8 7 Cassandra Wide column store 124.79 -1.41 -10.27 9 9 Redis Key-value store 122.05 1.65 12.51 10 11 Elasticsearch Search engine 120.23 0.23 21.12 21 21 Neo4j Graph DBMS 37.95 -0.48 1.5 89 86 Jena RDF store 2.31 0.04 0.5
  • 5. Relational Databases Management Systems (RDBMS) • RDBMS are designed using multiple tables to store data. • Each table is made up of headers, rows, and columns, much like a spreadsheet. • Each row contains different elements, including a primary key and zero to many foreign keys. • These keys are used to link information in one table to another table, which will contain distinct but related information. Image from neo4j.com
  • 6. RDBMS Design • The use of a key in each table means that there is no “relationship” as we would find in natural language or a graph structure. • Table values themselves are what link the data together by creating a join between data elements using keys. Image from neo4j.com
  • 7. Pros and Cons of RDBMS Pros: • RDBMS are tried and tested over many decades, and generally have more features available than newer platforms. • RDBMS are very efficient at storing indexable information that is described well in “tabular” structures, such as accounting or inventory data. • Data is often indexed at the time of entry, making keyword searches and those with limited table joins relatively fast.
  • 8. Pros and Cons of RDBMS Cons: • RDBMS require a foreknowledge of all the data, and consequently data tables, that you will be storing. (Graph structures provide more ad hoc design flexibility.) • RDBMS don’t perform as well in pattern matching searches as the number of relationships to be traversed increases. • SQL language, while widely known, is slightly more complex than those used with Property Graph or RDF systems.
  • 9. RDBMS Deconstructed • As we have seen, Relational Databases use tables and key values to link information together. • Let’s focus on just the highlighted components to compare this design with that of a Graph database. Image from neo4j.com
  • 10. The Graph Model • With a graph database, such as Neo4j, each node is a standalone item that describes a fully-formed member of the database. • To link the nodes together, one uses relationships. These are referred to as edges. • The nodes and edges form uniform structures which are repeated to relate all of the data elements, completely describing all necessary aspects of the information in the database. Image from neo4j.com
  • 11. The Graph Model (continued) • If we look at our example with Alice, we see that she belongs to several departments. The nature of the link from the person, Alice, to the Departments is described by the relationship itself, in this case “:BELONGS_TO”. • The node-edge-node combination that is formed has Alice belonging to the department “4FUTURE”. • No longer are we linking items via table joins. We are now describing the links is a more human understandable format. Image from neo4j.com
  • 12. RDF – Standardizing the Triple • RDF or Resource Description Framework provides standards that we use to uniformly describe the relationships between nodes, as well as describe other attributes that we want to assign to our data as metadata. • The standard establishes a subject- predicate-object unit called a triple. • One such RDF standard, FOAF (Friend of a Friend), is a collection of relationships and attributes describing people, their relationships to one another, and their activities. • In this example, we see Alice is not just identified as a member of departments, but also as a family member based near a location, and with a descriptive home page.
  • 13. RDF – Standardizing the Triple • To extend the previous example, Denver might be linked to the state of Colorado, in the United States, in North America using RDF OWL. • Using Graph database design, and employing RDF descriptive language collections, one can create links between entities that may virtually go on and on. • What we create becomes a dynamic, three dimensional database model capable of establishing true semantic Ontologies.
  • 14. Linked Data and the Semantic Web • In 1991 Tim Berners-Lee described what he called the World Wide Web, which was meant to, “allow links to be made to any information anywhere.”1 • In the 2010’s we are seeing the concept of Linked Data realizing this goal, utilizing RDF standards and other technologies such as SPARQL (Protocol and RDF Query Language) and JSON (JavaScript Object Notation). • With Linked Data protocols, one may send queries across the web to retrieve specific information from any source published in this format. • While this is possible using RDBMS technologies as well, Graph databases speak the native language of Linked Data and start to introduce some of the most exciting potential of the World Wide Web. Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/ 1 https://www.w3.org/People/Berners-Lee/1991/08/art-6484.txt
  • 15. Pros and Cons of Graph and RDF Databases Pros: • Graph databases are very well suited to storing information that contains many relationships and data points, leading to fast performance for pattern based searches. • SPARQL, Cypher, and other Graph query languages are generally simple and powerful. • RDF data stores can connect easily and natively to Linked Data sources.
  • 16. Pros and Cons of Graph and RDF Databases Cons: • Because they are not indexed as RDBMS are, they are not as effective in handing very large numbers of transactions or queries that need to capture linked information across an entire database at once. • Newer technology, less tested, not as many providers, and not as well supported, yet. • May not integrate well with legacy systems that will be consuming semantic information.
  • 17. Conclusions • RDMBS offer a tried and true technology that is more than capable of providing the database layer for most semantic technology applications and services. • Their wide adoption in nearly all data management environments make them easy to support and integrate with external systems. • When properly indexed and designed, they are very good at handling many, simultaneous transactions. • And, they are able to handle some advanced functionality enabled by RDF and Linked Data with a little design expertise.
  • 18. Conclusions • Graph and RDF databases are newer to the market and have yet to come close to the saturation levels that RDBMS enjoy, but they offer a design that is intrinsically effective at managing semantic information and the technologies around RDF and Linked Data. • Graph and RDF databases don’t require foreknowledge or pre- definition necessary for RDMBS design, making them able to evolve rapidly and organically. • Because Graph and RDF databases use basic, semantic structure as a key design characteristic they are very good at answering questions posed by real-life queries that we find in everyday language.
  • 19. Resources • Explanation of Graph Structure: http://www.zdnet.com/pictures/a- graph-is-a-graph-is-graph-rdf-vs-lpg/ • Data Modeling with Graph Databases: https://youtu.be/r4IGWNovzes • Designing and Building Semantic Applications with Linked Data: https://youtu.be/ToALLwM7KIw
  • 20. Thank You! Jim Sweeney, Senior Product Manager jim.sweeney@synaptica.com www.synaptica.com Twitter: @synaptica