The slides present a benchmark of RDF stores with real-world datasets and queries from the EU Publications Office (PO). The study compares the performance of four commercial triple stores: Stardog 4.3 EE, GraphDB 8.0.3 EE, Oracle 12.2c and Virtuoso 7.2.4.2 with respect to the following requirements: bulk loading, scalability, stability and query execution.
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes (Ontotext)
This presentation will provide a brief introduction to logical reasoning and an overview of the most popular semantic schema and ontology languages: RDFS and the profiles of OWL 2.
While automatic reasoning has always inspired the imagination, numerous projects have failed to deliver on their promises. The typical pitfalls related to ontologies and symbolic reasoning fall into three categories:
- Over-engineered ontologies. The selected ontology language and modeling patterns can be too expressive. This can make the results of inference hard to understand and verify, which in turn makes the KG hard to evolve and maintain. It can also impose performance penalties far greater than the benefits.
- Inappropriate reasoning support. Many inference algorithms and implementation approaches work well with taxonomies and conceptual models of a few thousand concepts, but cannot cope with KGs of millions of entities.
- Inappropriate data layer architecture. One such example is reasoning over a virtual KG, which is often infeasible.
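The scale issue behind the second pitfall is easy to picture: even the simplest RDFS entailments (subclass transitivity plus type propagation) are a fixed-point computation over the whole graph. A minimal forward-chaining sketch in Python, with invented example triples:

```python
# Minimal forward-chaining materialization of two RDFS rules:
#   rdfs11: (A subClassOf B) & (B subClassOf C) => (A subClassOf C)
#   rdfs9:  (x type A) & (A subClassOf B)       => (x type B)
# Toy data; real stores apply many more rules over millions of triples.

SUBCLASS, TYPE = "rdfs:subClassOf", "rdf:type"

def materialize(triples):
    """Apply the two rules until no new triple is derived (fixed point)."""
    closure = set(triples)
    changed = True
    while changed:
        changed = False
        new = set()
        sub = {(s, o) for s, p, o in closure if p == SUBCLASS}
        # rdfs11: subclass transitivity
        for a, b in sub:
            for b2, c in sub:
                if b == b2 and (a, SUBCLASS, c) not in closure:
                    new.add((a, SUBCLASS, c))
        # rdfs9: type propagation along subclass edges
        for s, p, o in closure:
            if p == TYPE:
                for a, b in sub:
                    if o == a and (s, TYPE, b) not in closure:
                        new.add((s, TYPE, b))
        if new:
            closure |= new
            changed = True
    return closure

graph = {
    ("ex:Poodle", SUBCLASS, "ex:Dog"),
    ("ex:Dog", SUBCLASS, "ex:Animal"),
    ("ex:rex", TYPE, "ex:Poodle"),
}
inferred = materialize(graph)
print(("ex:rex", TYPE, "ex:Animal") in inferred)  # True
```

The naive nested loops make the cost visible: each pass re-scans the graph, which is why production reasoners rely on indexed, incremental rule evaluation.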
Semantics for integrated laboratory analytical processes - The Allotrope Pers... (OSTHUS)
The software environment currently found in the analytical community consists of a patchwork of incompatible software and proprietary, non-standardized file formats, further complicated by incomplete, inconsistent and potentially inaccurate metadata. To overcome these issues, the Allotrope Foundation is developing a comprehensive and innovative framework consisting of metadata dictionaries, data standards, and class libraries for managing analytical data throughout its lifecycle. The talk describes how laboratory data and their semantic metadata descriptions are brought together to ease the management of the vast amounts of data that underpin almost every aspect of drug discovery and development.
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ... (Ontotext)
This webinar continues a series demonstrating how linked open data and semantic tagging of news can be used for comprehensive media monitoring and for market and business intelligence. The platform for the demonstrations is FactForge: a hub for news and data about people, organizations, and locations (POL). FactForge embodies a big knowledge graph (BKG) of more than 1 billion facts that supports various analytical queries, including tracing suspicious patterns of company control and media monitoring of people, including companies owned by them, their subsidiaries, etc.
Allotrope foundation vanderwall_and_little_bio_it_world_2016 (OSTHUS)
Allotrope Foundation is building a framework (a software toolkit) to embed a set of federated, public, non-proprietary standards for analytical data in software used throughout the entire analytical chemistry data lifecycle. The framework serves as a basis for providing controlled vocabularies and taxonomies for a variety of pharmaceutical and biotech R&D applications, and offers extended capabilities to build business rules and other analytics on top of the standardized vocabularies, giving companies enhanced abilities to classify and manage their data. Legacy systems can be maintained more easily, and new technologies including cloud databases, Big Data analytics, and reasoning engines can be employed to give researchers unprecedented access to important contextualized data, because the foundational class structure is common and highly extensible to new and expanding domains.

We will briefly describe some of the current data integration and management challenges facing the industry, e.g. utilization of legacy data warehouses, the creation of new data lakes, integration of existing semantic models, and cloud-scale applications, and how the Allotrope Framework provides a semantic basis for improved metadata and master data management through modularized semantic models that capture the most pertinent entities, attributes and relationships needed to represent the plethora of laboratory data. We will provide an update on the rapid progress of development and the release of Allotrope Framework 1.0, including the Allotrope Data Format (for data and semantically described metadata), Allotrope Taxonomies, and the first release of APIs (application programming interfaces), and how Allotrope Member companies have begun to integrate these into their internal environments.
We will then discuss some of the potential extensions of this framework, which, in the future, could enable state-of-the-art data integration and analytics capabilities for various applications.
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers... (OSTHUS)
The software environment currently found in the analytical community consists of a patchwork of incompatible software, proprietary and non-standardized file formats, which is further complicated by incomplete, inconsistent and potentially inaccurate metadata. To overcome these issues, Allotrope Foundation is developing a comprehensive and innovative framework consisting of metadata dictionaries, data standards, and class libraries for managing analytical data throughout its life cycle. In this talk we describe how laboratory data and semantic metadata descriptions are brought together to ease the management of a vast amount of data that underpins almost every aspect of drug discovery and development.
The OKE Challenge was hosted at the European Semantic Web Conference (ESWC), held 3-7 June 2018 in Heraklion, Crete, Greece (Aldemar Knossos Royal & Royal Villa).
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
The use of R statistical package in controlled infrastructure. The case of Cl... (Adrian Olszewski)
Facts and myths about the use of the R statistical package in controlled, validated environments, using the example of Clinical Research in the pharmaceutical industry. This is the first part, constituting the introduction; technical details will be presented in Part II.
This document was presented at a conference organized by Polish National Group of the International Society for Clinical Biostatistics.
ICIC 2013 Conference Proceedings Nicolas Lalyre Syngenta (Dr. Haxel Consult)
Do indexing systems of bibliographic databases meet today’s user needs and expectations?
Nicolas Lalyre (Syngenta, Switzerland)
Gerhard Fischer (Syngenta, Switzerland)
The ever-increasing complexity of search requests and volume of documents, the need for value-added information delivery, and short timelines require appropriate indexing systems for effective information retrieval and evaluation. Indexing systems were developed decades ago, and the question is whether their development has kept pace with today's needs and expectations. The increased use of full-text databases might be an alarming sign for bibliographic database producers to update their indexing systems.
Some indexing systems lack transparency, as they are disclosed only in part or not at all, which may hinder their usability and further development.
FAIR Workflows: A step closer to the Scientific Paper of the Future (dgarijo)
Keynote presented at the Computational and Autonomous Workflows workshop (CAW-2021) at the Oak Ridge National Laboratory. The keynote gives an overview of the different aspects to take into account when aiming to create FAIR workflows and associated resources.
Making the data available with AMBIT cheminformatics platform (Nina Jeliazkova)
Recent developments in AMBIT (http://ambit.sf.net), presented by Dr. Nickolay Kochev at the EBI Industry workshop on Molecular Informatics Open Source Software (MIOSS), May 18-19.
http://www.ebi.ac.uk/industry/workshops
SWIMing VoCamp 2016 - ifcOWL overview and current state (Pieter Pauwels)
Presentation at the 2016 SWIMing VoCamp on 22-23 March 2016 in Dublin (http://phaedrus.scss.tcd.ie/buildviz/workshop/vocamp/march2016/): "ifcOWL overview and current state".
FOOPS!: An Ontology Pitfall Scanner for the FAIR principles (dgarijo)
Slides presented at the DBpedia Day at the SEMANTiCS conference in 2021. FOOPS! (available at https://w3id.org/foops) is a validator based on the FAIR principles that guides users in conforming their ontologies to them. For each principle, FOOPS! runs a series of tests and reports errors, suggestions, and ways to conform to best practices.
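The test-per-principle idea can be pictured with a toy checker. The check names and metadata fields below are hypothetical illustrations, not FOOPS!'s actual tests or API:

```python
# Toy FAIR-style pitfall scanner: run a list of named checks against
# ontology metadata and collect pass/fail results with suggestions.
# All check names and metadata fields here are illustrative only.

def check_license(meta):
    return bool(meta.get("license")), "Declare a license (e.g. with dcterms:license)."

def check_version_iri(meta):
    return bool(meta.get("versionIRI")), "Provide an owl:versionIRI."

def check_resolvable_ns(meta):
    ns = meta.get("namespace", "")
    return ns.startswith("http"), "Use a resolvable HTTP(S) namespace."

CHECKS = {
    "has_license": check_license,
    "has_version_iri": check_version_iri,
    "resolvable_namespace": check_resolvable_ns,
}

def scan(meta):
    """Run every check; attach a suggestion only when a check fails."""
    report = {}
    for name, check in CHECKS.items():
        ok, suggestion = check(meta)
        report[name] = {"ok": ok, "suggestion": None if ok else suggestion}
    return report

report = scan({"namespace": "https://example.org/onto#", "license": "CC-BY-4.0"})
```

Here the hypothetical ontology passes the license and namespace checks but fails the version-IRI check, so the report carries a concrete suggestion for that one failure.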
ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r... (Dr. Haxel Consult)
Christopher Southan (The IUPHAR/BPS Guide to PHARMACOLOGY, UK)
While the raison d'être of patents is Intellectual Property (IP), there is a growing awareness of the scientific value of their data content. This is particularly so in medicinal chemistry and associated bioactivity domains, where disclosed compounds and associated data not only exceed those published in papers several-fold and surface years earlier, but are also, paradoxically, completely open (i.e. no paywalls). Scientists have traditionally extracted their own relationships or used commercial sources, but the last few years have seen a “big bang” in patent extractions submitted to open databases, including nearly 20 million structures now in PubChem.
This tutorial will:
- Outline the statistics of patent chemistry in various open sources
- Introduce a spectrum of open resources and tools
- Enable an understanding of target identification, bioactivity and SAR extraction from patents, and of connecting these relationships to papers
- Cover aspects of medicinal chemistry patent mining
- Include hands-on exercises using open-source antimalarial research as examples
The focus will be on public databases and patent office portals, since these can be transparently demonstrated. However, the essential complementarity with commercial resources will be touched on. Those engaged in Competitive Intelligence will also find the material relevant.
ICIC 2017: Freeware and public databases: Towards a Wiki Drug Discovery? (Dr. Haxel Consult)
Fernando Huerta (RISE Bioscience & Materials, SE)
Alexander Minidis (Collaborative Drug Discovery - CDD VAULT, Sweden)
How much information do scientists need to design new potential drugs?
A thorough overview of public scientific information sources (open access), and of methods to collect, process, analyse and visualize this information, will be presented. A direct application of such freely available information, in conjunction with freeware, will be described in relation to the efforts of the scientific community to find effective medicines for the Zika virus.
SOMEF: a metadata extraction framework from software documentation (dgarijo)
Presentation given at the Council of Software Registries in March 2021. SOMEF is a Python package for automatically extracting over 25 metadata categories from a README file. The output is then exported in JSON, or in JSON-LD using the CodeMeta representation.
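The general idea of mining metadata categories out of a README can be sketched with a header-based toy extractor. This is an illustrative sketch only, not SOMEF's actual implementation or API; the header-to-category mapping and the sample README are invented:

```python
# Toy metadata extractor: map README section headers to metadata
# categories and emit a JSON document. Illustrative only; the real
# SOMEF package covers 25+ categories with far richer techniques.
import json
import re

CATEGORY_HEADERS = {      # hypothetical header -> category mapping
    "installation": "installation",
    "usage": "usage",
    "license": "license",
    "citation": "citation",
}

def extract(readme_text):
    """Assign each non-empty line to the category of its last seen header."""
    sections, current = {}, None
    for line in readme_text.splitlines():
        m = re.match(r"#+\s*(.+)", line)
        if m:
            current = CATEGORY_HEADERS.get(m.group(1).strip().lower())
        elif current and line.strip():
            sections.setdefault(current, []).append(line.strip())
    return {cat: " ".join(body) for cat, body in sections.items()}

readme = """# mytool
A toy project.
## Installation
pip install mytool
## License
MIT
"""
metadata = extract(readme)
print(json.dumps(metadata))
```

Text under unmapped headers (like the project title here) is simply dropped; a real extractor would classify such excerpts rather than rely on exact header matches.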
On chemical structures, substances, nanomaterials and measurements (Nina Jeliazkova)
"On chemical structures, substances, nanomaterials and measurements"
Nina Jeliazkova, Ideaconsult
This talk attempts to highlight how I came to recognize the fundamental role of measurements, coming from the realm of data modelling and data analysis. Besides retaining data provenance, it provides insights into how we go beyond chemical structures and address the challenges of representing the identity of chemical substances and nanomaterials (with examples from the latest developments of the AMBIT web services and the OpenTox API). Finally, supporting the vision of a distributed, open, web-like approach to recording subtle experimental details is essential, not only for the chemists and biologists in the labs, but for all of us using, modelling, storing and querying the data.
Presented July 14, 2014 in Cambridge, UK
Defining the Future for Open Notebook Science – A Memorial Symposium Celebrating the Work of Jean-Claude Bradley
http://inmemoriamjcb.wikispaces.com/Jean-Claude+Bradley+Memorial+Symposium
The concept of the new STN platform
Jan Baur (FIZ Karlsruhe, Germany)
A new STN platform is being designed and developed to turn the needs and priorities of today's patent experts into a state-of-the-art search and analysis system that works the way users do.
This presentation describes how the project-oriented workflow concept of the new STN leads to improved workflow continuity and efficiency, and how the new platform allows users to interact simultaneously with their queries and search results. A variety of innovative functional aspects of the new STN that improve efficiency, such as instant analysis and refinement capabilities, and vastly increased power for text and chemical structure searches, will be showcased.
This presentation was made during the tutorial "A practical introduction to Linked Data: lift tour data", organised by IGN (France), EURECOM, UPM, and Ordnance Survey within the INSPIRE conference in Istanbul, June 23-27.
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:... (OSTHUS)
The Allotrope Foundation is a consortium of major pharmaceutical companies and a partner network whose goal is to address challenges in the pharmaceutical industry by providing a set of public, non-proprietary standards for using and integrating analytical laboratory data. Current challenges in data management within the pharmaceutical industry often center around inconsistent or incomplete data and metadata and proprietary data formats. Because of a lack of standardization, several operations (e.g. integration of instruments and applications, transfer of methods or results, archiving for regulatory purposes) require unnecessary effort. Further, higher-level aggregations of data, e.g. regulatory filings, that are derived from multiple sources of laboratory data are costly to create. These unnecessary costs impact operations within a company’s laboratories, between partnering companies, and between a company and contract research organizations (CROs). Finally, the accelerating transition of laboratories from hybrid (paper + electronic) to purely electronic data streams, coupled with ever-increasing regulatory scrutiny of electronic data management practices, further requires a comprehensive solution. This talk will discuss how the Allotrope Foundation is providing a new framework for data standards through collaboration between numerous stakeholders.
The MOCHA Challenge was hosted at the European Semantic Web Conference (ESWC), held 3-7 June 2018 in Heraklion, Crete, Greece (Aldemar Knossos Royal & Royal Villa).
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe... (Carole Goble)
Over the past 5 years we have seen a change in expectations for the management of all the outcomes of research – that is, the “assets” of data, models, codes, SOPs and so forth. Don’t stop reading. Data management isn’t likely to win anyone a Nobel prize. But publications should be supported and accompanied by data, methods, procedures, etc. to assure reproducibility of results. Funding agencies expect data (and increasingly software) management, retention and access plans as part of the proposal process for projects to be funded. Journals are raising their expectations of the availability of data and codes for pre- and post-publication. The multi-component, multi-disciplinary nature of Systems Biology demands the interlinking and exchange of assets and the systematic recording of metadata for their interpretation.
The FAIR Guiding Principles for scientific data management and stewardship (http://www.nature.com/articles/sdata201618) have been an effective rallying cry for EU and USA research infrastructures. The FAIRDOM (Findable, Accessible, Interoperable, Reusable Data, Operations and Models) Initiative has 8 years of experience of asset sharing and data infrastructure, ranging across European programmes (the SysMO and EraSysAPP ERANets), national initiatives (de.NBI, the German Virtual Liver Network, UK SynBio centres) and PIs' labs. It aims to support Systems and Synthetic Biology researchers with data and model management, with an emphasis on standards smuggled in by stealth and sensitivity to asset sharing and credit anxiety.
This talk will use the FAIRDOM Initiative to discuss the FAIR management of data, SOPs, and models for Sys Bio, highlighting the challenges of and approaches to sharing, credit, citation and asset infrastructures in practice. I'll also highlight recent experiments in affecting sharing using behavioural interventions.
http://www.fair-dom.org
http://www.fairdomhub.org
http://www.seek4science.org
Presented at COMBINE 2016, Newcastle, 19 September.
http://co.mbine.org/events/COMBINE_2016
EURISCO needs and priorities, at CGIAR ICT-KM Workshop, IPGRI, Rome (2005) (Dag Endresen)
European genebanks, EURISCO and NGB. Overview of needs and priorities. CGIAR ICT-KM training workshop on information interoperability, 14th June 2005, IPGRI Rome Italy. Dag Endresen (Nordic Gene Bank).
Presentation of the paper "Primers or Reminders? The Effects of Existing Review Comments on Code Review" published at ICSE 2020.
Authors:
Davide Spadini, Gül Calikli, Alberto Bacchelli
Link to the paper: https://research.tudelft.nl/en/publications/primers-or-reminders-the-effects-of-existing-review-comments-on-c
This slide deck provides an overview of the WSO2 Big Data platform and discusses some of its customer case studies and applications. It covers Big Data in general, real-time analytics with WSO2 CEP, batch analytics with WSO2 BAM, and new products such as predictive analytics with WSO2 Machine Learner. For more information, please reach us through architecture@wso2.org.
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)BigData_Europe
Overview of Open PHACTS, the BDE Pilot project in SC1, presented at BDE SC1 Workshop 3, 13 December, 2017.
https://www.big-data-europe.eu/the-final-big-data-europe-workshop/
Fairification experience clarifying the semantics of data matricesPistoia Alliance
This webinar presents the Statistics Ontology, STATO which is a semantic framework to support the creation of standardized analysis reports to help with review of results in the form of data matrices. STATO includes a hierarchy of classes and a vocabulary for annotating statistical methods used in life, natural and biomedical sciences investigations, text mining and statistical analyses.
WSO2 Machine Learner takes data one step further, pairing data gathering and analytics with predictive intelligence: this helps you understand not just the present, but to predict scenarios and generate solutions for the future.
Ontology-based data access: why it is so cool!Josef Hardi
A brief introduction to ontology-based data access (OBDA) and its core implementation. I also presented a recent simple benchmark comparing -ontop- and Semantika, two of the most readily available OBDA frameworks, in terms of query performance (with details in the appendix section). The slides were presented at the Friday Research Meeting of the Stanford Center for Biomedical Informatics Research (BMIR).
License: Creative Commons by Attribution 3.0
Querying a Complex Web-Based KB for Cultural Heritage PreservationEster Giallonardo
Monitoring the status of cultural heritage for its preservation is a complex activity that can be performed by human experts with the support of technology. This complexity is due to the need to consider several related aspects: parts of buildings, materials, damage, environmental conditions, risk factors, etc. Querying this complex knowledge base can benefit from a very expressive ontology and efficient technology support to reduce response times, especially when implicit knowledge needs to be inferred in Web applications. We adopt the Cidoc4HeriwarD ontology as a benchmark to analyse two different technologies for storing and querying semantic data: an open-source one based on relational databases and a commercial solution with a native triple store.
Similar to Benchmarking Commercial RDF Stores with Publications Office Dataset (20)
What do a consumer goods manufacturer and a credit insurance group have in common? Both are subject to a variety of risks which, if not detected, may dramatically impact their operations and bottom lines. Delve into the challenges of putting together a semantic, technology-based business solution that monitors and reacts to a large amount of consumer feedback in real time, providing insights on consumer product quality. Hear how this approach assists credit risk analysts in the early detection of signals and events affecting companies’ solvency to anticipate default risks of targeted companies. Walk through this journey to solve real-world problems with business intelligence solutions based on semantic data and technologies.
Slides of my PhD presentation @ Eurecom, presenting our work on publishing and consuming geo-spatial data and government data using Semantic Web technologies.
Slides of the talk during Terracognita 2014 in RIVA del GARDA, where the authors presented the description of ontologies for geometries, coordinate reference systems and publication of French Administrative Units on the Web. Paper can be downloaded at http://event.cwi.nl/terracognita2014/terra2014_1.pdf.
Slides of the paper presented at #COLD2014 available at http://ceur-ws.org/Vol-1264/cold2014_AtemezingT.pdf, on building a Linked-data Visualization Wizard.
Information Content based Ranking Metric for Linked Open VocabulariesGhislain Atemezing
This talk was presented in Leipzig during the SEMANTiCS 2014 Conference in September. It gives an overview of how Information Content Theory metrics can be applied to the Semantic Web, and especially to vocabularies. The results of the proposed ranking metrics can be applied in three areas: (i) vocabulary life-cycle management, (ii) semantic web visualizations and (iii) the interlinking process.
Harmonizing services for LOD vocabularies: a case studyGhislain Atemezing
This presentation describes a solution for aligning well-known services with the aim of managing and harmonizing vocabularies' metadata, with a special use case on prefix.cc.
The Internet of Things (IoT) is a revolutionary concept that connects everyday objects and devices to the internet, enabling them to communicate, collect, and exchange data. Imagine a world where your refrigerator notifies you when you’re running low on groceries, or streetlights adjust their brightness based on traffic patterns – that’s the power of IoT. In essence, IoT transforms ordinary objects into smart, interconnected devices, creating a network of endless possibilities.
Here is a blog on the role of electrical and electronics engineers in IoT. Let's dig in!
For more such content visit: https://nttftrg.com/
Using recycled concrete aggregates (RCA) for pavements is crucial to achieving sustainability. Implementing RCA for new pavement can minimize carbon footprint, conserve natural resources, reduce harmful emissions, and lower life cycle costs. Compared to natural aggregate (NA) pavement, RCA pavement has been the subject of fewer comprehensive studies and sustainability assessments.
6th International Conference on Machine Learning & Applications (CMLA 2024)ClaraZara1
6th International Conference on Machine Learning & Applications (CMLA 2024) will provide an excellent international forum for sharing knowledge and results in the theory, methodology and applications of Machine Learning.
Immunizing Image Classifiers Against Localized Adversary Attacksgerogepatton
This paper addresses the vulnerability of deep learning models, particularly convolutional neural networks (CNNs), to adversarial attacks and presents a proactive training technique designed to counter them. We introduce a novel volumization algorithm, which transforms 2D images into 3D volumetric representations. When combined with 3D convolution and deep curriculum learning optimization (CLO), it significantly improves the immunity of models against localized universal attacks by up to 40%. We evaluate our proposed approach using contemporary CNN architectures and the modified Canadian Institute for Advanced Research (CIFAR-10 and CIFAR-100) and ImageNet Large Scale Visual Recognition Challenge (ILSVRC12) datasets, showcasing accuracy improvements over previous techniques. The results indicate that the combination of the volumetric input and curriculum learning holds significant promise for mitigating adversarial attacks without necessitating adversarial training.
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...ssuser7dcef0
Power plants release a large amount of water vapor into the atmosphere through the stack. The flue gas can be a potential source of much-needed cooling water for a power plant. If a power plant could recover and reuse a portion of this moisture, it could reduce its total cooling water intake requirement. One of the most practical ways to recover water from flue gas is to use a condensing heat exchanger. The power plant could also recover latent heat due to condensation, as well as sensible heat due to lowering the flue gas exit temperature. Additionally, harmful acids released from the stack can be reduced in a condensing heat exchanger by acid condensation. Condensation of vapors in flue gas is a complicated phenomenon, since heat and mass transfer of water vapor and various acids occur simultaneously in the presence of non-condensable gases such as nitrogen and oxygen. The design of a condenser depends on knowledge and understanding of the heat and mass transfer processes. A computer program for numerical simulations of water (H2O) and sulfuric acid (H2SO4) condensation in a flue gas condensing heat exchanger was developed using MATLAB. Governing equations based on mass and energy balances for the system were derived to predict variables such as flue gas exit temperature, cooling water outlet temperature, mole fractions and condensation rates of water and sulfuric acid vapors. The equations were solved using an iterative solution technique with calculations of heat and mass transfer coefficients and physical properties.
Saudi Arabia stands as a titan in the global energy landscape, renowned for its abundant oil and gas resources. It's the largest exporter of petroleum and holds some of the world's most significant reserves. Let's delve into the top 10 oil and gas projects shaping Saudi Arabia's energy future in 2024.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. 
Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Dr.Costas Sachpazis
Terzaghi's soil bearing capacity theory, developed by Karl Terzaghi, is a fundamental principle in geotechnical engineering used to determine the bearing capacity of shallow foundations. This theory provides a method to calculate the ultimate bearing capacity of soil, which is the maximum load per unit area that the soil can support without undergoing shear failure. The Calculation HTML Code included.
We have compiled the most important slides from each speaker's presentation. This year’s compilation, available for free, captures the key insights and contributions shared during the DfMAy 2024 conference.
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Benchmarking Commercial RDF Stores with Publications Office Dataset
1. Mondeca in a nutshell | Benchmark Context | Publications Office of the EU Datasets | Benchmark Configuration | Query Analysis | Benchmark Results | Conclusion
Benchmarking Commercial RDF Stores with Publications Office Dataset
Ghislain Auguste Atemezing, Ph.D. 1
1. Mondeca, 35 Boulevard de Strasbourg, 75010, Paris, France
Twitter: @gatemezing
Web: http://www.mondeca.com
Benchmark material: https://github.com/gatemezing/posb
4th June 2018
Ghislain Auguste Atemezing, Ph.D MONDECA QuWeDa2018 @ ESWC 2018 04th June, 2018 1 / 24
Agenda
1 Mondeca in a nutshell
Who we are
Why do clients come to us
2 Benchmark Context
3 Publications Office of the EU Datasets
Data Workflow & Use cases
Ontology
Datasets
Requirements
4 Benchmark Configuration
Experimental set up
5 Query Analysis
Instantaneous Queries
Analytical Queries
Read/Write queries
6 Benchmark Results
7 Conclusion
8 Acknowledgements
Who we are
Mondeca in a nutshell
Located in Paris, France
Leading French semantic technology solution provider since 1999
SME: agile and flat structure
Our solution: Smart Content Factory combines data management + content annotation + semantic search
Major clients in publishing (e.g., Turner, AP, NPR), the insurance domain, the consumer goods industry, national governments and international organizations
Why do clients come to us
Mondeca in a nutshell
Why Benchmarking PO datasets?
1 To match the current and planned use cases of the Publications Office of the European Union (OP) against the current state of the art of RDF stores
2 To analyze in depth both the functional requirements and the documentation of 7 commercial RDF stores: Virtuoso, GraphDB, Neo4j, Stardog, Oracle, Blazegraph and Marklogic
3 To document and motivate the choice of a given RDF store based on key requirements defined internally after interviews
The end goal of the study is to identify the RDF store(s) that best match the OP's planned use cases and requirements in terms of scalability, stability and reliability.
Bench Context - OP
Publications Office of the European Union (OP)
publishes the daily Official Journal of the European Union in 23 official EU languages (24 when Irish is required)
produces and disseminates legal and general publications in a variety of paper and electronic formats
Online services
EUR-Lex 1: provides free access to European Union law
EU Bookshop: the online library and bookshop of publications from the institutions and other bodies of the EU
EU Open Data Portal: the single point of access to data from the institutions and other bodies of the European Union
Eurovoc: a multilingual, multidisciplinary thesaurus covering the activities of the EU
Whoiswho: the official directory of the EU
CORDIS: repository and portal for EU-funded research projects
1. http://eur-lex.europa.eu/
Data Workflow & Use cases
CELLAR is the semantic (RDF) repository at the OP, with the ODP store powering Linked Data applications.
Current RDF usage / wish-list
Volume: approximately 730 million triples
The RDF store has grown by 500 million triples in 2 years
OP foresees a volume of at least 1.5 billion triples within the next 2 years
Wish: handle 10x today's volume (ca. 7 billion triples)
OP receives 100k to 200k SPARQL queries per day, with strong growth; the target architecture must handle at least 2 million queries/day
Search via the browse-by-subject tab: http://publications.europa.eu/en/browse-by-subject
example: https://goo.gl/Yci9Nz
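For scale, the wish-list numbers above translate into sustained query rates; a quick back-of-the-envelope calculation (the daily volumes come from the slide, the per-second conversion is mine):

```python
# Convert the OP's daily SPARQL query volumes into sustained queries/second.
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400

current_low, current_high = 100_000, 200_000  # queries/day today
target = 2_000_000                            # queries/day minimum target

print(f"current: {current_low / SECONDS_PER_DAY:.1f}-{current_high / SECONDS_PER_DAY:.1f} q/s")
print(f"target:  {target / SECONDS_PER_DAY:.1f} q/s sustained")
```

So the target architecture must sustain roughly 23 queries/second around the clock, before accounting for traffic peaks.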
Ontology
OP Dataset - Ontology
Common Data Model (CDM)
CDM is the ontology used to generate the RDF dataset at the OP
CDM is based on the FRBR model to represent work, expression and manifestation
Instances in the PROD dataset
Dataset with 187 instantiated classes, covering 61% of CDM
4,958,220 blank nodes
Top 3 classes: cdm:item (4.77%), cdm:expression (4.52%) and cdm:manifestation (2.30%)
CDM ontology statistics
Metric                   Number
Class                       308
Object Property             803
Data Property               690
SubClassOf                  615
SubObjectProperty           485
InverseObjectProperty       248
SubDataProperty             405
DL Expressivity       ALHOIQ(D)
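Statistics like those in the table above are normally gathered with OWL tooling or a SPARQL endpoint; purely as an illustration, here is a minimal stdlib-only sketch that counts OWL declarations in an N-Triples serialization (a real CDM analysis would use a proper RDF parser; the sample triples are made up):

```python
from collections import Counter

OWL = "http://www.w3.org/2002/07/owl#"
RDF_TYPE = "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>"

def count_declarations(ntriples: str) -> Counter:
    """Count rdf:type declarations per OWL construct in an N-Triples string."""
    counts = Counter()
    for line in ntriples.splitlines():
        parts = line.strip().split(None, 2)  # subject, predicate, rest
        if len(parts) == 3 and parts[1] == RDF_TYPE:
            obj = parts[2].rstrip(" .")      # drop the trailing " ."
            if obj.startswith(f"<{OWL}"):
                counts[obj[len(OWL) + 1:-1]] += 1  # keep the local name
    return counts

# Hypothetical mini-ontology, not real CDM triples.
sample = """\
<http://ex.org/Work> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/2002/07/owl#Class> .
<http://ex.org/Expression> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/2002/07/owl#Class> .
<http://ex.org/realizes> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/2002/07/owl#ObjectProperty> .
"""
print(count_declarations(sample))  # Counter({'Class': 2, 'ObjectProperty': 1})
```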
Datasets
OP Dataset - Explicit knowledge
The values in the tables are explicit triples in the knowledge base.
Top five instances by class in the PROD dataset
Class                #Instances   Percentage
cdm:item             34,747,955         4.77
cdm:expression       32,898,325         4.52
cdm:manifestation    16,768,690         2.30
cdm:work              7,771,103         1.06
cdm:resource_legal    7,674,632         1.05

Size of dump datasets
Dataset name            Disk size   #Files      #Triples   RDF format
Normalized (.zip)          226 GB    2,195   727,442,978   NQUADS
Non-normalized (.tgz)       12 GB       64   728,163,464   NQUADS
NAL dataset                282 MB       72       402,926   RDF/XML
Requirements
RDF Stores Killer Requirements
Blazegraph v2.1.4 Open Edition
Poor results in the early stage of the benchmark: (i) too slow in loading data (90h 43min, almost 4 days!)
Too many time-outs (15) in the first test on queries from category 1
No support at all on repeated requests from them to improve results or validate our configuration file
Neo4j
All loading tests aborted after 40h 27min [2]
Need to port the code (ad-hoc RDF import) for each Neo4j upgrade: blueprints, tinkerpop, gremlin
Too much maintenance on this stack
2. A work in progress with Neo4j engineers to improve our ad-hoc RDF import loader
Experimental set up
Bench Configuration
Hardware Server
CPU: Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz, 6C/12T
RAM: 128 GB; Disk capacity: 4 TB SATA
Operating System: CentOS 7, 64 bits, with Java 1.8.0 running
Marklogic FO
CPU: Intel(R) Xeon(R) E3 1245 v5 4C/8T @ 3.5GHz
RAM: 64 GB; Disk storage: 3 × 500 GB SSD
Tools for benchmark
JENA qparse tool to validate all the queries
Open-source tool SPARQL Query Benchmarker 3, used with 20 runs per category to warm up the server and 5 runs for the actual benchmark
3. https://github.com/rvesse/sparql-query-bm
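The warm-up/measured-run protocol behind the SPARQL Query Benchmarker can be pictured with a small timing harness. This is a sketch only: `run_query` is a hypothetical stand-in for an HTTP call to the store's SPARQL endpoint, and the run counts mirror the 20 warm-up / 5 measured runs above:

```python
import time
from statistics import mean

def benchmark_mix(run_query, queries, warmup_runs=20, measured_runs=5):
    """Time a query mix: discard warm-up runs, average only the measured ones."""
    for _ in range(warmup_runs):          # warm caches, JIT, buffer pools
        for q in queries:
            run_query(q)
    mix_times = []
    for _ in range(measured_runs):        # only these runs are reported
        start = time.perf_counter()
        for q in queries:
            run_query(q)
        mix_times.append(time.perf_counter() - start)
    avg = mean(mix_times)
    # QMpH (query mixes per hour) is the headline metric of the tool.
    return {"avg_mix_seconds": avg, "qmph": 3600.0 / avg if avg > 0 else float("inf")}

# Usage with a dummy executor (a real run would POST each query over the SPARQL Protocol):
stats = benchmark_mix(lambda q: None, ["SELECT ...", "DESCRIBE ..."], warmup_runs=2, measured_runs=3)
print(stats)
```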
Experimental set up
Triple stores setup
Virtuoso
NumberOfBuffers = 5450000 and MaxDirtyBuffers = 4000000
Stardog
Set Java heap size = 16GB and MaxDirectMemorySize = 8GB.
Deactivation of the strict parsing option, SL option by default
GraphDB
Set entity index size to 500000000 with entity predicate list enabled,
Disabling the content index.
Oracle
pga_aggregate_limit = 64GB and pga_aggregate_target = 32G
Instantaneous Queries
Query Analysis
FIGURE – Queries of Category 1
20 instantaneous queries
Query form   #Total
SELECT           16
DESCRIBE          3
CONSTRUCT         1
Analytical Queries
Query Analysis
FIGURE – Queries of Category 2
Analytical queries
24 queries
Query form: SELECT
Read/Write queries
Query Analysis
All the queries were gathered and developed by OP's metadata teams.
The queries were originally optimized for Virtuoso.
The results of the quantitative benchmark are therefore probably biased in favor of the incumbent triple store.
To remove this bias, we asked the other vendors to provide us with queries optimized for their engines.
We present the results of the quantitative study, which is part of a broader study covering 66 functional requirements.
Bulk loading
PROD dataset (727 million triples): ranking order -> Virtuoso (3.8h), Stardog (4.59h), Marklogic (5.83h), Oracle (23.07h) and GraphDB (35.64h). Oracle later optimized down to 8h!
2 billion triples 4: ranking order -> Virtuoso (13.01h), Stardog (13.30h), GraphDB (17.46h), Marklogic (27.96h) and Oracle (43.7h). Oracle later optimized down to 32h!
5 billion triples: ranking order -> Virtuoso (36.10h), GraphDB (44.14h), Marklogic (169.95h), Stardog (unsuccessful), Oracle (N/A)
4. Generated by postfixing resources of type publications.europa.eu/resource/cellar
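To put the loading times in perspective, they can be converted into throughput. A quick calculation from the PROD figures above (727,442,978 triples; hours as reported on the slide):

```python
# Approximate bulk-loading throughput on the PROD dataset.
TRIPLES = 727_442_978
hours = {"Virtuoso": 3.8, "Stardog": 4.59, "Marklogic": 5.83,
         "Oracle": 23.07, "GraphDB": 35.64}

for store, h in hours.items():
    print(f"{store:9s} {TRIPLES / (h * 3600):>10,.0f} triples/s")
```

Virtuoso's 3.8h run corresponds to roughly 53,000 triples/second, almost an order of magnitude faster than the slowest loader.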
Results Category 1 - time out = 60s
Virtuoso is faster than all the other triple stores.
No time-outs with Virtuoso; Marklogic (1 time-out), Oracle (2), Stardog (2), GraphDB (4) and Blazegraph (15).
Stardog performs poorly compared to GraphDB and Oracle.
Blazegraph was removed after this test.
Marklogic is NOT consistent in multithreading.
Results Category 2 - time out = 600s
Virtuoso is faster than all the other triple stores.
No time-outs with Virtuoso, GraphDB and Marklogic.
1 timed-out query (Q10) with Oracle.
4 timed-out queries (Q15, Q16, Q19, Q22) with Stardog.
Bench analytical queries ranking
RDF Store          #Time-outs   Rank
Virtuoso                    0      1
GraphDB EE RDFS+            0      2
GraphDB EE                  0      3
Oracle 12c                  1      4
Marklogic                   0      5
Stardog                     4      6
Results Category 3 - time out = 10s
1 CONSTRUCT, 1 DELETE/INSERT and 3 INSERT queries.
Virtuoso is fastest, followed by Marklogic and GraphDB.
Oracle performs worst in single-thread mode.
Stardog and Oracle scores are significantly lower than those of Marklogic and GraphDB.
Results Category 3 - time out = 10s (continued)
Oracle performs better in the multithreaded scenario. Why? Possibly index/disk calibration.
Stardog is constant in the magnitude of QMpH.
GraphDB and Marklogic show significant changes from 5 clients to 20 clients.
Stability Test
Stress test on the triple stores using instantaneous queries (category 1).
The test starts with the number of parallel clients set to 128.
Each client completes a run of the query mix in parallel.
The number of parallel clients is then multiplied by 2 and the process is repeated.
This continues until either the maximum runtime (180 min) or the maximum number of threads is reached.
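The doubling procedure above can be sketched as a loop. Illustrative only: `run_mix_with_clients` is a hypothetical stand-in for launching the parallel benchmark clients and collecting their error counts:

```python
import time

def stress_test(run_mix_with_clients, start_clients=128, max_clients=256,
                max_runtime_s=180 * 60):
    """Double the number of parallel clients until the runtime or thread cap is hit."""
    deadline = time.monotonic() + max_runtime_s
    clients = start_clients
    results = []
    while clients <= max_clients and time.monotonic() < deadline:
        # Each level records e.g. error counts and latencies for that client count.
        results.append((clients, run_mix_with_clients(clients)))
        clients *= 2
    return results

# Dummy runner: pretend each level reports an error count.
levels = stress_test(lambda n: {"errors": 0}, start_clients=128,
                     max_clients=256, max_runtime_s=60)
print([c for c, _ in levels])  # [128, 256]
```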
Result of the Stress Test
Stardog and Oracle finished at the limit of parallel threads.
Virtuoso and GraphDB completed the test after 180 min, reaching 256 parallel threads.
GraphDB shows fewer errors than Virtuoso.
GraphDB appears to be the most stable, followed in order by Stardog, Oracle and Virtuoso.
What We learned
Lessons learned
3 out of the 5 RDF stores come close to the key requirements (robustness, scalability, reliability and stability)
None of the RDF stores perfectly matches OP's business cases
When pushed to their limits, all of the RDF stores require extensive support from the vendors (e.g., the cases of Oracle 12c and GraphDB)
Conclusions
We have presented a quantitative comparison of 5 commercial RDF stores (Virtuoso, GraphDB, Oracle, Stardog and Marklogic) based on OP datasets and requirements.
The results show that Virtuoso and Stardog are the fastest at bulk loading.
Virtuoso outperforms GraphDB, Stardog and Oracle, in that order, in query-based performance.
GraphDB is the winner of the stability test performed in this benchmark.
This study gives an overview of the current state of RDF store performance with respect to the OP's dataset.
This work can be partly reused to assess enterprise RDF stores.
We plan to obtain query rewrites from all the store vendors and evaluate the results.
We also plan to run the same benchmark on AWS Neptune 5.
We plan to compare this work more closely with state-of-the-art benchmarking, possibly using the IGUANA framework.
5. http://blog.mondeca.com/2018/02/09/requetes-sparql-avec-neptune/
Acknowledgements
We would like to thank the RDF teams at Ontotext, Stardog Union, Oracle, Marklogic and OpenLink.