SlideShare a Scribd company logo
1 of 21
Download to read offline
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Graph Databases Lifecycle Methodology 
and Tool to Support Index/Store 
Versioning
Pierfrancesco Bellini, Ivan Bruno, Paolo Nesi, Nadia Rauch
DISIT Lab
Dipartimento di Ingegneria dell’Informazione, DINFO
Università degli Studi di Firenze
Via S. Marta 3, 50139, Firenze, Italy
Tel: +39-055-2758511, fax: +39-055-2758570
http://www.disit.dinfo.unifi.it alias http://www.disit.org
Paolo.nesi@unifi.it
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 1
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Context and Motivations
• Graph database are taking place for systems exploiting 
knowledge base, KB
– Include a set of ontologies and data instances: static data, 
reconciliation data, real time data, historical data, gelocated
data, etc. 
– Smart city, smart cloud, smart learning, etc.
• KB need complex and non consolidated methodologies 
for their implementation
– Many issues may lead to invalidate a building in favor of new 
version, e.g., looking for more inference, corrections in model 
ontology, changes in historical data, etc.
– Methodologies present iterative processes that lead to 
restructure/rebuild the knowledge base 
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 2
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Context and Motivations (2)
• RDF stores (end they end points) are a way for 
Knowledge Base implementation 
• RDF stores see Ontologies and Instances as triple 
(quadruple) s‐p‐o (context)
• RDF stores presents several problems in managing:
– Versioning and thus deleting triples including the 
inferred triples… at real time performance ?
– Performance in db store building (inference)
– Performance in db store querying (inference)
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 3
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
The paper contributions
• A methodology for knowledge base life cycle 
building and improvement
– addressing more details and data kind and cases of 
the state of the art methodologies 
• A versioning system for RDF store building 
supporting the methodology
– Completely new tool that can save up to 90% of time 
in RDF rebuilding
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 4
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
RDF store service: building process
• built incrementally via progressive refinements 
mediating among
– reusing ontological models,
– increasing the capability of making deductions and 
reasoning on the knowledge base concepts,
– maintaining acceptable: query and rendering 
performances, 
– simplifying the design of the front‐end services, 
– flexibility to the arrival of additional data and models 
and/or corrections, 
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 5
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
RDF KB life cycle methodology
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 6
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Ontology Construction
• Domain Analysis: 
– concepts, abstractions, aggregation, 
classes vs attributes, etc. 
• Available Ontologies
– Diffusion, Licensing, quality, inference, 
etc.
• Ontology Integration
– Glued concepts, more inference 
• Ontology review
– Conceptual assessment
– Formal verification with metrics and 
tools
• Points from which one has to restart, 
have to be saved
• Integrated version of the ontology goes 
IN USE
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 7
In use
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Km4city example
• reuses: 
– dcterms to set of properties and classes for modeling 
metadata; 
– foaf dedicated to relations among people or groups; 
– schema.org for a description of people and organizations; 
– wgs84_pos representing latitude and longitude; 
– GoodRelations for a description of business entities and their 
locations; 
– OWL‐Time for temporal modeling; 
– OTN for transport aspects; 
– GIS Dictionary, to represent the spatial component of 
geographic features
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 8
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Data ingestion and enrichment
• Static Data can be: 
– Open Data, 
– Historical real time data (from 
– to)
– Enrichment data (to be 
identified)
– Reconciliation Data (to be 
identified)
• Problems may derive from: 
– Inconsistencies instance level ?
– Incompleteness: missing onto 
concepts, missing links, …
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 9
In use
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Real Time Data
• Need to be already verified and 
reconciled 
– No changes in their structure
– No or known variability on 
instances
• May produce large volume of 
Cumulated data 
– can become the substantial part of 
the KB
– cannot be deleted
– Cannot be easily extracted from 
the RDF store, thus historical data 
may be saved for rebuilding
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 10
In use
Histori
cal
triples
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
RDF Indexing Flow and Requirements
• There are several reasons to revise/change the 
RDF Indexing and thus the RDF Store itself 
–the ontology and thus the:
• data mapping and triplification invalidating the 
indexing and the materialization of triples
–the data triples coming from ingestion, 
historical, reconciliation, as: 
• a new data mapping, quality improvement, 
• changes in performing reconciliation, enrichment 
and triplification processes. 
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 11
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
RDF Index and RDF Index Descriptor
• The RDF Index is substantially the RDF Store 
containing the triple (ontology, static data, etc…)
• The RDF Index Descriptor is:
– the recipe to create the RDF Index and 
– the set of triples adopted to build it
• Since the version of the RDF Store is not an a 
viable task (without redesigning the RDF store) 
we performed the versioning of the RDF Index 
Descriptor
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 12
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Versioning: File and RDF Index
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 13
Contains reconciliation
of triples connecting
parking locations wrt
civic numbers depends
on the ontology and on
the parking area data
sets.
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Verification and Validation
• In a real production of big data RDF store, 
– hundreds of files containing triples are produced
– The restarting from scratch is time consuming, may 
be error prone, may lose the versioning / evolution 
value
• Not all changes can produce consistent RDF KB 
Store.
• The saving of intermediate consistent version 
can lead t o save time in exploiting the saved 
version as restarting point.
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 14
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Example: 4 versions on the same core
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 15
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
RDF Index Manager tool
• Keep tracing RDF KB Store Versions, RKBSV, in terms of files of triples, 
index‐description, and RDF Index;
• Maintaining a repository of RKBSVs where they could be stored and 
retrieved; 
• Selecting a RKBSV from the repository for modification, to examine 
changes and the history version, to be used as base for building a new 
version;
• Managing the index descriptor as a list of files containing triples;
• Generating a RDF KB index on the basis of an RKBSV independently 
from the RDF store kind automatically, and in particular for SESAME 
OWLIM and Virtuoso;
• Monitoring the RDF KB index generation and the feeding state;
• Suggest the closest version of the RKBSV with respect to the 
demanded new index in terms of files of triples;
• Avoiding manually managing the script file of indexing, since it is time 
consuming and an error prone process.
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 16
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
RIM tool: building monitor during production
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 17
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
RIM tool: during RDF assessment
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 18
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Experimental Results
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 19
Ontologi
es
+ street 
graphs
+ smart city 
Services
+Enrich& 
Reconciliations
+Historical 
data 1 month
Indexing process
Final number of triples 15.809 33.547.501 34.462.930 34.557.142 44.218.719
Final number of Files 12 137 178 185 27794
Added triples with respect to previous version 15809 33.531.692 915.429 94.212 9.661.577
Added Files with respect to previous version 12 125 41 7 27609
OWLIM SE 4.3
Indexing Time without RIM (s) 18 6536 6198 7516 12093
Indexing Time with RIM (s) 11 6029 514 343 5745
% of saved time,  RIM versioning 38,9 7,8 91,7 95,4 52,5
Final Number of triples (including geo + 
inferred) 16062 57.486.956 59.395.432 59.486.748 73.441.126
disk space in Mbyte 310 8669 8936 9039 13110
VIRTUOSO 7.2
Indexing Time without RIM (s) 146 806 964 1000 2487
Indexing Time with RIM (s) 156 833 421 296 1932
% of saved time,  RIM versioning ‐6,8 ‐3,3 56,3 70,4 22,3
Final Number of triples (including geo, no 
inferred) 21.628 35.452.613 36.301.322 36.420.445 46.232.510
disk space in Mbyte 68 1450 1632 1631 2294
GraphDB 6.1
Indexing Time without RIM (s) 9 7818 7929 7671 12915
Indexing Time with RIM (s) 2 6791 454 214 4849
% of saved time,  RIM versioning 77,8 13,1 94,3 97,2 62,45
Final Number of triples (including geo + 
inferred) 15.809 57.486.415 59.394.891 59.487.551 73.441.929
disk space in Mbyte 96 4276 4466 4643 5714
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
New Version
• Support Ontology Licensing
– To take into account in ontology building and in KB RDF 
store usage, querying
• Support Data Licensing
• To help selecting ontologies
• To take into account in KB RDF store usage, querying
• Support Data ingestion process with 
– integrated Data Ingestion Manager
– Maintaining under control the data sets to be included, 
their licenses, triplification, etc. 
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 20
DISIT Lab, Distributed Data Intelligence and Technologies
Distributed Systems and Internet Technologies
Department of Information Engineering (DINFO)
http://www.disit.dinfo.unifi.it
Conclusions
• The RIM model and tool allow:
– Keeping under control and trace the RDF life cycle 
from construction of the ontology to the indexing 
and validation. 
– Reducing the time for indexing (RDF store 
construction) up to the 97% in some cases.
The benefits have been demonstrated for the 
most diffused RDF stores: OWLIM, GraphDB and 
Virtuoso.
DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 21

More Related Content

What's hot

Archaeology and cultural heritage application working group
Archaeology and cultural heritage application working groupArchaeology and cultural heritage application working group
Archaeology and cultural heritage application working groupManolis Vavalis
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Research Data Alliance
 
LinkedUp - Linked Data & Education
LinkedUp - Linked Data & EducationLinkedUp - Linked Data & Education
LinkedUp - Linked Data & EducationStefan Dietze
 
Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...Christoph Lange
 
Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...butest
 
Combining a co-occurrence-based and a semantic measure for entity linking
Combining a co-occurrence-based and a semantic measure for entity linkingCombining a co-occurrence-based and a semantic measure for entity linking
Combining a co-occurrence-based and a semantic measure for entity linkingBesnik Fetahu
 
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept AnalysisExtracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept AnalysisMathieu d'Aquin
 
Semantic Web / Linked Data Technologies
Semantic Web / Linked Data TechnologiesSemantic Web / Linked Data Technologies
Semantic Web / Linked Data TechnologiesMathieu d'Aquin
 
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the WebRetrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the WebStefan Dietze
 
Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Mathieu d'Aquin
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013Stefan Dietze
 
Doing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebDoing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebMathieu d'Aquin
 
Publications Fatima Dargam - 09.05.2016
Publications  Fatima Dargam - 09.05.2016Publications  Fatima Dargam - 09.05.2016
Publications Fatima Dargam - 09.05.2016Fatima Dargam
 

What's hot (16)

Archaeology and cultural heritage application working group
Archaeology and cultural heritage application working groupArchaeology and cultural heritage application working group
Archaeology and cultural heritage application working group
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
 
LinkedUp - Linked Data & Education
LinkedUp - Linked Data & EducationLinkedUp - Linked Data & Education
LinkedUp - Linked Data & Education
 
Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...
 
Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...Advanced Probabilistic Modeling Algorithms for Clustering ...
Advanced Probabilistic Modeling Algorithms for Clustering ...
 
Combining a co-occurrence-based and a semantic measure for entity linking
Combining a co-occurrence-based and a semantic measure for entity linkingCombining a co-occurrence-based and a semantic measure for entity linking
Combining a co-occurrence-based and a semantic measure for entity linking
 
Towards Knowledge-Enabled Society
Towards Knowledge-Enabled SocietyTowards Knowledge-Enabled Society
Towards Knowledge-Enabled Society
 
CV
CVCV
CV
 
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept AnalysisExtracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
 
Semantic Web / Linked Data Technologies
Semantic Web / Linked Data TechnologiesSemantic Web / Linked Data Technologies
Semantic Web / Linked Data Technologies
 
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the WebRetrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
 
Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013
 
10 problems 06
10 problems 0610 problems 06
10 problems 06
 
Doing Clever Things with the Semantic Web
Doing Clever Things with the Semantic WebDoing Clever Things with the Semantic Web
Doing Clever Things with the Semantic Web
 
Publications Fatima Dargam - 09.05.2016
Publications  Fatima Dargam - 09.05.2016Publications  Fatima Dargam - 09.05.2016
Publications Fatima Dargam - 09.05.2016
 

Viewers also liked

Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...
Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...
Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...Paolo Nesi
 
Aggregatore di Open Data del territorio fiorentino e toscano
Aggregatore di Open Data del territorio fiorentino e toscano Aggregatore di Open Data del territorio fiorentino e toscano
Aggregatore di Open Data del territorio fiorentino e toscano Paolo Nesi
 
Smart City Strategic Forecast, SmartCity360, Bratislava
Smart City Strategic Forecast, SmartCity360, BratislavaSmart City Strategic Forecast, SmartCity360, Bratislava
Smart City Strategic Forecast, SmartCity360, BratislavaPaolo Nesi
 
RESOLUTE: Resilience management guidelines and Operationalization applied to ...
RESOLUTE: Resilience management guidelines and Operationalization applied to ...RESOLUTE: Resilience management guidelines and Operationalization applied to ...
RESOLUTE: Resilience management guidelines and Operationalization applied to ...Paolo Nesi
 
Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...
Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...
Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...Paolo Nesi
 
Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...
Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...
Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...Paolo Nesi
 
ICARO: business cloud accelerator !
ICARO: business cloud accelerator !ICARO: business cloud accelerator !
ICARO: business cloud accelerator !Paolo Nesi
 
FODD 2015 Mobile App based on ServiceMap, http://www.disit.org/fodd
FODD 2015 Mobile App based on ServiceMap, http://www.disit.org/foddFODD 2015 Mobile App based on ServiceMap, http://www.disit.org/fodd
FODD 2015 Mobile App based on ServiceMap, http://www.disit.org/foddPaolo Nesi
 
Smart City and Open Data Projects and tools of DISIT Lab
Smart City and Open Data Projects and tools of DISIT LabSmart City and Open Data Projects and tools of DISIT Lab
Smart City and Open Data Projects and tools of DISIT LabPaolo Nesi
 
Km4City Smart City API: an integrated support for mobility services
Km4City Smart City API: an integrated support for mobility servicesKm4City Smart City API: an integrated support for mobility services
Km4City Smart City API: an integrated support for mobility servicesPaolo Nesi
 
Km4City White Paper: Production tools for Smart City, from data to services f...
Km4City White Paper: Production tools for Smart City, from data to services f...Km4City White Paper: Production tools for Smart City, from data to services f...
Km4City White Paper: Production tools for Smart City, from data to services f...Paolo Nesi
 
Functional Resonance Analysis Method based- Decision Support tool for Urban T...
Functional Resonance Analysis Method based- Decision Support tool for Urban T...Functional Resonance Analysis Method based- Decision Support tool for Urban T...
Functional Resonance Analysis Method based- Decision Support tool for Urban T...Paolo Nesi
 
Km4City: Knowledge Model 4 the City: molti dati + km4city = +conoscenza e se...
Km4City: Knowledge Model 4 the City:  molti dati + km4city = +conoscenza e se...Km4City: Knowledge Model 4 the City:  molti dati + km4city = +conoscenza e se...
Km4City: Knowledge Model 4 the City: molti dati + km4city = +conoscenza e se...Paolo Nesi
 
Monitoring Public Attention on Environment Issues with Twitter Vigilance
Monitoring Public Attention on Environment Issues with Twitter VigilanceMonitoring Public Attention on Environment Issues with Twitter Vigilance
Monitoring Public Attention on Environment Issues with Twitter VigilancePaolo Nesi
 
Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...
Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...
Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...Paolo Nesi
 
Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...
Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...
Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...Paolo Nesi
 
Km4City: una soluzione aperta per erogare servizi Smart City
Km4City: una soluzione aperta per erogare servizi Smart CityKm4City: una soluzione aperta per erogare servizi Smart City
Km4City: una soluzione aperta per erogare servizi Smart CityPaolo Nesi
 

Viewers also liked (17)

Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...
Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...
Km4City: A reusable example of a Metropolitan-Wide Data Platform, MAJORCITIES...
 
Aggregatore di Open Data del territorio fiorentino e toscano
Aggregatore di Open Data del territorio fiorentino e toscano Aggregatore di Open Data del territorio fiorentino e toscano
Aggregatore di Open Data del territorio fiorentino e toscano
 
Smart City Strategic Forecast, SmartCity360, Bratislava
Smart City Strategic Forecast, SmartCity360, BratislavaSmart City Strategic Forecast, SmartCity360, Bratislava
Smart City Strategic Forecast, SmartCity360, Bratislava
 
RESOLUTE: Resilience management guidelines and Operationalization applied to ...
RESOLUTE: Resilience management guidelines and Operationalization applied to ...RESOLUTE: Resilience management guidelines and Operationalization applied to ...
RESOLUTE: Resilience management guidelines and Operationalization applied to ...
 
Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...
Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...
Estrazione e Deduzione della Conoscenza via Modelli Semantici: From Social N...
 
Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...
Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...
Km4City, Smart City Urban Platform, From Data to Services for the Sentient Ci...
 
ICARO: business cloud accelerator !
ICARO: business cloud accelerator !ICARO: business cloud accelerator !
ICARO: business cloud accelerator !
 
FODD 2015 Mobile App based on ServiceMap, http://www.disit.org/fodd
FODD 2015 Mobile App based on ServiceMap, http://www.disit.org/foddFODD 2015 Mobile App based on ServiceMap, http://www.disit.org/fodd
FODD 2015 Mobile App based on ServiceMap, http://www.disit.org/fodd
 
Smart City and Open Data Projects and tools of DISIT Lab
Smart City and Open Data Projects and tools of DISIT LabSmart City and Open Data Projects and tools of DISIT Lab
Smart City and Open Data Projects and tools of DISIT Lab
 
Km4City Smart City API: an integrated support for mobility services
Km4City Smart City API: an integrated support for mobility servicesKm4City Smart City API: an integrated support for mobility services
Km4City Smart City API: an integrated support for mobility services
 
Km4City White Paper: Production tools for Smart City, from data to services f...
Km4City White Paper: Production tools for Smart City, from data to services f...Km4City White Paper: Production tools for Smart City, from data to services f...
Km4City White Paper: Production tools for Smart City, from data to services f...
 
Functional Resonance Analysis Method based- Decision Support tool for Urban T...
Functional Resonance Analysis Method based- Decision Support tool for Urban T...Functional Resonance Analysis Method based- Decision Support tool for Urban T...
Functional Resonance Analysis Method based- Decision Support tool for Urban T...
 
Km4City: Knowledge Model 4 the City: molti dati + km4city = +conoscenza e se...
Km4City: Knowledge Model 4 the City:  molti dati + km4city = +conoscenza e se...Km4City: Knowledge Model 4 the City:  molti dati + km4city = +conoscenza e se...
Km4City: Knowledge Model 4 the City: molti dati + km4city = +conoscenza e se...
 
Monitoring Public Attention on Environment Issues with Twitter Vigilance
Monitoring Public Attention on Environment Issues with Twitter VigilanceMonitoring Public Attention on Environment Issues with Twitter Vigilance
Monitoring Public Attention on Environment Issues with Twitter Vigilance
 
Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...
Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...
Overview on Smart City, DISIT lab solution for beginners, 2015, Part 7: Distr...
 
Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...
Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...
Twitter Vigilance: Modelli e Strumenti per l’Analisi e lo Studio di Dati Soci...
 
Km4City: una soluzione aperta per erogare servizi Smart City
Km4City: una soluzione aperta per erogare servizi Smart CityKm4City: una soluzione aperta per erogare servizi Smart City
Km4City: una soluzione aperta per erogare servizi Smart City
 

Similar to Graph Databases Lifecycle Methodology and Tool to Support Index/Store Versioning

DISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloudDISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloudPaolo Nesi
 
Knowledge mining and Semantic Models: from Cloud to Smart City
Knowledge mining and Semantic Models: from Cloud to Smart CityKnowledge mining and Semantic Models: from Cloud to Smart City
Knowledge mining and Semantic Models: from Cloud to Smart CityPaolo Nesi
 
Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...
Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...
Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...Paolo Nesi
 
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...Paolo Nesi
 
Km4City: Smart City Ontology Building for Effective Erogation of Services
Km4City: Smart City Ontology Building for Effective Erogation of ServicesKm4City: Smart City Ontology Building for Effective Erogation of Services
Km4City: Smart City Ontology Building for Effective Erogation of ServicesPaolo Nesi
 
Big Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing toolsBig Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing toolsPaolo Nesi
 
Technologies for Enhancing Knowledge and Training, the future of e-learning t...
Technologies for Enhancing Knowledge and Training, the future of e-learning t...Technologies for Enhancing Knowledge and Training, the future of e-learning t...
Technologies for Enhancing Knowledge and Training, the future of e-learning t...Paolo Nesi
 
"Km4City: Smart City Ontology Building for Effective Erogation of Services"
"Km4City: Smart City Ontology Building for Effective Erogation of Services""Km4City: Smart City Ontology Building for Effective Erogation of Services"
"Km4City: Smart City Ontology Building for Effective Erogation of Services"diannepatricia
 
A Smart City Development kit for designing Web and Mobile Apps
A Smart City Development kit for designing  Web and Mobile AppsA Smart City Development kit for designing  Web and Mobile Apps
A Smart City Development kit for designing Web and Mobile AppsPaolo Nesi
 
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...Paolo Nesi
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euEUDAT
 
Industria 4.0 @ DISIT lab
Industria 4.0 @ DISIT labIndustria 4.0 @ DISIT lab
Industria 4.0 @ DISIT labPaolo Nesi
 
DISIT lab Overview on Tourism and Training, June 2014
DISIT lab Overview on Tourism and Training, June 2014DISIT lab Overview on Tourism and Training, June 2014
DISIT lab Overview on Tourism and Training, June 2014Paolo Nesi
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
ESWC 2015 - EU Networking Session
ESWC 2015 - EU Networking SessionESWC 2015 - EU Networking Session
ESWC 2015 - EU Networking SessionErik Mannens
 
DSpace-CRIS_An open source solution for Research_EDU15
DSpace-CRIS_An open source solution for Research_EDU15DSpace-CRIS_An open source solution for Research_EDU15
DSpace-CRIS_An open source solution for Research_EDU15Michele Mennielli
 
Overview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for BeginnersOverview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for BeginnersPaolo Nesi
 
KM4city, Il Valore degli #OpenData: Esperienze a confronto
KM4city, Il Valore degli #OpenData: Esperienze a confrontoKM4city, Il Valore degli #OpenData: Esperienze a confronto
KM4city, Il Valore degli #OpenData: Esperienze a confrontoPaolo Nesi
 
DAI DATI INTELLIGENTI AI SERVIZI Smart City API Hackathon
DAI DATI INTELLIGENTI AI SERVIZI Smart City API HackathonDAI DATI INTELLIGENTI AI SERVIZI Smart City API Hackathon
DAI DATI INTELLIGENTI AI SERVIZI Smart City API HackathonPaolo Nesi
 

Similar to Graph Databases Lifecycle Methodology and Tool to Support Index/Store Versioning (20)

DISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloudDISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloud
 
Knowledge mining and Semantic Models: from Cloud to Smart City
Knowledge mining and Semantic Models: from Cloud to Smart CityKnowledge mining and Semantic Models: from Cloud to Smart City
Knowledge mining and Semantic Models: from Cloud to Smart City
 
Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...
Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...
Twitter Vigilance: a Multi-User platform for Cross-Domain Twitter Data Analyt...
 
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...
 
Km4City: Smart City Ontology Building for Effective Erogation of Services
Km4City: Smart City Ontology Building for Effective Erogation of ServicesKm4City: Smart City Ontology Building for Effective Erogation of Services
Km4City: Smart City Ontology Building for Effective Erogation of Services
 
Big Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing toolsBig Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing tools
 
Technologies for Enhancing Knowledge and Training, the future of e-learning t...
Technologies for Enhancing Knowledge and Training, the future of e-learning t...Technologies for Enhancing Knowledge and Training, the future of e-learning t...
Technologies for Enhancing Knowledge and Training, the future of e-learning t...
 
"Km4City: Smart City Ontology Building for Effective Erogation of Services"
"Km4City: Smart City Ontology Building for Effective Erogation of Services""Km4City: Smart City Ontology Building for Effective Erogation of Services"
"Km4City: Smart City Ontology Building for Effective Erogation of Services"
 
A Smart City Development kit for designing Web and Mobile Apps
A Smart City Development kit for designing  Web and Mobile AppsA Smart City Development kit for designing  Web and Mobile Apps
A Smart City Development kit for designing Web and Mobile Apps
 
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
Linked Open Graph: browsing multiple SPARQL entry points to build your own LO...
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.eu
 
Industria 4.0 @ DISIT lab
Industria 4.0 @ DISIT labIndustria 4.0 @ DISIT lab
Industria 4.0 @ DISIT lab
 
DISIT lab Overview on Tourism and Training, June 2014
DISIT lab Overview on Tourism and Training, June 2014DISIT lab Overview on Tourism and Training, June 2014
DISIT lab Overview on Tourism and Training, June 2014
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Information entanglement
Information entanglementInformation entanglement
Information entanglement
 
ESWC 2015 - EU Networking Session
ESWC 2015 - EU Networking SessionESWC 2015 - EU Networking Session
ESWC 2015 - EU Networking Session
 
DSpace-CRIS_An open source solution for Research_EDU15
DSpace-CRIS_An open source solution for Research_EDU15DSpace-CRIS_An open source solution for Research_EDU15
DSpace-CRIS_An open source solution for Research_EDU15
 
Overview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for BeginnersOverview on Smart City: Smart City for Beginners
Overview on Smart City: Smart City for Beginners
 
KM4city, Il Valore degli #OpenData: Esperienze a confronto
KM4city, Il Valore degli #OpenData: Esperienze a confrontoKM4city, Il Valore degli #OpenData: Esperienze a confronto
KM4city, Il Valore degli #OpenData: Esperienze a confronto
 
DAI DATI INTELLIGENTI AI SERVIZI Smart City API Hackathon
DAI DATI INTELLIGENTI AI SERVIZI Smart City API HackathonDAI DATI INTELLIGENTI AI SERVIZI Smart City API Hackathon
DAI DATI INTELLIGENTI AI SERVIZI Smart City API Hackathon
 

Recently uploaded

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 

Graph Databases Lifecycle Methodology and Tool to Support Index/Store Versioning

  • 1. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Graph Databases Lifecycle Methodology  and Tool to Support Index/Store  Versioning Pierfrancesco Bellini, Ivan Bruno, Paolo Nesi, Nadia Rauch DISIT Lab Dipartimento di Ingegneria dell’Informazione, DINFO Università degli Studi di Firenze Via S. Marta 3, 50139, Firenze, Italy Tel: +39-055-2758511, fax: +39-055-2758570 http://www.disit.dinfo.unifi.it alias http://www.disit.org Paolo.nesi@unifi.it DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 1
  • 2. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Context and Motivations • Graph database are taking place for systems exploiting  knowledge base, KB – Include a set of ontologies and data instances: static data,  reconciliation data, real time data, historical data, gelocated data, etc.  – Smart city, smart cloud, smart learning, etc. • KB need complex and non consolidated methodologies  for their implementation – Many issues may lead to invalidate a building in favor of new  version, e.g., looking for more inference, corrections in model  ontology, changes in historical data, etc. – Methodologies present iterative processes that lead to  restructure/rebuild the knowledge base  DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 2
  • 3. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Context and Motivations (2) • RDF stores (end they end points) are a way for  Knowledge Base implementation  • RDF stores see Ontologies and Instances as triple  (quadruple) s‐p‐o (context) • RDF stores presents several problems in managing: – Versioning and thus deleting triples including the  inferred triples… at real time performance ? – Performance in db store building (inference) – Performance in db store querying (inference) DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 3
  • 4. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it The paper contributions • A methodology for knowledge base life cycle  building and improvement – addressing more details and data kind and cases of  the state of the art methodologies  • A versioning system for RDF store building  supporting the methodology – Completely new tool that can save up to 90% of time  in RDF rebuilding DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 4
  • 5. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it RDF store service: building process • built incrementally via progressive refinements  mediating among – reusing ontological models, – increasing the capability of making deductions and  reasoning on the knowledge base concepts, – maintaining acceptable: query and rendering  performances,  – simplifying the design of the front‐end services,  – flexibility to the arrival of additional data and models  and/or corrections,  DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 5
  • 6. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it RDF KB life cycle methodology DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 6
  • 7. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Ontology Construction • Domain Analysis:  – concepts, abstractions, aggregation,  classes vs attributes, etc.  • Available Ontologies – Diffusion, Licensing, quality, inference,  etc. • Ontology Integration – Glued concepts, more inference  • Ontology review – Conceptual assessment – Formal verification with metrics and  tools • Points from which one has to restart,  have to be saved • Integrated version of the ontology goes  IN USE DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 7 In use
  • 8. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Km4city example • reuses:  – dcterms to set of properties and classes for modeling  metadata;  – foaf dedicated to relations among people or groups;  – schema.org for a description of people and organizations;  – wgs84_pos representing latitude and longitude;  – GoodRelations for a description of business entities and their  locations;  – OWL‐Time for temporal modeling;  – OTN for transport aspects;  – GIS Dictionary, to represent the spatial component of  geographic features DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 8
  • 9. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Data ingestion and enrichment • Static Data can be:  – Open Data,  – Historical real time data (from  – to) – Enrichment data (to be  identified) – Reconciliation Data (to be  identified) • Problems may derive from:  – Inconsistencies instance level ? – Incompleteness: missing onto  concepts, missing links, … DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 9 In use
  • 10. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Real Time Data • Need to be already verified and  reconciled  – No changes in their structure – No or known variability on  instances • May produce large volume of  Cumulated data  – can become the substantial part of  the KB – cannot be deleted – Cannot be easily extracted from  the RDF store, thus historical data  may be saved for rebuilding DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 10 In use Histori cal triples
  • 11. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it RDF Indexing Flow and Requirements • There are several reasons to revise/change the  RDF Indexing and thus the RDF Store itself  –the ontology and thus the: • data mapping and triplification invalidating the  indexing and the materialization of triples –the data triples coming from ingestion,  historical, reconciliation, as:  • a new data mapping, quality improvement,  • changes in performing reconciliation, enrichment  and triplification processes.  DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 11
  • 12. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it RDF Index and RDF Index Descriptor • The RDF Index is substantially the RDF Store  containing the triple (ontology, static data, etc…) • The RDF Index Descriptor is: – the recipe to create the RDF Index and  – the set of triples adopted to build it • Since the version of the RDF Store is not an a  viable task (without redesigning the RDF store)  we performed the versioning of the RDF Index  Descriptor DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 12
  • 13. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Versioning: File and RDF Index DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 13 Contains reconciliation of triples connecting parking locations wrt civic numbers depends on the ontology and on the parking area data sets.
  • 14. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Verification and Validation • In a real production of big data RDF store,  – hundreds of files containing triples are produced – The restarting from scratch is time consuming, may  be error prone, may lose the versioning / evolution  value • Not all changes can produce consistent RDF KB  Store. • The saving of intermediate consistent version  can lead t o save time in exploiting the saved  version as restarting point. DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 14
  • 15. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Example: 4 versions on the same core DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 15
  • 16. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it RDF Index Manager tool • Keep tracing RDF KB Store Versions, RKBSV, in terms of files of triples,  index‐description, and RDF Index; • Maintaining a repository of RKBSVs where they could be stored and  retrieved;  • Selecting a RKBSV from the repository for modification, to examine  changes and the history version, to be used as base for building a new  version; • Managing the index descriptor as a list of files containing triples; • Generating a RDF KB index on the basis of an RKBSV independently  from the RDF store kind automatically, and in particular for SESAME  OWLIM and Virtuoso; • Monitoring the RDF KB index generation and the feeding state; • Suggest the closest version of the RKBSV with respect to the  demanded new index in terms of files of triples; • Avoiding manually managing the script file of indexing, since it is time  consuming and an error prone process. DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 16
  • 17. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it RIM tool: building monitor during production DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 17
  • 18. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it RIM tool: during RDF assessment DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 18
  • 19. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Experimental Results DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 19 Ontologi es + street  graphs + smart city  Services +Enrich&  Reconciliations +Historical  data 1 month Indexing process Final number of triples 15.809 33.547.501 34.462.930 34.557.142 44.218.719 Final number of Files 12 137 178 185 27794 Added triples with respect to previous version 15809 33.531.692 915.429 94.212 9.661.577 Added Files with respect to previous version 12 125 41 7 27609 OWLIM SE 4.3 Indexing Time without RIM (s) 18 6536 6198 7516 12093 Indexing Time with RIM (s) 11 6029 514 343 5745 % of saved time,  RIM versioning 38,9 7,8 91,7 95,4 52,5 Final Number of triples (including geo +  inferred) 16062 57.486.956 59.395.432 59.486.748 73.441.126 disk space in Mbyte 310 8669 8936 9039 13110 VIRTUOSO 7.2 Indexing Time without RIM (s) 146 806 964 1000 2487 Indexing Time with RIM (s) 156 833 421 296 1932 % of saved time,  RIM versioning ‐6,8 ‐3,3 56,3 70,4 22,3 Final Number of triples (including geo, no  inferred) 21.628 35.452.613 36.301.322 36.420.445 46.232.510 disk space in Mbyte 68 1450 1632 1631 2294 GraphDB 6.1 Indexing Time without RIM (s) 9 7818 7929 7671 12915 Indexing Time with RIM (s) 2 6791 454 214 4849 % of saved time,  RIM versioning 77,8 13,1 94,3 97,2 62,45 Final Number of triples (including geo +  inferred) 15.809 57.486.415 59.394.891 59.487.551 73.441.929 disk space in Mbyte 96 4276 4466 4643 5714
  • 20. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it New Version • Support Ontology Licensing – To take into account in ontology building and in KB RDF  store usage, querying • Support Data Licensing • To help selecting ontologies • To take into account in KB RDF store usage, querying • Support Data ingestion process with  – integrated Data Ingestion Manager – Maintaining under control the data sets to be included,  their licenses, triplification, etc.  DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 20
  • 21. DISIT Lab, Distributed Data Intelligence and Technologies Distributed Systems and Internet Technologies Department of Information Engineering (DINFO) http://www.disit.dinfo.unifi.it Conclusions • The RIM model and tool allow: – Keeping under control and trace the RDF life cycle  from construction of the ontology to the indexing  and validation.  – Reducing the time for indexing (RDF store  construction) up to the 97% in some cases. The benefits have been demonstrated for the  most diffused RDF stores: OWLIM, GraphDB and  Virtuoso. DISIT Lab (DINFO UNIFI), DMS 2015, Vancouver, Canada 21