These slides can be reused as they are according to the CC BY 4.0 license.
Here's the paper presented @ ISWC2019: https://arxiv.org/pdf/1905.02840.pdf
ArCo is the Italian Cultural Heritage knowledge graph, consisting of a network of seven vocabularies and 169 million triples about 820 thousand cultural entities. It is distributed jointly with a SPARQL endpoint, software for converting catalogue records to RDF, and a rich suite of documentation material (testing, evaluation, how-to, examples, etc.). ArCo is based on the official General Catalogue of the Italian Ministry of Cultural Heritage and Activities (MiBAC) and its associated encoding regulations. The catalogue collects and validates the records of (ideally) all Italian Cultural Heritage properties (excluding libraries and archives), contributed by CH administrators from all over Italy.
ArCo: the Italian Cultural Heritage Knowledge Graph
1. ArCo: the Italian Cultural Heritage Knowledge Graph
Valentina Anita Carriero (1,2), Aldo Gangemi (1,2),
Maria Letizia Mancinelli (3), Ludovica Marinucci (2),
Andrea Giovanni Nuzzolese (2), Valentina Presutti (1,2)
and Chiara Veninata (3)
ISWC 2019
October 29th
Auckland, New Zealand
1 University of Bologna, Bologna, IT
2 STLab, ISTC-CNR, Rome, IT
3 ICCD, MiBAC, Rome, IT
Testa di fanciulla, Leonardo da Vinci
2. Italian Cultural Heritage
ICCD (MiBAC) coordinates
cataloguing activities and CH data
management
General Catalogue of
Italian CH
cataloguing standards,
controlled lists
almost 3M
catalogue records
+800K
catalogue records released
~15M
catalogue numbers
30 types of cultural
properties
6. ArCo KG in numbers
Ontology network
• 7 ontology modules
• 5,058 axioms
• 1,049 predicates
• reuse of 11 other ontologies
Data
• 169,151,644 triples
• 20,838 owl:sameAs linking to 20,479 distinct entities
in 8 other datasets and 2 controlled vocabularies
7. Why?
• Improving PA data management, enhancing cultural heritage
• Connection to other Knowledge Bases
• Fostering reuse by third parties
• Addressing new modelling issues
• Not only metadata, not only the catalogue’s perspective
→ representing implicit knowledge embedded
in the metadata → discovery science
epistemological
!=
ontological
data for
scholars and
researchers
data &
ontologies
mutual
enrichment
= a KG of Italian Cultural
Heritage,
but not only!
e.g. different types of locations
classifications of musical instruments
diagnosis of paleopathologies
coin issuance, …
better research
results
catalogue record
cultural
property
not limited to the
Italian domain
8. How to use ArCo
Docker container available on GitHub
+ its running instance online
https://w3id.org/arco
easily
installable
multi-language
last 8 months
2 clones
260 views
29 unique visitors
last 3 months
1,371 views
766 unique visitors
12. ArCo at a glance
supporting new users
& users from different backgrounds
Examples
CQ
SPARQL
query
results
RDF
visualization
13. Using XD for developing ArCo KG
eXtreme Design
detailed
documentation of
the applied
methodology
14. requirements collected
in the form of stories
Type: Linking my data to ArCo data
Title: Cultural heritage and residential property
Story: I am looking for a residential property to
buy, and I want to filter the results based on the
type of cultural heritage nearby.
applications
linking LOD
publishing LOD
e.g.
translated as
Competency Questions
and other constraints
CQs
Which is the geometry of
a cultural property?
e.g.
continuous feedback
& new emerging
requirements
customer
team
design
team
ICCD
wider community
private & public
companies and
institutions
+
requirements from
beyond domain
experts
tools
16. design
team design principles
pattern-based design
ODP
repository
external
ontologies
OPLa
ontology
annotated with
12 reused ODPs
over the network
shortcut binary
relations along
with
n-ary relations
direct &
indirect
reuse
+ 1 new ODP
for recurrent event series as
collections of events
@ WOP2019
http://bit.do/fbQWk
e.g.
situation
collection
classification
componency
[…]
18. Poppi Castle
Poppi
The Uffizi
Florence
Woman Portrait
by
Netscher Caspar
17th century
Modelling issue (1)
currently
located
Pitti Palace
Florence
involved in an
exhibition in 1773
temporarily stored
in 1942
19. Example of reused ODPs (1)
Cultural property
time-indexed
typed locations
time-indexed
situation
time-indexed
situation
e.g.
CIDOC-CRM
E9 Move (P26 moved to / P27 moved from)
P53 has former or current location
P54 has current permanent location
P55 has current location
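The time-indexed typed location idea can be illustrated with a minimal Python sketch. It is not ArCo's ontology or its RDF encoding: the class name `LocationSituation`, its fields, and the helper functions are hypothetical, and the pairing of each place with a location type is an assumption read off the Woman Portrait example (the transcript does not state which place goes with which type).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class LocationSituation:
    """One typed, optionally time-indexed location of a cultural property."""
    cultural_property: str
    location: str
    location_type: str          # e.g. "current", "exhibition", "temporary storage"
    year: Optional[int] = None  # None when no time index is given


def locations_of_type(situations: List[LocationSituation], location_type: str) -> List[str]:
    """Return the places that play the given role for the property."""
    return [s.location for s in situations if s.location_type == location_type]


def locations_in_year(situations: List[LocationSituation], year: int) -> List[str]:
    """Return the places whose situation is indexed by the given year."""
    return [s.location for s in situations if s.year == year]


# Assumed pairing of places and location types, for illustration only.
situations = [
    LocationSituation("Woman Portrait", "The Uffizi, Florence", "current"),
    LocationSituation("Woman Portrait", "Pitti Palace, Florence", "exhibition", 1773),
    LocationSituation("Woman Portrait", "Poppi Castle, Poppi", "temporary storage", 1942),
]

print(locations_of_type(situations, "exhibition"))
print(locations_in_year(situations, 1942))
```

Keeping one situation object per (location, type, time) triple is what lets the same property hold several locations at once without overwriting the current one.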
20. Modelling issue (2)
Syracusan coin
4th century B.C.
“conservation status
of the coin:
bad”
catalogue record
“conservation status
of the coin:
good”
21. Example of reused ODPs (2)
Catalogue record
catalogue
record
versions
information
object
sequence
time-indexed
situation
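A small Python sketch of catalogue record versions as a sequence, where each version carries its own statement about the coin's conservation status. The names `RecordVersion`, `status_history` and `latest` are hypothetical, and the order of the two statuses ("bad" first, "good" later) is an assumption made only to have something to run.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class RecordVersion:
    """One version of a catalogue record and what it states about the property."""
    version: int
    conservation_status: str


def status_history(versions: List[RecordVersion]) -> List[str]:
    """Conservation statuses in version order, oldest first."""
    return [v.conservation_status for v in sorted(versions, key=lambda v: v.version)]


def latest(versions: List[RecordVersion]) -> RecordVersion:
    """The most recent version of the record."""
    return max(versions, key=lambda v: v.version)


# Assumed ordering of the two statements about the Syracusan coin.
versions = [RecordVersion(2, "good"), RecordVersion(1, "bad")]

print(status_history(versions))
print(latest(versions).conservation_status)
```

Because older versions are kept rather than overwritten, a change in the record (epistemological) stays distinguishable from a change in the coin itself (ontological).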
22. data production
design team
RDFizer
ICCD-XML data
to
ArCo-RDF
http://bit.do/fbqxL
ID generation
deduplication
disambiguation
entity-linking
LIMES
strings
to
entities
entity-linking against
ArCo itself
entities
fingerprints
e.g. active period of
an author
e.g.
<AUTN>Friscia Albert</AUTN>
friscia-albert
albert-friscia
e.g.
“Andrea d’Agnolo”
“Andrea d’Agnolo detto del Sarto”
MD5 checksum
dcd4ca7b54dd3d7dac083dd4c54a9eef
LInk discovery framework for MEtric Spaces
ULAN
TGN
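The ID generation and deduplication steps can be sketched in Python. This is not the RDFizer's actual procedure: the functions `slug` and `fingerprint` are hypothetical, and the normalization rules (lowercasing, sorting tokens so that word order does not matter, hashing with MD5) are assumptions inspired by the "friscia-albert" / "albert-friscia" and MD5 examples. It does not attempt to reproduce the checksum shown on the slide.

```python
import hashlib
import re
import unicodedata


def slug(name: str) -> str:
    """Lowercase the name and join its alphanumeric tokens with hyphens."""
    ascii_name = unicodedata.normalize("NFKD", name).encode("ascii", "ignore").decode("ascii")
    tokens = re.findall(r"[a-z0-9]+", ascii_name.lower())
    return "-".join(tokens)


def fingerprint(name: str) -> str:
    """Order-insensitive MD5 fingerprint of a name (hex digest)."""
    tokens = sorted(slug(name).split("-"))
    return hashlib.md5("-".join(tokens).encode("utf-8")).hexdigest()


print(slug("Friscia Albert"))
# Both spellings map to the same fingerprint, so they can share one identifier.
print(fingerprint("Friscia Albert") == fingerprint("Albert Friscia"))
```

A fingerprint of this kind only handles reordering and case. Variants such as “Andrea d’Agnolo” versus “Andrea d’Agnolo detto del Sarto” still need the disambiguation and entity-linking steps listed above.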
23. testing
testing
team
https://w3id.org/testalod
CQ verification
inference
verification
error
provocation
refactoring &
integration
design
team
release &
versioning
huge manual effort
→ prototype for
running unit tests
e.g. “When was a cultural property created?”
→ SPARQL query
e.g. ex:MonumentX a :CulturalProperty .
ex:MonumentX :hasCulturalPropertyComponent ex:ComponentY .
Expected inference: ex:MonumentX a :ComplexCulturalProperty .
@ ISWC2019 demo session
e.g. a-cd:Dating owl:disjointWith a-cd:Author .
ex:Example a a-cd:Dating, a-cd:Author .
Expected error: inconsistency
all tests on GitHub
http://bit.do/fbuHS
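The inference verification and error provocation examples above can be mimicked with a toy Python sketch. It is not TESTaLOD and it does not use an OWL reasoner: triples are plain tuples, and the single hand-written rule in `infer_complex_properties` is an assumption standing in for whatever axioms actually produce the expected inference in ArCo.

```python
RDF_TYPE = "rdf:type"


def infer_complex_properties(triples):
    """Toy rule: a cultural property that has a component is a complex cultural property."""
    inferred = set(triples)
    for s, p, _ in triples:
        if p == ":hasCulturalPropertyComponent" and (s, RDF_TYPE, ":CulturalProperty") in triples:
            inferred.add((s, RDF_TYPE, ":ComplexCulturalProperty"))
    return inferred


def disjointness_violations(triples, disjoint_pairs):
    """Return the subjects typed with both classes of any disjoint pair."""
    types = {}
    for s, p, o in triples:
        if p == RDF_TYPE:
            types.setdefault(s, set()).add(o)
    return sorted({s for s, ts in types.items() for a, b in disjoint_pairs if a in ts and b in ts})


# Inference verification: toy data from the example above.
inference_data = {
    ("ex:MonumentX", RDF_TYPE, ":CulturalProperty"),
    ("ex:MonumentX", ":hasCulturalPropertyComponent", "ex:ComponentY"),
}
expected = ("ex:MonumentX", RDF_TYPE, ":ComplexCulturalProperty")
print(expected in infer_complex_properties(inference_data))

# Error provocation: data that deliberately violates a disjointness axiom.
error_data = {
    ("ex:Example", RDF_TYPE, "a-cd:Dating"),
    ("ex:Example", RDF_TYPE, "a-cd:Author"),
}
print(disjointness_violations(error_data, [("a-cd:Dating", "a-cd:Author")]))
```

Each check is phrased as "toy data in, expected outcome out", which is the shape a unit test needs in order to be rerun automatically after every refactoring.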
24. Conclusion
• The biggest knowledge graph of Italian Cultural Heritage
• Available on GitHub, as a Docker container, on Zenodo, LOV and DataHub
• Released with CC BY-SA 4.0 license (Attribution-ShareAlike)
• Developed using a rigorous methodology based on ODP, supporting its
reuse
• XD-by-example
• Addressing diverse requirements
• from applications to discovery science
eXtreme Design
25. Ongoing & future work
• ArCo is an evolving creature
• concepts yet to be modelled
• knowledge extraction from text
• translation to other languages
• Improving eXtreme Design
• open requirements gathering
how to capture requirements from diverse types of users
• call for tools for facilitating testing and reuse
26. What can you do?
• e.g. are you a researcher and philosopher interested in the relation between
ontological and epistemological changes in cultural properties and their
metadata?
• e.g. do you want to support Cultural Heritage students by providing an app for
studying e.g. what are the most popular subjects per cultural property type?
• e.g. are you interested in analysing how chemicals are used in conservation
interventions over time?
• […]
Use ArCo for answering your (research) questions!
Leave us feedback, use cases and requirements we don’t satisfy yet!
Contribute to ArCo’s future development!
http://tiny.cc/arco-ggroup
http://tiny.cc/arco-github