ARIADNE is an EU-funded project that aims to integrate archaeological data repositories across Europe by overcoming fragmentation and fostering data sharing. It involves 24 partners from 17 countries. The project conducts networking activities to build community and standards, provides trans-national access to online resources and training, and performs research on data integration, management, and new tools. In its first nine months, ARIADNE has established special interest groups, collected information on partners' datasets and metadata schemas, and begun designing an integrated infrastructure and catalog data model.
ARCLib project presentation from Pasig 2016dp-blog-cz
Digital preservation project by a group of Czech Libraries, financed by the Ministery of Culture of Czech Republic applied research grant. First information.
Aggregation of cultural heritage datasets through the Web of DataNuno Freire
The existence of many digital libraries, maintained by different organizations, brings challenges to the discoverability of cultural heritage (CH) resources. Metadata aggregation is an approach where centralized efforts like Europeana facilitate their discoverability by collecting the resource’s metadata. Nowadays, CH institutions are increasingly applying technologies designed for the wider interoperability on the Web. In this context, we have identified the Schema.org vocabulary and linked data (LD) as potential technologies for innovating CH metadata aggregation. We present the results of an analysis using the case of the Europeana network of aggregators and data providers as basis. We have conducted a survey of the available linked data technology, and we defined a solution, which we have put into practice in a pilot implementation within the Europeana network. In this pilot, the National Library of The Netherlands fulfils the role of data provider, with the Dutch Digital Heritage Network, as national aggregator, supporting the provision of several datasets from the national library to Europeana. The metadata is published using LD practices, having Schema.org as the main vocabulary. The national library also implements all the necessary semantic web mechanisms, defined in our solution, for making the datasets discoverable and harvestable by Europeana. Our proposal involves the use of vocabularies for description of datasets, and their distributions, namely DCAT, VoID and Schema.org. Europeana implements the LD harvester side of the solution and applies it to harvest the Schema.org data from the national library.
ARCLib project presentation from Pasig 2016dp-blog-cz
Digital preservation project by a group of Czech Libraries, financed by the Ministery of Culture of Czech Republic applied research grant. First information.
Aggregation of cultural heritage datasets through the Web of DataNuno Freire
The existence of many digital libraries, maintained by different organizations, brings challenges to the discoverability of cultural heritage (CH) resources. Metadata aggregation is an approach where centralized efforts like Europeana facilitate their discoverability by collecting the resource’s metadata. Nowadays, CH institutions are increasingly applying technologies designed for the wider interoperability on the Web. In this context, we have identified the Schema.org vocabulary and linked data (LD) as potential technologies for innovating CH metadata aggregation. We present the results of an analysis using the case of the Europeana network of aggregators and data providers as basis. We have conducted a survey of the available linked data technology, and we defined a solution, which we have put into practice in a pilot implementation within the Europeana network. In this pilot, the National Library of The Netherlands fulfils the role of data provider, with the Dutch Digital Heritage Network, as national aggregator, supporting the provision of several datasets from the national library to Europeana. The metadata is published using LD practices, having Schema.org as the main vocabulary. The national library also implements all the necessary semantic web mechanisms, defined in our solution, for making the datasets discoverable and harvestable by Europeana. Our proposal involves the use of vocabularies for description of datasets, and their distributions, namely DCAT, VoID and Schema.org. Europeana implements the LD harvester side of the solution and applies it to harvest the Schema.org data from the national library.
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsChristophe Debruyne
Data processing is increasingly the subject of various internal and external regulations, such as GDPR which has recently come into effect. Instead of assuming that such processes avail of data sources (such as files and relational databases), we approach the problem in a more abstract manner and view these processes as taking datasets as input. These datasets are then created by pulling data from various data sources. Taking a W3C Recommendation for prescribing the structure of and for describing datasets, we investigate an extension of that vocabulary for the generation of executable R2RML mappings. This results in a top-down approach where one prescribes the dataset to be used by a data process and where to find the data, and where that prescription is subsequently used to retrieve the data for the creation of the dataset “just in time”. We argue that this approach to the generation of an R2RML mapping from a dataset description is the first step towards policy-aware mappings, where the generation takes into account regulations to generate mappings that are compliant. In this paper, we describe how one can obtain an R2RML mapping from a data structure definition in a declarative manner using SPARQL CONSTRUCT queries, and demonstrate it using a running example. Some of the more technical aspects are also described.
Reference: Christophe Debruyne, Dave Lewis, Declan O'Sullivan: Generating Executable Mappings from RDF Data Cube Data Structure Definitions. OTM Conferences (2) 2018: 333-350
Preseted at OR2017 - Brisbane
Panel Discussion: COAR Next Generation Repositories: Results and Recommendations
The presentation focus on the recommended technologies to implement in Repository platforms
The nearly ubiquitous deployment of repository systems in higher education and research institutions provides the foundation for a distributed, globally networked infrastructure for scholarly communication. However, repository platforms are still using technologies and protocols designed almost twenty years ago, before the boom of the Web and the dominance of Google, social networking, semantic web and ubiquitous mobile devices.
To that end, in April 2016, COAR launched a working group to identify the technologies and architectures of the next generation of repositories. There are two threads to our work: (1) increase the exposure by repositories of uniform behaviors that can be used by machine agents to fuel novel scholarly applications that reach beyond the scope of a single repository and that enable to smoothly embed repository content into mainstream web applications. (2) integrate with existing scholarly infrastructures, specifically those aimed at identification, as a means to solidly embed repositories in the overall scholarly communication landscape.
This panel will present the results of the COAR Next Generation Repositories Working Group including our vision, design assumptions, use cases, architectural and technical recommendations, and next steps. The session will also include time for audience discussion and feedback.
DSP3B: DSpace Interest Group 3B: DSpace-CRIS Workshop · 11/Jun/2015: 3:30pm-5:00pm · Location: Regency E
DSpace-CRIS Workshop
Andrea Bollini, Luigi Andrea Pascarelli, Michele Mennielli, David Palmer
Cineca, Italy; Hong Kong University
The 90-minute workshop will introduce attendees to the latest version of the DSpace-CRIS module, covering its functional and technical aspects.
DSpace-CRIS is an additional open-source module for the DSpace platform. It extends the DSpace data model providing the ability to manage, collect and expose data about any entities of the research domain, such as people, organizational units, projects, grants, awards, patents, publications, and so on. Before OR2015 a new version of the system will be released to follow the new DSpace 5.0 version. The new version contains, among other things, important enhancements of its integration with ORCID.
The DSpace-CRIS extensible data model will be explained in depth, through examples and discussion with participants.
Other main topics are DSpace-CRIS "components", management of relationships and network analysis functionalities.
At the end of the workshop, participants will be able to:
- understand the DSpace-CRIS data model
- evaluate if DSpace-CRIS fits the requirements of their institution
- use the DSpace-CRIS User Interface
- change the default configuration, adapting it to a specific data model.
euroCRIS DSpace-CRIS repository presentation delivered May 12th, 2014 at the #CRIS2014 euroCRIS membership business meeting in Rome. The new euroCRIS repository holds and offers open access to all the CRIS2014 conference submissions, http://bit.ly/1iFFZo9
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...Andrea Bollini
The idea of DSpace-CRIS has its origin in 2009 when the Hong Kong University decided to extend the information exposed in their DSpace IR adding information (people/projects) coming from other systems already in use (mainly) for administrative purpose: a CRIS.
One year ago, November 2012, DSpace-CRIS was released as an open source solution to enrich DSpace (1.8.2). After highlighting the important steps made by the DSpace Community in 2013, that will bring to the final release of DSpace 4.0 in December, Cineca focused its presentation on what DSpace-CRIS is today.
The most important announcement was that DSpace-CRIS is now compatible and compliant with the CERIF standard and that an export feature in CERIF XML will be available in the DSpace-CRIS 4.0 version. Indeed the key components of the CERIF data model are supported natively: UUID, timestamped relation, semantic characterization.
In addition to that, the dynamic, flexible and not hardcoded approach of DSpace-CRIS data model makes it very easy to create new entities (besides the few predefined ones) and configure instances compliant with CERIF.
There are several advantages that DSpace-CRIS brings to Institutional Repositories and to the DSpace community overall:
- CRIS entities as authority for Item metadata values;
- DSpace Items can be linked and displayed in the detail page of any CRIS entities;
- Ability to display selected publications (or any other related entities) in the researcher profile;
- It is possible to create lists of selected publications (or any other related entities);
- CRIS entity detailed page visit;
- Global & Top related CERIF Entity views & downloads referencing the CRIS entity (projects for researchers, researchers for OrgUnits, etc.);
- Global & Top item views & downloads referencing the CRIS entity;
- email and RSS alerts;
- Article level metrics for PubMed (extensible):
- Cited-by count in the item page
- Number of articles for researcher
- Total citations for researcher (only items in local DSpace database will be counted)
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...inside-BigData.com
Ellen Salmon from NASA gave this talk at the 2017 MSST conference. "This talk will describe recent developments at the NASA Center for Climate Simulation, which is funded by NASA’s Science Mission Directorate, and supports the specialized data storage and computational needs of weather, ocean, and climate researchers, as well as astrophysicists, heliophysicists, and planetary scientists. To meet requirements for higher-resolution, higher-fidelity simulations, the NCCS augments its High Performance Computing and storage/retrieval environment. As the petabytes of model and observational data grow, the NCCS is broadening data services offerings and deploying and expanding virtualization resources for high performance analytics."
Watch the video: http://wp.me/p3RLHQ-gPj
Learn more: https://www.nccs.nasa.gov/
and
http://storageconference.us/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsChristophe Debruyne
Data processing is increasingly the subject of various internal and external regulations, such as GDPR which has recently come into effect. Instead of assuming that such processes avail of data sources (such as files and relational databases), we approach the problem in a more abstract manner and view these processes as taking datasets as input. These datasets are then created by pulling data from various data sources. Taking a W3C Recommendation for prescribing the structure of and for describing datasets, we investigate an extension of that vocabulary for the generation of executable R2RML mappings. This results in a top-down approach where one prescribes the dataset to be used by a data process and where to find the data, and where that prescription is subsequently used to retrieve the data for the creation of the dataset “just in time”. We argue that this approach to the generation of an R2RML mapping from a dataset description is the first step towards policy-aware mappings, where the generation takes into account regulations to generate mappings that are compliant. In this paper, we describe how one can obtain an R2RML mapping from a data structure definition in a declarative manner using SPARQL CONSTRUCT queries, and demonstrate it using a running example. Some of the more technical aspects are also described.
Reference: Christophe Debruyne, Dave Lewis, Declan O'Sullivan: Generating Executable Mappings from RDF Data Cube Data Structure Definitions. OTM Conferences (2) 2018: 333-350
Preseted at OR2017 - Brisbane
Panel Discussion: COAR Next Generation Repositories: Results and Recommendations
The presentation focus on the recommended technologies to implement in Repository platforms
The nearly ubiquitous deployment of repository systems in higher education and research institutions provides the foundation for a distributed, globally networked infrastructure for scholarly communication. However, repository platforms are still using technologies and protocols designed almost twenty years ago, before the boom of the Web and the dominance of Google, social networking, semantic web and ubiquitous mobile devices.
To that end, in April 2016, COAR launched a working group to identify the technologies and architectures of the next generation of repositories. There are two threads to our work: (1) increase the exposure by repositories of uniform behaviors that can be used by machine agents to fuel novel scholarly applications that reach beyond the scope of a single repository and that enable to smoothly embed repository content into mainstream web applications. (2) integrate with existing scholarly infrastructures, specifically those aimed at identification, as a means to solidly embed repositories in the overall scholarly communication landscape.
This panel will present the results of the COAR Next Generation Repositories Working Group including our vision, design assumptions, use cases, architectural and technical recommendations, and next steps. The session will also include time for audience discussion and feedback.
DSP3B: DSpace Interest Group 3B: DSpace-CRIS Workshop · 11/Jun/2015: 3:30pm-5:00pm · Location: Regency E
DSpace-CRIS Workshop
Andrea Bollini, Luigi Andrea Pascarelli, Michele Mennielli, David Palmer
Cineca, Italy; Hong Kong University
The 90-minute workshop will introduce attendees to the latest version of the DSpace-CRIS module, covering its functional and technical aspects.
DSpace-CRIS is an additional open-source module for the DSpace platform. It extends the DSpace data model providing the ability to manage, collect and expose data about any entities of the research domain, such as people, organizational units, projects, grants, awards, patents, publications, and so on. Before OR2015 a new version of the system will be released to follow the new DSpace 5.0 version. The new version contains, among other things, important enhancements of its integration with ORCID.
The DSpace-CRIS extensible data model will be explained in depth, through examples and discussion with participants.
Other main topics are DSpace-CRIS "components", management of relationships and network analysis functionalities.
At the end of the workshop, participants will be able to:
- understand the DSpace-CRIS data model
- evaluate if DSpace-CRIS fits the requirements of their institution
- use the DSpace-CRIS User Interface
- change the default configuration, adapting it to a specific data model.
euroCRIS DSpace-CRIS repository presentation delivered May 12th, 2014 at the #CRIS2014 euroCRIS membership business meeting in Rome. The new euroCRIS repository holds and offers open access to all the CRIS2014 conference submissions, http://bit.ly/1iFFZo9
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...Andrea Bollini
The idea of DSpace-CRIS has its origin in 2009 when the Hong Kong University decided to extend the information exposed in their DSpace IR adding information (people/projects) coming from other systems already in use (mainly) for administrative purpose: a CRIS.
One year ago, November 2012, DSpace-CRIS was released as an open source solution to enrich DSpace (1.8.2). After highlighting the important steps made by the DSpace Community in 2013, that will bring to the final release of DSpace 4.0 in December, Cineca focused its presentation on what DSpace-CRIS is today.
The most important announcement was that DSpace-CRIS is now compatible and compliant with the CERIF standard and that an export feature in CERIF XML will be available in the DSpace-CRIS 4.0 version. Indeed the key components of the CERIF data model are supported natively: UUID, timestamped relation, semantic characterization.
In addition to that, the dynamic, flexible and not hardcoded approach of DSpace-CRIS data model makes it very easy to create new entities (besides the few predefined ones) and configure instances compliant with CERIF.
There are several advantages that DSpace-CRIS brings to Institutional Repositories and to the DSpace community overall:
- CRIS entities as authority for Item metadata values;
- DSpace Items can be linked and displayed in the detail page of any CRIS entities;
- Ability to display selected publications (or any other related entities) in the researcher profile;
- It is possible to create lists of selected publications (or any other related entities);
- CRIS entity detailed page visit;
- Global & Top related CERIF Entity views & downloads referencing the CRIS entity (projects for researchers, researchers for OrgUnits, etc.);
- Global & Top item views & downloads referencing the CRIS entity;
- email and RSS alerts;
- Article level metrics for PubMed (extensible):
- Cited-by count in the item page
- Number of articles for researcher
- Total citations for researcher (only items in local DSpace database will be counted)
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...inside-BigData.com
Ellen Salmon from NASA gave this talk at the 2017 MSST conference. "This talk will describe recent developments at the NASA Center for Climate Simulation, which is funded by NASA’s Science Mission Directorate, and supports the specialized data storage and computational needs of weather, ocean, and climate researchers, as well as astrophysicists, heliophysicists, and planetary scientists. To meet requirements for higher-resolution, higher-fidelity simulations, the NCCS augments its High Performance Computing and storage/retrieval environment. As the petabytes of model and observational data grow, the NCCS is broadening data services offerings and deploying and expanding virtualization resources for high performance analytics."
Watch the video: http://wp.me/p3RLHQ-gPj
Learn more: https://www.nccs.nasa.gov/
and
http://storageconference.us/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
Mr. Percy and Zee doing battle with the forces of everyday stuffCBT Partnership
Zee and Mr. Percy are doing battle with the forces of everyday stuff. Zee is the neurotic one while his loyal companion Mr. Percy is the voice of reason.
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...ariadnenetwork
This presentation, by Achille Felicetti of PIN, gives an introduction to the ARIADNE winter school, to the ARIADNE research infrastructure and to the integration of archaeological datasets into the infrastructure. The process of integrating diverse datasets using the ARIADNE Catalogue Data Model to provide a high level description and the strategies to support retrieval by subject, period and map location.
Introduction to ARIADNEplus, the follow-on project from ARIADNE which will extend the scope of the archaeological datasets in the Portal by addition of more disciplines, countries and data types as well as working on best practices and data interoperability.
CLARIAH Toogdag 2018: A distributed network of digital heritage informationEnno Meijers
Slides of my keynote at the CLARIAH Toogdag 2018 on 9 March at the National Library of the Netherlands. The main topics were the development of the distributed digital heritage network and the alignment to and cooperation with the CLARIAH infrastructure and data. It also points at some of the current limitations of the semantic web technology.
Un cloud pour comparer nos gènes aux images du cerveau" Le pionnier des bases de données, aujourd'hui disparu, Jim Gray avait annoncé en 2007 l'emergence d'un 4eme paradigme scientifique: celui d'une recherche scientifique numérique entierement guidée par l'exploration de données massives. Cette vision est aujourd'hui la réalité de tous les jours dans les laboratoire de recherche scientifique, et elle va bien au delà de ce que l'on appelle communément "BIG DATA". Microsoft Research et Inria on démarré en 2010 un projet intitulé Azure-Brain (ou A-Brain) dont l'originalité consiste à a la fois construire au dessus de Windows Azure une nouvelle plateforme d'acces aux données massives pour les applications scientifiques, et de se confronter à la réalité de la recherche scientifique. Dans cette session nous vous proposons dans une premiere partie de resituer les enjeux recherche concernant la gestion de données massives dans le cloud, et ensuite de vous presenter la plateforme "TOMUS Blob" cloud storage optimisé sur Azure. Enfin nous vous presenterons le projet A-Brain et les résultats que nous avons obtenus: La neuro-imagerie contribue au diagnostic de certaines maladies du système nerveux. Mais nos cerveaux s'avèrent tous un peu différents les uns des autres. Cette variabilité complique l'interprétation médicale. D'où l'idée de corréler ldes images IRM du cerveaux et le patrimoine génétique de chaque patient afin de mieux délimiter les régions cérébrales qui présentent un intérêt symptomatique. Les images IRM haute définition de ce projet sont produites par la plate-forme Neurospin du CEA (Saclay). Problème pour Les chercheurs : la masse d'informations à traiter. Le CV génétique d'un individu comporte environ un million de données. À cela s'ajoutent des volumes tout aussi colossaux de pixel 3D pour décrire les images. Un data deluge: des peta octets de donnés et potentiellement des années de calcul. C'est donc ici qu'entre en jeu le cloud et une plateforme optimisée sur Azure pour traiter des applications massivement parallèles sur des données massives... Comme l'explique Gabriel Antoniu, son responsable, cette équipe de recherche rennaise a développé “des mécanismes de stockage efficaces pour améliorer l'accès à ces données massives et optimiser leur traitement. Nos développements permettent de répondre aux besoins applicatifs de nos collègues de Saclay.
Dataverse repository for research data in the COVID-19 Museumvty
The Covid-19 Museum has an ambition to create a platform to deposit, consult, aggregate and study heterogeneous data about the pandemics using features of a distributed web service. To achieve this purpose, Dataverse has been selected as a reliable FAIR data repository with built-in search engine and functionality that allows adding computing resources to explore archived resources both on data and metadata. Presentation by
Slava Tykhonov, DANS-KNAW (The Royal Netherlands Academy of Arts and Sciences). Université Paris Cité, 19 April 2022.
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...4Science
Presented by Susanna Mornati at the 2019 DSpace North American User Group Meeting September 23 & 24, 2019 at the University of Minnesota in Minneapolis.
Abstract: DSpace-CRIS is a free open-source platform based on DSpace for Research Data and Information Management, adopted by a wide international community of universities and research centers: DSpace-CRIS Home. It complies with recommendations, open standards and technologies such as the OAI-PMH, SignPosting, and ResourceSync (recommended by the COAR Next Generation Repositories WG), it features complete ORCID integration, compliance with the CERIF model, the IIIF framework, and with the OpenAIRE Guidelines for Literature Repositories, Data Archives, CRIS Managers, to improve findability, accessibility, interoperability, and reuse of digital assets for research and cultural heritage. DSpace-CRIS collects and disseminates information about researchers' profiles, organizations, publications, patents, grants, awards, and all entities that populate the research domain and their relationships, besides storing and exposing full-text publications, datasets, and other relevant digital objects, providing persistent identifiers and long-term preservation capabilities. DSpace-RDM exposes datasets to visual exploration and M2M streaming for analysis thanks to the integration with CKAN. DSpace-GLAM enhances the fruition of the cultural heritage through the (crowd-funded) IIIF image viewer, providing remote fruition of cultural heritage and offering a great user experience. These flavors of DSpace allow to expose and share open data, open information, and open digital objects in a collaborative, interoperable, and sustainable way. The use cases of a variety of institutions in different countries and continents will be shared to show the use of this powerful technology.
A construction of a regional relational and spatial geodatabase as a tool for managing geological and mineral information. A presentation by João Henrique Gonçalves, database and geoprocessing expert.
Dec'2013 webinar from the EUCLID project on managing large volumes of Linked Data
webinar recording at https://vimeo.com/84126769 and https://vimeo.com/84126770
more info on EUCLID: http://euclid-project.eu/
Similar to ARIADNE: progress in the first nine month (20)
The Visual Media Service (VMS) provided by the ARIADNEplus Project enables large datasets to be processed and displayed seamlessly and quickly, enabling end users to view and explore in detail:
3D objects ranging from size from a few mm to tens m, e.g. from beads up to statues and friezes.
Large, high resolution images (that can’t usually be easily displayed online), e.g. gigapixel photographs of frescoes, paintings, or any other planar artwork.
Relightable images (RTI and PTM) commonly used on artefacts such as coins.
Sets of images such as a time series or sequence of pages relating to the same item.
The VMS technology provides a full set of interactive tools for the displayed object(s) and can be accessed for free through the ARIADNE Portal. As such, this service provides a valuable tool for the publication, exploration and dissemination of various aspects of Cultural Heritage.
DANS Data Trail Data Management Tools for Archaeologistsariadnenetwork
With the arrival of ARIADNEplus there is a searchable catalogue of datasets that helps archaeological researchers navigate the “maze” of data and archives. Especially for archaeological researchers, support staff and data managers, a set of tools has now been developed that helps in making your data management plan. Hella Holander, Peter Doorn and Paola Ronzino introduced the tools to the participants during the workshop.
The ARIADNEplus online toolset for data management consists of three parts:
a protocol for archaeological data management,
a template for researchers to create a data management plan with archaeological data,
a manual containing all guidelines, recommendations and practical examples of data management.
In just six steps, the protocol takes you through the entire process of making a Data Management Plan (DMP) for archaeological research. By using the templates and the accompanying manual with a clear set of guidelines and advice, it becomes much easier to meet the requirements of organisations that fund research. The DMP is then also in line with standards in the archaeological domain, which ultimately makes the data more findable, accessible, reusable and interoperable (FAIR).
Eaa2021 476 natália botica - from 2_archis to datarepositorium2ariadnenetwork
To promote open science and data reuse, it is necessary to have data available in open repositories that guarantee their accessibility and permanence, while facilitating their reuse.
Data classified as FAIR (Findable, Accessible, Interoperable and Reusable) must follow guidelines that ensure the use of an appropriate metadata scheme, persistent identifiers, well-defined vocabularies, procedures to standardize and improve data quality and sustainable file formats. We will present the methodology used for recording the coin findings from an archaeological excavation carried out by the Archaeology Unit of the University of Minho (UAUM) in the intervention of Casa da Bica, starting with the recording of data in the UAUM's 2ArchIS information system and ending with its availability in the scientific repository "DataRepositóriUM". We will also present some works of visualization and research as examples of the reuse of these data sets, which can be wider when they are integrated in structures of greater visibility like ARIADNE.
On one hand, COVID-19 world pandemic showed the people vulnerability and inability of face-to-face communication and ideas sharing. Through this point of view digital data that is Findable, Accessible, Interoperable and Re-usable (FAIR) showed its added value in even higher extent. On other hand, online communication became a daily routine enabling easier access of all interested parties regardless of their location. The latter helped focusing on particular tasks difficult to accomplish otherwise. The situation in Bulgaria concerning improving state-of-the-art of site and monument dataset “Archaeological Map of Bulgaria” is still in a work process based on online communication with interested participants. Scientists from the National Archaeological Institute with Museum at Bulgarian Academy of Sciences are responsible for all that. Legacy data available beyond local repositories using FAIR principles is a main focus in the development and up-to-date improvement. Sharing the most informative fields metadata and available digital data in ARIADNE portal enabled cleaning other issues in the information system.
This contribution will present digital assets and initiatives at the Museum of Cultural History (MCH), University of Oslo (UiO) and aims at sharing data. The COVID-19 restrictions have elevated the importance of digital assets. At the beginning of this period, metadata for the archaeological collections were, to a large
degree, already digitized and accessible online. This is the result of a national collaboration beginning in the 1990s and continue today in UniMus:Kultur. MCH had also published a map-based overview of all excavations in Eastern/Southern Norway, and
begun to release excavation reports through UiO’s science archive. Recently, focus has shifted towards 3D-documentation of exhibits and publication of existing 3D-models on 3DHOP—available through humgis.uiocloud.no MCH now concentrates on digitizing artefacts at the Viking Ship Museum. The 3D-models
from here will be included in the BItFROST project, which will address the active role of 3D-models in research and education. BItFROST will work on FAIRifcation of 3D-models and promote dialogue with researchers. The 3DHOP platform enables the creation of interactive user-interfaces for researchers and a public audience. Collaboration with DarkLab in Lund, Sweden will create common user-interfaces for Swedish and Norwegian
collections. The project will also utilize AR and VR in the presentation of data.
In addition, the infrastructure project ADED (Archaeological Digital Excavation Documentation) provides open-access to excavations in Norway. The five Norwegian university museums and the Directorate of Cultural Heritage take part in the project.
ADED’s map-based webpages will integrate excavation documentation and the museums’ artefact/photograph databases, making it possible to have an overview and
detailed information of excavations and finds. As part of migrating the data to a common repository, mapping it to CIDOC-CRMarcheo facilitates further mapping to ARIADNEplus and/or other datasets.
Abstracts for the ten presentations at EAA 2021 Session 476: Understanding and expanding capacity in archaeological data management beyond western Europe organised by ARIADNEplus and SEADDA under Theme 3: The new normality of heritage management and museums in post-Covid times on 8th September 2021.
Eaa2021 476 ways and capacity in archaeological data management in serbiaariadnenetwork
Over the past year and due to the COVID-19 pandemic, the entire world has witnessed inequalities across borders and societies. They also include access to archaeological resources, both physical and digital. Both archaeological data creators and users spent a lot of time working from their homes, away from artefact collections and research data. However, this was the perfect moment to understand the importance of making data
freely and openly available, both nationally and internationally.
This is why the authors of this paper chose to make a selection of data bases from various institutions responsible for preservation and protection of cultural heritage, in
order to understand their policies regarding accessibility and usage of the data they keep. This will be done by simple visits to various web-sites or data bases. They intend to check on the volume and content, but also importance of the offered archaeological heritage. In addition, the authors will estimate whether the heritage has adequately been classified and described and also check whether data is available in foreign languages. It needs to be seen whether it is possible to access digital objects (documents and the accompanying metadata), whether access is opened for all users or it requires a certain
hierarchy access, what is the policy of usage, reusage and distribution etc. It remains to be seen whether there are public API or whether it is possible to collect data through API.
In case that there is a public API, one needs to check whether datasets are interoperable or messy, requiring data cleaning.
After having visited a certain number of web-sites, the authors expect to collect enough data to make a satisfactory conclusion about accessibility and usage of Serbian archaeological data web bases.
Eaa2021 476 izeta cattaneo idacordig and suquiaariadnenetwork
The COVID-19 pandemic unleashed during 2020 implied a change in the way of doing archaeology on a global scale. In Argentina, in particular, activities had to move to the
domestic sphere and, most times, the possibility of carrying out fieldwork, material analysis and collection management in the usual workplaces was lost. This practice showed the need for repositories, libraries and online databases that would allow access to archaeological information. Suquía, the institutional repository of IDACOR, has been compiling and disseminating archaeological information since 2016, although it had not
yet developed its capacity to include databases that would allow meta-analysis of the information hosted. So, the needs raised by the lockdown led to implementing an action aimed at incorporating data from 1938 archaeological sites in the Province of Córdoba (Argentina) together with IDACORDIG (an implementation of the Arches software) which links this set to a spatial database, creating a gazetteer of archaeological sites for the region. This integration is the first of its kind in Argentina, and fosters an increase in primary information and grey literature visibility, together with publications preprints and
prints that allow continuity in the study of archaeology on a regional scale. In this presentation we will characterize this process and its technical aspects to aware on the potential of this type of platform for its integration into digital infrastructures of global impact.
Eaa2021 476 preserving historic building documentation pakistanariadnenetwork
Like many countries around the world, Pakistan was forced to go into a COVID-19 national lockdown in March 2020. While this confined most people to their homes, it also had the unintended consequence of catapulting many institutions into embracing going digital. At the National College of Arts (NCA), Pakistan’s oldest art school, this meant embracing online tools and digital resources that had previously been resisted or under utilized in the teaching of art, design, and architecture. The experiences of
lockdown have highlighted inadequacies and inequities within our systems, and as Pakistan returns to normal there is a renewed will to maintain the momentum gained during the pandemic, and an increased realization of the need for developing and sustaining digital infrastructures. The National College of Arts Archives collect and preserve the records, manuscripts, and other artefacts of historical and archaeological
significance at the National College of Arts. From March 2021, the NCA Archives are initiating a project to collect, preserve, and digitize historic building documentation created at the NCA over the past 145 years. This paper will follow this process and
document the NCA Archive’s attempt at creating a Findable, Accessible, Interoperable, and Reusable (FAIR) database of historic building documentation in Pakistan. It will summarize the experiences of the six-month pilot project, including opportunities that have arisen in the aftermath of the Covid-19 pandemic, and in light of the Government of Pakistan’s ongoing Digital Pakistan initiative. The paper will also document and analyze the difficulties and hurdles that might emerge during the course of the project as the NCA Archive’s digital infrastructure is built from the ground up in a post-colonial setting and a post-COVID world.
The COVID-19 pandemic has exacerbated or made more visible many known inequalities across borders and societies. This includes access to archaeological resources, both physical and digital. As both the creators and users of archaeological data adapted to working from their homes, cut off from artefact collections and research data siloed within organisations and institutions, the importance of making data freely and openly
available internationally became even more pronounced. The ARIADNE infrastructure (ariadne-infrastructure.eu) for archaeological data, and the SEADDA COST Action
(seadda.eu) are working to secure the sustainable future of archaeological data across Europe and beyond, in ways that are Findable, Accessible, Interoperable and Re-usable (FAIR). Experience within the ARIADNE partnership during the pandemic was largely positive, with many partners able to carry on as usual with accessing their digital resources, emphasising what is possible, while also emphasising what is not achievable
across archaeology, due to lack of capacity. ARIADNE and SEADDA invite papers discussing the challenges, opportunities and lessons learned across all aspects of archaeological data management during the pandemic, and how it may change and
inform our best practice going forward. We particularly invite papers from outside of Western Europe on how the COVID-19 pandemic created barriers or opportunities for accessing archaeological resources, so that we may better understand capacity building during a post-COVID era.
The Portable Antiquities of the Netherlands (PAN) portal and the data model behind the description of the findings are discussed in detail, and how this approach leads to publishing data that is FAIR .
The Innovation Strategy and Targeted activities report presents the ARIADNEplus innovation strategy, addressing its different dimensions and how each of these will approached.
The main dimensions of the strategy are:
Research policies: Alignment with the European research policies on FAIR data, Open Science practices, and the European Open Science Cloud (EOSC) initiative.
Data integration: Increase of the ARIADNE data pool through incorporation of datasets from more archaeological research domains.
Data infrastructure: Implementation and operation of a Cloud-based platform for data aggregation, integration, discovery, access and use across across institutional and national, as well as disciplinary boundaries.
Service portfolio: Provision of enhanced and new services for digital archaeology on the Cloud-based platform.
Stakeholder and user base: Extension of the stakeholder and user base in Europe and beyond, taking account of user needs regarding data, technical services and training.
The report concludes with the methodology that is being used to evaluate the impact of ARIADNEplus on the wider archaeological community.
The objectives for the ARIADNEplus online survey were to collect information on needs of the ARIADNEplus user community regarding data sharing, access and (re)use, new services (as developed by the project), and related training needs. Results of the ARIADNEplus survey were to be compared, where possible, to those of the ARIADNE 2013 survey (ARIADNE 2014) and, particularly, to planned new technical and other services. Furthermore, the analysis of the results had to focus on the match between the perceived user needs and planned ARIADNEplus services, and suggestions to be provided on activities likely to enable an optimal match.
This presentation provides an insightful view in the process of digitising agenda in Czech archaeology. A cornerstone of this is the Archaeological Information System of the Czech Republic (AIS CR), a national solution for research management, data gathering, curation and presentation. A key component AIS CR is the Archaeological Map of the Czech Republic (AMCR), operational since 2017.
OpenArchaeo is an application to query archaeological data via CIDOC CRM developed by the MASA Consortium (Mémoire des archéologues et des sites archéologiques). This exciting tool allows to query both the MASA triplestore and other sources of archaeological data mapped with the CIDOC CRM and can be used by other interfaces such as the ARIADNE portal.
INRAP is one of the biggest European institutions in charge of unmovable archaeological heritage. Although centralised, INRAP is so big that a lot of diversity in terms of standards and tools existed. Therefore, ARIADNE was very helpful for Kai, Amala and their co-workers to apply some of the ARIADNE’s tools and approaches to INRAP. One of the top achievements of INRAP due to ARIADNE was ‘changing the culture of sharing’.
DANS, the Dutch Data Archiving and Networked Services provides facilities for the deposit and archiving of archaeological data and provide a Trusted Digital Repository. Challenges involved mass ingestion of datasets and making use of thesauri, data mining and Linked Open-Data techniques.
The Swedish national Data Service (SND) were in the original ARIADNE project and learned how to organise and classify their data for both the Portal and their own web service. Able to display map, marker and polygon information now. Use Elasticsearch, AAT and Periodo.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
1. ARIADNE
is
funded
by
the
European
Commission's
Seventh
Framework
Programme
The ARIADNE project
Advanced Research Infrastructure for Archaeological
Data Networking in Europe
Franco Niccolucci – PIN
Project Coordinator
Facing the future, Berlin 21-22 November 2013
2. What
is
ARIADNE
• ARIADNE is a EU-funded project aimed at
integrating the European Research
Infrastructures on archaeological datasets
• Its overall goal is to overcome the
fragmentation of archaeological data
repositories and to foster a culture of
archaeological data sharing and re-using
• Start date: 1st February 2013
• www.ariadne-infrastructure.eu
4. ARIADNE
focus
• ARIADNE
is
focused
on
archaeology
and,
in
general,
on
tangible
heritage
• Its
disciplinary
scope
has
very
specific
methods
and
needs
for
integraGon:
it
deals
with
‘things’
• There
are
common
interests
with
digital
humaniGes
(texts)
and
history,
so
we
are
collaboraGng
with
DARIAH
on
content,
methods
and
tools
• Very
important
overlap
with
conservaGon
and
restoraGon,
so
we
are
establishing
strong
collaboraGons
with
projects
in
this
domain
5. Project activities
• Networking activities
– Community building: involving researchers and institutions in
sharing data and establishing together guidelines
– Standardization and good practices
• Trans-National Access
– Training
– Access to on-line resources
• Research activities
– Knowledge organization
– Data management
– New or improved tools to extract information
– Advances in methodology
6. Progress
in
the
iniGal
nine
months
Networking
• Set
up
users’
needs
survey
and
SIGs
• Collected
detailed
informaGon
on
partners’
datasets
TNA
• Outlined
the
training
program
for
2014
– Summer
schools
7. Progress
in
the
iniGal
nine
months
Joint
Research
• Started
the
design
of
integraGon
– Mapping
metadata
schemas
to
CIDOC-‐CRM
• Working
on
an
extension
of
CIDOC-‐CRM
suitable
for
archaeological
documentaGon
– DraZ
proposal
for
excavaGon
data
in
preparaGon
• CreaGon
of
the
ACDM
(ARIADNE
Catalog
Data
Model)
8. Overview
of
partners’
datasets
DBMS
Structured
data
Collec0ons
MM
Sparse
files
GIS
ZRC-‐SAZU
X
X
OEAW
X
X
DISCOVERY
X
X
X
ARHEO
X
INRAP
X
X
ARUP-‐CAS
X
NIAM-‐BAS
X
SND
X
X
ADS
X
X
X
X
X
X
DANS
X
X
X
X
MIBAC-‐ICCU
X
X
X
X
MNM-‐NOK
X
ATHENA
RTC-‐
CETI
X
AIAC
X
Cyi-‐STARC
X
X
DAI
X
• DBMS:
RDBMS,
MySQL,
Access
• Dataset:
repository
of
digital
objects
with
the
same
structure
• Collec0on:
set
of
text
files/images
stored
in
a
hierarchical
system
• Mul0media:
3D,
images,
videos
• Sparse
files:
mostly
grey
literature,
no
(common)
schema
• GIS:
informaGon
layers,
coordinates
Content
includes:
• ExcavaGon
data
• Monuments
and
sites
• ScienGfic
analyses
9. • 20
countries
• 24
languages
• 1,500,000+
database
records
•
40,000+
grey
literature
files
ARIADNE
in
numbers
1
2
3
4
5
6
DBMS
Structured
data
CollecGons
MM
Sparse
files
GIS
38%
DBMS
20%
Structured
data
(datasets)
9%
CollecGons
9%
MulGmedia
9%
Sparse
files
15%
GIS
10. Metadata
schemas
Eight
partners
(SND,
KNAW-‐DANS,
DISCOVERY,
MiBAC-‐ICCU,
INRAP,
ADS,
CYI-‐
STARC)
have
adopted
formal
metadata
standards
for
their
datasets.
The
metadata
standards
reported
are:
• DDI,
DataCite,
MARC/UNIMARC,
TriDAS,
Dublin
Core
applicaGon
profiles,
INSPIRE,
ISO
11915,
CARARE,
LIDO,
CIDOC-‐CRM.
Ten
partners
(ZRC
SAZU,
MiBAC-‐ICCU,
ADS,
AIAC,
MNM-‐NOK,
CYI-‐STARC,
ARUP-‐CAS,
ATHENA
RC,
NIAM-‐BAS)
have
developed
proprietary
metadata
schemas
for
some
of
their
datasets.
However,
all
these
can
be
mapped
onto
CIDOC-‐CRM.
Four
partners
(DISCOVERY,
INRAP,
ARHEO,
OAEW)
reported
some
datasets
for
which
a
metadata
definiGon
is
not
currently
available
but
could
be
derived
from
the
database
structure.
11. ARIADNE
Users
Framework
SND
Other…
eDNA
Data
Centers
(naGonal)
Domain/Subject-‐based
Repositories
&
Portals
(nat./internaGonal)
InsGtuGonal
Repositories
/
DBs
Research
Projects
Domain/Subject-‐based
Referatories
&
Portals
(nat./internaGonal)
ARIADNE
e-‐Infrastructure
and
Integrated
Services
LOD
cloud
diagram,
by
R.
Cyganiak
&
A.
Jentzsch,
hop://lod-‐cloud.net
ADS
FasG
Online
…
ARIADNE
Registry
Datasets,
metadata
schema,
KOSs,
mappings,…,
CRM
Several
ARIADNE
partners
Meta/data
records
=>
collect/transform2RDF
ARIADNE
Portal
Cross-‐search
and
other
applicaGons
L1:
Projects,
research
groups
L2:
Ins6tu6ons,
ins6tu6onal
repositories/DBs
L3:
Higher-‐
level
data
managers
L4:
infra
&
integrated
services
…
…
ARACHNE
accData
Search,
browse,
…
Legend:
dep=deposit,
coll=collect,
acc=access;
D=
Data,
M=Metadata
collM
SND
Other…
eDNA
Data
Centers
(nat./int.,
archaeology
or
general)
Subject/Domain-‐based
Repositories
&
Portals
(nat./internaGonal)
InsGtuGonal
Repositories
/
DBs
Research
Projects
Subject/Domain-‐based
Referatories
&
Portals
(nat./internaGonal)
ARIADNE
Infrastructure
and
Integrated
Services
ADS
FasG
Online
…
ARIADNE
Registry
Datasets,
metadata
schema,
KOSs,
mappings,…,
CRM
Several
ARIADNE
partners
Meta/data
records
=>
collect/transform
to/RDF
ARIADNE
Portal
Cross-‐search
and
other
applicaGons
…
…
ARACHNE
accData
Other
Infra
&
Services
ARIADNE
aggregators
(nat./
themaGc)?
LOD
Cloud
LOD
Cloud
collM
collM
collM
collM
depD+M
depD+M
Interoperability
Framework
12. The
ACDM
• The
ARIADNE
Catalog
Data
Model
(ACDM)
aims
at
describing
datasets,
services,
and
resources
in
the
archaeological
domain
– So
far
it
addresses
databases,
collecGons,
thesauri,
with
plans
for
covering
all
relevant
resource
types
• Based
on
DCAT
(W3C
recomm.)
and
other
widespread
ontologies
• Data
collected
into
a
registry,
using
a
data
acquisiGon
tool
• To
be
used
internally
to
support
integraGon
design
• Conceived
to
became
a
publicly
available
service
and
to
be
offered/
extended
to
other
domains
(e.g.
conservaGon/restoraGon)
13. ACDM
Model
dcat:Catalog
The catalog of the
ARIADNE datasets is
an instance of this class
dcat:Dataset dcat:Distribution
Represents an accessible form of a
dataset as for example a
downloadable file, an RSS feed or a
web service that provides the data.
*1 dcat:dataset *1 dcat:distribution
:numOfRecords
:OAI-PMHServerURI:
:platformDescription
:ARIADNEDistribution
skos:Concept
dcat:theme
:accessPolicy
:ARIADNEDataset
:simpleOrComplex?
:standardUsed:
:proprietaryFormatDesc:
:mappingTo:
:characterSet
:XSDAvailable?
ArchaeologicalRecordStr
ucture
An instance of this class represents an ARIADNE
dataset, independent of any specific distribution of
the dataset. A dataset is defined as a set of
archaeological records having all the same structure
An instance of this class
describes the structure of the
archaeological records of the
associated dataset
:hasRecordStructure
*
1
dct:Description
dct:Format [MIME]
dct:Identifier
:IdentifierType
:persistentIdentifier?
:DigitalObject
:hasAttachedObject
*
1
An instance of this
class describes a
type of digital object
An instance of this
association captures the
fact that the archaeological
record of a dataset may
have attached digital objects
:standardUsed:
:proprietaryFormatDesc:
:mappingTo:
:characterSet
:XSDAvailable?
MetadataRecordStructure
An instance of this class
describes the structure of the
metadata records of the
associated dataset
Represents an accessible form of a
dataset as for example a
downloadable file, an RSS feed or a
web service that provides the data.
dct:description
dct:identifier
dct:format
Vocabulary
:hasMetadataStructure
:usesVocabulary:usesVocabulary
*
0
*
*
*
*
An instance of this class
represents a vocabulary or
authority file, used in the
associated structure
*
:uri
AttachedDocuments
* 1
foaf:Agent
Documents useful to understand the
complexity of database made available to
ARIADNE. Provides: 1) database schema,
2) sample of records for each database.
*
1
dct:publisher
*
dct:publisher
*
1
:hasSimpleDigitalType
*
1
An instance of this association
captures the fact that the
archaeological record has a
simple structure and every record
is a digital object of the same type
:ARIADNEService
:applyTo
*
0
This class is described in a
separate figure
14. ARIADNE
Required
Services
• Services
for
resource
discovery
• Services
integraGng
datasets
on
a
geographic
base,
enabling
access
according
to
geographic
criteria
• Services
integraGng
datasets
on
a
temporal
base,
enabling
access
according
to
temporal
criteria
and
using
a
complex
Gmeline
• Services
implemenGng
interoperability
among
different
datasets
• Metadata
management
(semanGc
repositories
with
advanced
query
and
retrieval
funcGonaliGes)
• Repository
infrastructures
and
ingesGon
services
• VisualizaGon
services
for
– Images
– Videos
– 3D
Single
Objects
&
Large
Scenes
• Services
will
re-‐use,
as
far
as
possible,
concepts/tools
created
in
other
projects
18. Mapping
Archaeological Object
E22 Man-made Object
Inscription (ISR)
E34 Inscription
p128 carries
Object Name (OGTN)
Object Title (SGTT)
E35 Title
P102 has title
Materials (MTC)
E52 Material
P45 consists of
Object Dimensions
E54 Dimension
P43 has dimension
[Open Vocabulary]
Object "Position"
E46 Section Definition
P58 defines section
"Olpe Chigi"
"Foot"
P1 is identified by
OGTP
E41 Appellation
"Apollo del Belvedere"
P90 has value
MISA, MISL ...
E60 Number
Height
Width
Length
P91 has unit
MISU
E58 Measurement Unit
[Closed Vocabulary]
P3 has note
MISV
E62 String
Stamps, Badges,
Emblemes
E25 Mark
p128 carries
19. Challenges
• Languages
– Most
data
are
(and
must
remain)
in
their
original
languages
• TradiGons
– The
way
data
are
understood
and
captured
are
different
according
to
schools
and
scienGfic
tradiGons
• FragmentaGon
– There
is
a
mulGtude
of
micro-‐archives
that
contain
valuable
informaGon
and
are
at
risk
of
loss
• Technology
– Advances
in
the
use
of
technology
(e.g.
3D,
scienGfic
analyses)
are
creaGng
‘big
data’
archives,
not
properly
managed
with
the
currently
available
tools
• VirtualizaGon
– Virtual
tools
need
to
be
fully
integrated
in
the
research
methodology
of
the
discipline
20. ARIADNE is a project funded by the European Commission under the
Community’s Seventh Framework Programme, contract no. FP7-
INFRASTRUCTURES-2012-1-313193.
The views and opinions expressed in this presentation are the sole
responsibility of the authors and do not necessarily reflect the views
of the European Commission.
Contact: niccolucci@unifi.it
www.ariadne-infrastructure.eu