Tutorial on Hybrid Data Infrastructures: D4Science as a case studyBlue BRIDGE
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations allowing scientists residing at distant places to collaborate. They may offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction. E-Infrastructures can have different implementations (Andronico et al 2011). A major distinction is between (i) Data e-Infrastructures, i.e. digital infrastructures promoting data sharing and consumption to a community of practice (e.g. MyOcean, Blanc 2008) and (ii) Computational e-Infrastructures, which support the processes required by a community of practice using GRID and Cloud computing facilities (e.g. Candela et al. 2013). A more recent type of e-Infrastructure is the Hybrid Data Infrastructure (HDI) (Candela et al. 2010), i.e. a Data and Computational e-Infrastructure that adopts a delivery model for data management, in which computing, storage, data and software are made available as-a-Service. HDIs support, for example, data transfer, data harmonization and data processing workflows. Hybrid Data e-Infrastructures have already been used in several European and international projects (e.g. i-Marine 2011; EuBrazil OpenBio 2011) and their exploitation is growing fast supporting new projects and initiatives, e.g. Parthenos, Ariadne, Descramble.
A particular HDI, named D4Science (Candela et al. 2009), has been used by communities of practice in the fields of biodiversity conservation, geothermal energy monitoring, fisheries management, and culture heritage. This e-Infrastructure hosts models and resources by several international organizations involved in these fields. Its capabilities help scientists to access and manage data, reuse data and models, obtain results in short time and share these results with other colleagues.
Workshop about research data archiving and open access publishing at the Rese...Dag Endresen
The Research Council of Norway (RCN) organizes a workshop on 1st November 2016 to collect experiences on research data archiving and open access data publishing. The Norwegian GBIF-node will present the GBIF framework including dataset DOIs and download DOIs.
See also:
GBIF.no (2016), http://www.gbif.no/news/2016/data-archiving-ncr.html
GBIF GB21 (2014), http://www.gbif.org/newsroom/news/gb21-science-symposium
GBIF GB21 Slides, http://www.gbif.org/resource/81918
Vimeo video (2014), https://vimeo.com/107148220#t=6m28s
De-centralized but global: Redesigning biodiversity data aggregation for impr...taxonbytes
Biodiversity data pose fundamental challenges for unification-based paradigms of data science. In particular, a hierarchical, backbone-driven approach to aggregating global biodiversity data tends to limit community engagement. Data quality, trust, fitness for use, and impact are similarly reduced. This presentation will outline an alternative, de-centralized design for aggregating biodiversity data globally. The design requires a coordinative approach to representing and reconciling evolving systematic perspectives, and further social but technologically mediated coordination between regionally and taxonomically constrained "communities of practice" (sensu Wenger, 2000, https://doi.org/10.1177/135050840072002). Important next steps in this direction include the development of use cases that quantify the benefits of a de-centralized biodiversity data aggregation - in terms of lowering costs to expert engagement, raising efficiency of curation, validating novel integration services, and improving reproducibility and provenance tracking across heterogenous data structures and portals.
Web service technologies, at CGIAR ICT-KM workshop in Rome (2005)Dag Endresen
Presentation of web services for the CGIAR ICT-KM training workshop on information interoperability, 13th June 2005, at IPGRI Rome Italy. Dag Endresen (Nordic Gene Bank).
GlobusWorld 2021: Arecibo Observatory Data MovementGlobus
The story of how Globus helped move petabytes of data from the Arecibo Observatory to TACC, and thereby save 50+ years of data for posterity and future research. Presented at the GlobusWorld 2021 conference by George Robb III.
Session 02, Introduction to the 2015 Data Publishing Landscape at the GB22 No...Alberto González-Talaván
This presentation describes the evolving scenario of biodiversity data publishing as it is in 2015. It was first presented in the training event for GBIF Participant nodes part of the 22nd meeting of the GBIF Governing Board.
Slide deck developed and presented by L. Russell (Vertnet)
Tutorial on Hybrid Data Infrastructures: D4Science as a case studyBlue BRIDGE
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations allowing scientists residing at distant places to collaborate. They may offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction. E-Infrastructures can have different implementations (Andronico et al 2011). A major distinction is between (i) Data e-Infrastructures, i.e. digital infrastructures promoting data sharing and consumption to a community of practice (e.g. MyOcean, Blanc 2008) and (ii) Computational e-Infrastructures, which support the processes required by a community of practice using GRID and Cloud computing facilities (e.g. Candela et al. 2013). A more recent type of e-Infrastructure is the Hybrid Data Infrastructure (HDI) (Candela et al. 2010), i.e. a Data and Computational e-Infrastructure that adopts a delivery model for data management, in which computing, storage, data and software are made available as-a-Service. HDIs support, for example, data transfer, data harmonization and data processing workflows. Hybrid Data e-Infrastructures have already been used in several European and international projects (e.g. i-Marine 2011; EuBrazil OpenBio 2011) and their exploitation is growing fast supporting new projects and initiatives, e.g. Parthenos, Ariadne, Descramble.
A particular HDI, named D4Science (Candela et al. 2009), has been used by communities of practice in the fields of biodiversity conservation, geothermal energy monitoring, fisheries management, and culture heritage. This e-Infrastructure hosts models and resources by several international organizations involved in these fields. Its capabilities help scientists to access and manage data, reuse data and models, obtain results in short time and share these results with other colleagues.
Workshop about research data archiving and open access publishing at the Rese...Dag Endresen
The Research Council of Norway (RCN) organizes a workshop on 1st November 2016 to collect experiences on research data archiving and open access data publishing. The Norwegian GBIF-node will present the GBIF framework including dataset DOIs and download DOIs.
See also:
GBIF.no (2016), http://www.gbif.no/news/2016/data-archiving-ncr.html
GBIF GB21 (2014), http://www.gbif.org/newsroom/news/gb21-science-symposium
GBIF GB21 Slides, http://www.gbif.org/resource/81918
Vimeo video (2014), https://vimeo.com/107148220#t=6m28s
De-centralized but global: Redesigning biodiversity data aggregation for impr...taxonbytes
Biodiversity data pose fundamental challenges for unification-based paradigms of data science. In particular, a hierarchical, backbone-driven approach to aggregating global biodiversity data tends to limit community engagement. Data quality, trust, fitness for use, and impact are similarly reduced. This presentation will outline an alternative, de-centralized design for aggregating biodiversity data globally. The design requires a coordinative approach to representing and reconciling evolving systematic perspectives, and further social but technologically mediated coordination between regionally and taxonomically constrained "communities of practice" (sensu Wenger, 2000, https://doi.org/10.1177/135050840072002). Important next steps in this direction include the development of use cases that quantify the benefits of a de-centralized biodiversity data aggregation - in terms of lowering costs to expert engagement, raising efficiency of curation, validating novel integration services, and improving reproducibility and provenance tracking across heterogenous data structures and portals.
Web service technologies, at CGIAR ICT-KM workshop in Rome (2005)Dag Endresen
Presentation of web services for the CGIAR ICT-KM training workshop on information interoperability, 13th June 2005, at IPGRI Rome Italy. Dag Endresen (Nordic Gene Bank).
GlobusWorld 2021: Arecibo Observatory Data MovementGlobus
The story of how Globus helped move petabytes of data from the Arecibo Observatory to TACC, and thereby save 50+ years of data for posterity and future research. Presented at the GlobusWorld 2021 conference by George Robb III.
Session 02, Introduction to the 2015 Data Publishing Landscape at the GB22 No...Alberto González-Talaván
This presentation describes the evolving scenario of biodiversity data publishing as it is in 2015. It was first presented in the training event for GBIF Participant nodes part of the 22nd meeting of the GBIF Governing Board.
Slide deck developed and presented by L. Russell (Vertnet)
Open Science: how to serve the needs of the researcher? Carole Goble
Open science Jisc CNI roundtable 2018
Lightning talk
What should the future look like?
What are the essential characteristics we desire in a relatively near future system to support scholarly communication across the full research life cycle?
What are the key areas requiring attention, action, or investment today to reach the future that we want to reach?
What are the best opportunities to build upon existing practices, investments and infrastructure, both
open and commercially provided?
Where must alternatives be developed?
What areas are already on good trajectories and can be left to evolve without additional intervention
About the Virtual Conference
With the expansion of digital data collection and the increased expectations of data sharing, researchers are turning to their libraries or institutional repositories as a place to store and preserve that data. Many institutions have created such data management services and see the data curation role as a growing and important element of their service portfolio. While some of the experience in managing other types of digital resources is transferrable, the management of large-scale scientific data has many special requirements and challenges. From metadata collection and cataloging data sources, to identification, discovery, and preservation, best practices and standards are still in their infancy.
This Virtual Conference will explore in greater depth than traditional webinars some of the practical lessons from those who have implemented data management and developed best practices, as well as provide some insight into the evolving issues the community faces. It will include discussions related to certification of trusted repositories, provenance and identification issues around data, data citation, preservation, and the work of several repository networks to advance distribution of scientific information.
IDB-Cloud Providing Bioinformatics Services on Cloudstratuslab
A presentation of IDB (Infrastructure Distributed for Biology) using StratusLab technology by Christophe Blanchet and Clément Gauthey at Lille, France, May 2013.
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataASIS&T
Lorrie Johnson, U.S. Department of Energy/Office of Science and Technical Information: “Facilitating Access to Scientific Data: The DataCite, Science.gov, and WorldWideScience.org Initiatives”
Panel: Linked data and metadata (co-sponsored by the ASIS&T Digital Libraries SIG)
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Phidias: Steps forward in detection and identification of anomalous atmospher...Phidias
PHIDIAS is organised a webinar entitled "Steps forward in detection and identification of anomalous atmospheric events" held on 13 October 2020 at 15:00 CEST in collaboration with ESCAPE project. The webinar aimed at showcasing how PHIDIAS is going to improve the usage of HPC and high performance data management services for the development of intelligent screening approaches for the exploitation of large amounts of satellite atmospheric data in an operational context.
This is one out of a series of presentations which I have given during a recent trip to the United States. I will make them all public, but content does not vary a lot between some of them
Poster delivered by Robin Rice at the Open Repositories 2016 conference. Covers:
* Creating a data management plan
* Storing data
* Synchronising data
* Finding and analysing data
* Training
* Online training
* Support
* Sharing open data
* Archiving data
* Recording datasets using PURE
Federation and Interoperability in the Nectar Research CloudOpenStack
Audience Level
Beginner
Synopsis
The Nectar Research Cloud provides an OpenStack cloud for Australia’s academic researchers. Since its inception in 2012 it has grown steadily to over 30,000 CPUs, with over 10,000 registered users from more than 50 research institutions. It is different to many clouds in being a federation across eight organisations, each of which runs cloud infrastructure in one or more data centres and contributes to a distributed help desk and user support. A Nectar core services team runs centralised cloud services. This presentation will give an overview of the experiences, challenges and benefits of running a federated OpenStack cloud and a short demonstration on using the Nectar cloud. We will also describe some current approaches that are looking to extend this federation to encompass other institutions including some in New Zealand, to extend the infrastructure using commercial cloud providers, and to move towards interoperability with the growing number of international science and research clouds through the new Open Research Cloud initiative.
Speaker Bio
Dr Paul Coddington is a Deputy Director of Nectar, responsible for the Nectar national Research Cloud, and also Deputy Director of eResearch SA. He has over 30 years experience in eResearch including computational science, high performance and distributed computing, cloud computing, software development, and research data management.
I am text block. Click edit button to change this text. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.
Creación de una nueva categoría del longboard: el oddboard. Portfolio planner...Alba Lucena Zaher
Lanzar y conceptualizar el producto Star Rover, un
longboard con un mecanismo de 8 ruedas. Poner el foco e
identificar targets que puedan estar interesados. Y crear un
Naming y un concepto creativo. Nos dirigimos a personas peculiares con algún tipo de rareza, aunque al fin y al cabo todos somos un poco así. Invención de la categoría del oddboard.
Open Science: how to serve the needs of the researcher? Carole Goble
Open science Jisc CNI roundtable 2018
Lightning talk
What should the future look like?
What are the essential characteristics we desire in a relatively near future system to support scholarly communication across the full research life cycle?
What are the key areas requiring attention, action, or investment today to reach the future that we want to reach?
What are the best opportunities to build upon existing practices, investments and infrastructure, both
open and commercially provided?
Where must alternatives be developed?
What areas are already on good trajectories and can be left to evolve without additional intervention
About the Virtual Conference
With the expansion of digital data collection and the increased expectations of data sharing, researchers are turning to their libraries or institutional repositories as a place to store and preserve that data. Many institutions have created such data management services and see the data curation role as a growing and important element of their service portfolio. While some of the experience in managing other types of digital resources is transferrable, the management of large-scale scientific data has many special requirements and challenges. From metadata collection and cataloging data sources, to identification, discovery, and preservation, best practices and standards are still in their infancy.
This Virtual Conference will explore in greater depth than traditional webinars some of the practical lessons from those who have implemented data management and developed best practices, as well as provide some insight into the evolving issues the community faces. It will include discussions related to certification of trusted repositories, provenance and identification issues around data, data citation, preservation, and the work of several repository networks to advance distribution of scientific information.
IDB-Cloud Providing Bioinformatics Services on Cloudstratuslab
A presentation of IDB (Infrastructure Distributed for Biology) using StratusLab technology by Christophe Blanchet and Clément Gauthey at Lille, France, May 2013.
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataASIS&T
Lorrie Johnson, U.S. Department of Energy/Office of Science and Technical Information: “Facilitating Access to Scientific Data: The DataCite, Science.gov, and WorldWideScience.org Initiatives”
Panel: Linked data and metadata (co-sponsored by the ASIS&T Digital Libraries SIG)
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Phidias: Steps forward in detection and identification of anomalous atmospher...Phidias
PHIDIAS is organised a webinar entitled "Steps forward in detection and identification of anomalous atmospheric events" held on 13 October 2020 at 15:00 CEST in collaboration with ESCAPE project. The webinar aimed at showcasing how PHIDIAS is going to improve the usage of HPC and high performance data management services for the development of intelligent screening approaches for the exploitation of large amounts of satellite atmospheric data in an operational context.
This is one out of a series of presentations which I have given during a recent trip to the United States. I will make them all public, but content does not vary a lot between some of them
Poster delivered by Robin Rice at the Open Repositories 2016 conference. Covers:
* Creating a data management plan
* Storing data
* Synchronising data
* Finding and analysing data
* Training
* Online training
* Support
* Sharing open data
* Archiving data
* Recording datasets using PURE
Federation and Interoperability in the Nectar Research CloudOpenStack
Audience Level
Beginner
Synopsis
The Nectar Research Cloud provides an OpenStack cloud for Australia’s academic researchers. Since its inception in 2012 it has grown steadily to over 30,000 CPUs, with over 10,000 registered users from more than 50 research institutions. It is different to many clouds in being a federation across eight organisations, each of which runs cloud infrastructure in one or more data centres and contributes to a distributed help desk and user support. A Nectar core services team runs centralised cloud services. This presentation will give an overview of the experiences, challenges and benefits of running a federated OpenStack cloud and a short demonstration on using the Nectar cloud. We will also describe some current approaches that are looking to extend this federation to encompass other institutions including some in New Zealand, to extend the infrastructure using commercial cloud providers, and to move towards interoperability with the growing number of international science and research clouds through the new Open Research Cloud initiative.
Speaker Bio
Dr Paul Coddington is a Deputy Director of Nectar, responsible for the Nectar national Research Cloud, and also Deputy Director of eResearch SA. He has over 30 years experience in eResearch including computational science, high performance and distributed computing, cloud computing, software development, and research data management.
I am text block. Click edit button to change this text. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.
Creación de una nueva categoría del longboard: el oddboard. Portfolio planner...Alba Lucena Zaher
Lanzar y conceptualizar el producto Star Rover, un
longboard con un mecanismo de 8 ruedas. Poner el foco e
identificar targets que puedan estar interesados. Y crear un
Naming y un concepto creativo. Nos dirigimos a personas peculiares con algún tipo de rareza, aunque al fin y al cabo todos somos un poco así. Invención de la categoría del oddboard.
Un accord relatif aux garanties de rémunération effective (GRE), aux rémunérations minimales hiérarchiques et à la prime de vacances a été signé dans la CC Métallurgie de l'Aisne.
L'accord fixe un barème pour les GRE, pour l'année 2016.
La valeur du point est fixée à 5,36 euros à compter du 1er août 2016.
Une prime de vacances fixée à 540 euros est instituée à compter de l'année 2016.
L'organisme patronal signataire est l'Union des industries et métiers de la métallurgie de l'Aisne. Les syndicats de salariés signataires de l'accord sont la CFDT, la CTC; FO et la CFE/CGC.
The BlueBRIDGE approach to collaborative researchBlue BRIDGE
Gianpaolo Coro, ISTI-CNR, at BlueBRIDGE workshop on "Data Management services to support stock assessement", held during the Annual ICES Science conference 2016
Virtual Research Environments supporting tailor-made data management service...Blue BRIDGE
Presented by Pasquale Pagano of CNR at the BlueBRIDGE Workshop at SeaTech Week 2016 in Brest, France. http://www.bluebridge-vres.eu/events/join-bluebridge-10th-biennial-sea-tech-week-brest-france
On 29 January 2020 ARCHIVER launched its Request for Tender with the purpose to award several Framework Agreements and work orders for the provision of R&D for hybrid end-to-end archival and preservation services that meet the innovation challenges of European Research communities, in the context of the European Open Science Cloud.
The tender was closed on 28 April 2020 and 15 R&D bids were submitted, with consortia that included 43 companies and organisations. The best bids have been selected and will start the first phase of the ARCHIVER R&D (Solution Design) in June 2020.
On Monday 8 June the selected consortia for the ARCHIVER design phase have been announced during a Public Award Ceremony starting at 14.00 CEST.
In light of the COVID-19 outbreak and the and consequent movement restrictions imposed in several countries, the event has been organised as a webinar, virtually hosted by Port d’Informació Científica (PIC), a member of the Buyers Group of the ARCHIVER consortium.
The Kick-off marks the beginning of the Solution Design Phase.
The AGINFRA+ Virtual Research Environment (VRE)AGINFRA
Massimiliano Assante from CNR on The AGINFRA+ Virtual Research Environment (VRE).
Joint Workshop on Food Risk Assessment Research & Practice
24th November 2017, Wageningen University & Research, Netherlands
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
Heinz Pampel | GFZ German Research Centre for Geosciences, LIS
Maxi Kindling | Humboldt-Universität zu Berlin, Berlin School of Library and Information Science Frank Scholze | Karlsruhe Institute of Technology, KIT Library
RDA-Deutschland-Treffen 2015| Potsdam, November 26, 2015
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
https://datascience.nih.gov/news/march-data-sharing-and-reuse-seminar 11 March 2022
Starting in 2023, the US National Institutes of Health (NIH) will require institutes and researchers receiving funding to include a Data Management Plan (DMP) in their grant applications, including the making their data publicly available. Similar mandates are already in place in Europe, for example a DMP is mandatory in Horizon Europe projects involving data.
Policy is one thing - practice is quite another. How do we provide the necessary information, guidance and advice for our bioscientists, researchers, data stewards and project managers? There are numerous repositories and standards. Which is best? What are the challenges at each step of the data lifecycle? How should different types of data? What tools are available? Research Data Management advice is often too general to be useful and specific information is fragmented and hard to find.
ELIXIR, the pan-national European Research Infrastructure for Life Science data, aims to enable research projects to operate “FAIR data first”. ELIXIR supports researchers across their whole RDM lifecycle, navigating the complexity of a data ecosystem that bridges from local cyberinfrastructures to pan-national archives and across bio-domains.
The ELIXIR RDMkit (https://rdmkit.elixir-europe.org (link is external)) is a toolkit built by the biosciences community, for the biosciences community to provide the RDM information they need. It is a framework for advice and best practice for RDM and acts as a hub of RDM information, with links to tool registries, training materials, standards, and databases, and to services that offer deeper knowledge for DMP planning and FAIR-ification practices.
Launched in March 2021, over 120 contributors have provided nearly 100 pages of content and links to more than 300 tools. Content covers the data lifecycle and specialized domains in biology, national considerations and examples of “tool assemblies” developed to support RDM. It has been accessed by over 123 countries, and the top of the access list is … the United States.
The RDMkit is already a recommended resource of the European Commission. The platform, editorial, and contributor methods helped build a specialized sister toolkit for infectious diseases as part of the recently launched BY-COVID project. The toolkit’s platform is the simplest we could manage - built on plain GitHub - and the whole development and contribution approach tailored to be as lightweight and sustainable as possible.
In this talk, Carole and Frederik will present the RDMkit; aims and context, content, community management, how folks can contribute, and our future plans and potential prospects for trans-Atlantic cooperation.
Data policy must be partnered with data practice. Our researchers need to be the best informed in order to meet these new data management and data sharing mandates.
Approach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) projectAlex Hardisty
Describes what we set out to do, what we achieved, and some of the lessons learnt during the BioVeL project. This presentation was given at the BioVeL final event "BioVeL In Practice and In Future", Paris, 13th November 2014
Researchers require infrastructures that ensure a maximum of accessibility, stability and reliability to facilitate working with and sharing of research data. Such infrastructures are being increasingly summarised under the term Research Data Repositories (RDR). The project re3data.org – Registry of Research Data Repositories – began to index research data repositories in 2012 and offers researchers, funding organisations, libraries and publishers an overview of the heterogeneous research data repository landscape. In December 2014 re3data.org listed more than 1,030 research data repositories, which are described in detail using the re3data.org schema (http://dx.doi.org/10.2312/re3.003). Information icons help researchers to identify easily an adequate repository for the storage and reuse of their data. This talk describes the heterogeneous RDR landscape and presents a typology of institutional, disciplinary, multidisciplinary and project-specific RDR. Further, it outlines the features of re3data. org and it shows current developments for integration into data management planning tools and other services.
By the end of 2015 re3data.org and Databib (Purdue University, USA) will merge their services, which will then be managed under the auspices of DataCite. The aim of this merger is to reduce duplication of effort and to serve the research community better with a single, sustainable registry of research data repositories. The talk will present this organisational development as a best practice example for the development of international research information services.
Hybrid Cloud storage deployment models: ARCHIVER presentation at the CS3 Work...Archiver
This presentation explored the use-cases driving ARCHIVER at the audience gathered at INFN, in Rome, Italy, for the Open Data Ecosystem and CS3 conference, 27-29 January 2020
Similar to Bridging Environmental Data Providers and SeaDataNet DIVA Service within a Collaborative and Distributed e-Infrastructure (20)
Managing tuna fisheries data at a global scale: the Tuna Atlas VREBlue BRIDGE
On 18th January 2018, 3pm CET BlueBRIDGE will hosted the webinar: "Managing tuna fisheries data at a global scale: the Tuna Atlas VRE" that presented how, through the Tuna Atlas Virtual Research Environment (VRE), users can easily produce their own datasets of fisheries at regional, multi-regional or global scale and how they can share these datasets in ways that allow other users to access, process and visualise them efficiently.