Shaun will give an overview to the EUDAT service suite, explaining the key function and role of the different B2 services and how they interconnect. Examples will be given of how each service has been used by communities to explain the nature and scale of the service provision. We show how the B2 Service Suite can be linked to the data lifecycle and the role of each component in any data management planning. By the end of this talk, users should have a good overview of each of the B2 services and how they do, or will, fit together, and how they can be used as a part of a coherent data management plan.
Visit https://eudat.eu/eudat-summer-school
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)EUDAT
EUDAT and PRACE joined forces to help research communities gain access to high quality managed e-Infrastructures whose resources can be connected together to enable cross-utilization use cases and make them accessible without any technical barrier. The capability to couple data and compute resources together is considered one of the key factors to accelerate scientific innovation and advance research frontiers. The goal of this session was to present the EUDAT services, the results of the collaboration activity achieved so far and delivers a hands-on on how to write a Data Management Plan or DMP. The DMP is a useful instrument for researchers to reflect on and communicate about the way they will deal with their data. It prompts them to think about how they will generate, analyse and share data during their research project and afterwards.
Visit: https://www.eudat.eu/eudat-summer-school
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)EUDAT
Yann will introduce the notion of data life cycles (DLCs) as an overarching framework for the workshop. This presentation will explain the key activities and roles identified by EUDAT and undertaken by researchers and data service providers in the process of creating, analysing, managing, sharing and archiving research data. It will highlight how the EUDAT service suite addresses this data lifecycle to support researchers with their key data requirements. He will then present the current research work undertaken in EUDAT to model community specific DLCs, the relation with the concept of provenance and the prototype services being currently developed to bridge the identified gaps in DLC coverage.
Visit https://eudat.eu/eudat-summer-school
Long-term data curation, aka data preservation - EUDAT Summer School (Marjan ...EUDAT
Marjan will give an overview of the role of data archives in ensuring the safe stewardship and preservation of data over time. She will explain what it means to be a Trustworthy Digital Repository and the associated policies and processes that need to be in place to ensure data provenance and authenticity. This session will link to Monday’s exploration of the re3data.org portal
Visit: https://www.eudat.eu/eudat-summer-school
Data Discoverability and Persistent Identifiers - EUDAT Summer School (Chris...EUDAT
We will introduce the concept of persistent identifiers. We will explain how PIDs can be used, which PID systems exist and which use cases they are fit for. The use cases highlight that PIDs are a vital technology to enable FAIR data. The focus will lie on gathering hands-on experience with the Handle system. Participants will mint PIDs, i.e. not only create a resolvable PID but will also learn how to add, alter and delete metadata in the PID entry by employing the handle API directly and EUDAT’s B2HANDLE library.
Visit: https://www.eudat.eu/eudat-summer-school
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)EUDAT
Shaun will explain the importance of metadata for data discovery, provenance, reproducibility and reuse. Without sufficient metadata and documentation, research data cannot be found or understood. Providing this contextual information is critical for data to be FAIR. The topics of metadata ontologies and folksonomies are also discussed. This talk aims at giving the participants an understanding of the importance of metadata for both collaborative research and to ensure the usefulness of the data into the future, as well as an idea of what makes ‘good’ metadata.
Visit https://eudat.eu/eudat-summer-school
EUDAT Summer School Welcome - EUDAT Summer School (Yannis Tzitzikas, FORTH-ICS)EUDAT
Yannis will welcome the participants of the summer school on behalf of FORTH and will say a few words about the research center, its institutes and the hosting lab.
Visit https://eudat.eu/eudat-summer-school
Presentation given by Chris Higgens at the Annual Infrastructure for Spatial Information in European (INSPIRE) Conference Krakow, Poland. 22 June 2010.
1) The University of Edinburgh drafted an 18-month Research Data Management Roadmap in August 2012 to address institutional research data management and comply with their RDM policy.
2) The Roadmap outlines governance, data management planning support, development of an active data infrastructure including a data store, and data stewardship services such as a data repository and registry.
3) Services under the Roadmap include tailored data management plan assistance, customizing an online DMP tool, infrastructure for storing and accessing research data, and a data repository for depositing and long-term management of completed research outputs.
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)EUDAT
EUDAT and PRACE joined forces to help research communities gain access to high quality managed e-Infrastructures whose resources can be connected together to enable cross-utilization use cases and make them accessible without any technical barrier. The capability to couple data and compute resources together is considered one of the key factors to accelerate scientific innovation and advance research frontiers. The goal of this session was to present the EUDAT services, the results of the collaboration activity achieved so far and delivers a hands-on on how to write a Data Management Plan or DMP. The DMP is a useful instrument for researchers to reflect on and communicate about the way they will deal with their data. It prompts them to think about how they will generate, analyse and share data during their research project and afterwards.
Visit: https://www.eudat.eu/eudat-summer-school
The Data Lifecycle - EUDAT Summer School (Yann Le Franc)EUDAT
Yann will introduce the notion of data life cycles (DLCs) as an overarching framework for the workshop. This presentation will explain the key activities and roles identified by EUDAT and undertaken by researchers and data service providers in the process of creating, analysing, managing, sharing and archiving research data. It will highlight how the EUDAT service suite addresses this data lifecycle to support researchers with their key data requirements. He will then present the current research work undertaken in EUDAT to model community specific DLCs, the relation with the concept of provenance and the prototype services being currently developed to bridge the identified gaps in DLC coverage.
Visit https://eudat.eu/eudat-summer-school
Long-term data curation, aka data preservation - EUDAT Summer School (Marjan ...EUDAT
Marjan will give an overview of the role of data archives in ensuring the safe stewardship and preservation of data over time. She will explain what it means to be a Trustworthy Digital Repository and the associated policies and processes that need to be in place to ensure data provenance and authenticity. This session will link to Monday’s exploration of the re3data.org portal
Visit: https://www.eudat.eu/eudat-summer-school
Data Discoverability and Persistent Identifiers - EUDAT Summer School (Chris...EUDAT
We will introduce the concept of persistent identifiers. We will explain how PIDs can be used, which PID systems exist and which use cases they are fit for. The use cases highlight that PIDs are a vital technology to enable FAIR data. The focus will lie on gathering hands-on experience with the Handle system. Participants will mint PIDs, i.e. not only create a resolvable PID but will also learn how to add, alter and delete metadata in the PID entry by employing the handle API directly and EUDAT’s B2HANDLE library.
Visit: https://www.eudat.eu/eudat-summer-school
The Importance of Metadata - EUDAT Summer School (Shaun de Witt, CCFE)EUDAT
Shaun will explain the importance of metadata for data discovery, provenance, reproducibility and reuse. Without sufficient metadata and documentation, research data cannot be found or understood. Providing this contextual information is critical for data to be FAIR. The topics of metadata ontologies and folksonomies are also discussed. This talk aims at giving the participants an understanding of the importance of metadata for both collaborative research and to ensure the usefulness of the data into the future, as well as an idea of what makes ‘good’ metadata.
Visit https://eudat.eu/eudat-summer-school
EUDAT Summer School Welcome - EUDAT Summer School (Yannis Tzitzikas, FORTH-ICS)EUDAT
Yannis will welcome the participants of the summer school on behalf of FORTH and will say a few words about the research center, its institutes and the hosting lab.
Visit https://eudat.eu/eudat-summer-school
Presentation given by Chris Higgens at the Annual Infrastructure for Spatial Information in European (INSPIRE) Conference Krakow, Poland. 22 June 2010.
1) The University of Edinburgh drafted an 18-month Research Data Management Roadmap in August 2012 to address institutional research data management and comply with their RDM policy.
2) The Roadmap outlines governance, data management planning support, development of an active data infrastructure including a data store, and data stewardship services such as a data repository and registry.
3) Services under the Roadmap include tailored data management plan assistance, customizing an online DMP tool, infrastructure for storing and accessing research data, and a data repository for depositing and long-term management of completed research outputs.
Are you a researcher, citizen scientist, institution or community looking for data storage and value-added services? Do you want access to tools to make your research data more FAIR (findable, accessible, interoperable, and reusable)? Interested in seeing how the future European Open Science Cloud could support research data and practically foster cross-border, cross-disciplinary collaboration? Then this webinar is for you!
The document discusses the EU INSPIRE Directive and its implications for UK academia. The INSPIRE Directive aims to create a European Spatial Data Infrastructure (SDI) to improve sharing of spatial information between public authorities and accessibility for the public. This will allow better environmental policies and outcomes. While initially for environmental policy, INSPIRE intends to extend to other domains. The directive may apply to UK universities as they are considered public authorities. This could mean universities would need to make certain spatial datasets available according to INSPIRE specifications. The directive presents both obligations and opportunities for UK academia as data providers and data users.
The document summarizes a workshop on geospatial metadata and spatial data. It discusses the importance of metadata for discovering and managing spatial datasets. It introduces geospatial metadata standards like FGDC, ISO 19115, and INSPIRE. It also describes the UK AGMAP profile, Geodoc metadata editor tool, GoGeo portal, and ShareGeo open data repository for sharing spatial resources in academia. Hands-on sessions demonstrate creating metadata and accessing datasets.
The Go-Geo! Spatial Data Portal provides a discovery and research tool for UK academics to find geospatial resources. It includes over 2,820 searchable geospatial datasets and metadata records. Go-Geo! also provides geospatial metadata best practices and guidelines, metadata editing and publishing tools, workshops and training to support the use and sharing of spatial data across UK academia.
Now we are six: Integrating Edinburgh DataShare into local and internet in...Robin Rice
#iassist40 presentation, Toronto, 6/6/2014.
Abstract:
Edinburgh DataShare, an institutional data repository, is six years old. It was built as a demonstrator in DSpace by EDINA and Data Library and has been given new life by the University of Edinburgh’s Research Data Management initiative. Following testing by pilot users in various departments last year, DataShare is confirmed as a key RDM service. Since 2008 much external infrastructure has grown around data sharing, and software developers, publishers and librarians are creating new innovations around the sharing and re-use of data daily. How can DataShare be shaped to fit in to this ever-more-sophisticated environment? A number of ongoing developments are helping us integrate the repository in the global context. DataShare is being indexed in Thomson-Reuter’s Data Citation Index. We aspire to attain the Data Seal of Approval for DataShare, a badge that confers trustworthiness through peer review. It is listed in re3data.org and databib registries of data repositories. We offer via extension, peer review of datasets to our depositors by listing journals that publish ‘data papers’ such as F1000 Research. Locally, as Information Services builds new data services such as the Data Store, [private data] Vault and the [metadata-only] Register, we can focus DataShare on its named purpose.
B2SHARE - How to share and store research data using EUDAT’s B2SHARE | www.eu...EUDAT
| www.eudat.eu | B2SHARE is a user-friendly, reliable and trustworthy way for researchers, scientific communities and scientists to store and share small-scale research data from diverse contexts.
Overview of the world of geospatial metadata, and the role of the EDINA service GoGeo in creating, saving, and discovering it. Presented on 19 June 2014 by Tony Mathys in Aberdeen, Scotland.
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT
| www.eudat.eu | 2nd Session: July 14, 2016.
In this webinar, Sarah Jones (DCC) and Marjan Grootveld (DANS) talked through the aspects that Horizon 2020 requires from a DMP. They discussed examples from real DMPs and also touched upon the Software Management Plan, which for some projects can be a sensible addition
Research engagement in EUDAT| www.eudat.eu | EUDAT
| www.eudat.eu | EUDAT’s vision is to enable European researchers and practitioners from any research discipline to preserve, find, access, and process data in a trusted environment, as part of a Collaborative Data Infrastructure (CDI) conceived as a network of collaborating, cooperating centres, that combine community-specific data repositories with the permanence and persistence of some of Europe’s largest scientific data centres. EUDAT services are community driven solutions. This presentation describes the different ways EUDAT engages with the research communities
SDA (Survey Documentation and Analysis) is software that allows users to access and analyze numeric microdata from repositories without needing specialized statistical software. It generates descriptive and inferential statistics, and basic visualizations. SDA benefits researchers by providing statistical analysis capabilities and easy access to metadata. It benefits repositories by facilitating secondary use of data while protecting sensitive information. SDA shows the value of numeric data for teaching and research.
This document summarizes the COBWEB project, AIP-6, and how federated access management could help meet their goals. COBWEB aims to crowdsource environmental data while ensuring data quality and privacy. AIP-6 will set up a federation of organizations to enable single sign-on for the GEOSS system. The document discusses how federated access control could authenticate users while protecting sensitive data sources. COBWEB and AIP-6 plan to demonstrate how federations can help with these tasks and inform future work on authorization and commerce.
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu | EUDAT
| www.eudat.eu | B2SHARE is a scientific data repository providing persistent storage and sharing data facilities. Building on the new Invenio 3.0 digital assets management platform, a new version of B2SHARE has been developed which is focused on an improved user experience. Answering the requests of the current user base, B2SHARE version 2 provides customizable metadata schemas and a simple but effective workflow for depositing user data, exposed in its RESTful HTTP API.
The presentation will introduce the B2SHARE service, its organizing principles and its basic operations. The metadata schemas and the dataset lifecycle, which are essentials in understanding the possibilities of the service, will be the main focus of the talk. The concrete output of the session can be a full paper expanding the presented topics.
Target Audience:Researchers of any scientific domain, which work with publishable data sets.
Presented by Tony Mathys at a Current Issues and Applications of the Geospatial Technologies Lecture, Department of Geography and Environment, Aberdeen University, 24 February 2012
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | EUDAT
| www.eudat.eu | Welcome Overview of the EUDAT service suite and the FAIR principles.
Sarah Jones, Marjan Grootveld, Yann Le Franc - IDCC Conference, February 20, 2017
Are you a researcher, citizen scientist, institution or community looking for data storage and value-added services? Do you want access to tools to make your research data more FAIR (findable, accessible, interoperable, and reusable)? Interested in seeing how the future European Open Science Cloud could support research data and practically foster cross-border, cross-disciplinary collaboration? Then this webinar is for you!
The document discusses the EU INSPIRE Directive and its implications for UK academia. The INSPIRE Directive aims to create a European Spatial Data Infrastructure (SDI) to improve sharing of spatial information between public authorities and accessibility for the public. This will allow better environmental policies and outcomes. While initially for environmental policy, INSPIRE intends to extend to other domains. The directive may apply to UK universities as they are considered public authorities. This could mean universities would need to make certain spatial datasets available according to INSPIRE specifications. The directive presents both obligations and opportunities for UK academia as data providers and data users.
The document summarizes a workshop on geospatial metadata and spatial data. It discusses the importance of metadata for discovering and managing spatial datasets. It introduces geospatial metadata standards like FGDC, ISO 19115, and INSPIRE. It also describes the UK AGMAP profile, Geodoc metadata editor tool, GoGeo portal, and ShareGeo open data repository for sharing spatial resources in academia. Hands-on sessions demonstrate creating metadata and accessing datasets.
The Go-Geo! Spatial Data Portal provides a discovery and research tool for UK academics to find geospatial resources. It includes over 2,820 searchable geospatial datasets and metadata records. Go-Geo! also provides geospatial metadata best practices and guidelines, metadata editing and publishing tools, workshops and training to support the use and sharing of spatial data across UK academia.
Now we are six: Integrating Edinburgh DataShare into local and internet in...Robin Rice
#iassist40 presentation, Toronto, 6/6/2014.
Abstract:
Edinburgh DataShare, an institutional data repository, is six years old. It was built as a demonstrator in DSpace by EDINA and Data Library and has been given new life by the University of Edinburgh’s Research Data Management initiative. Following testing by pilot users in various departments last year, DataShare is confirmed as a key RDM service. Since 2008 much external infrastructure has grown around data sharing, and software developers, publishers and librarians are creating new innovations around the sharing and re-use of data daily. How can DataShare be shaped to fit in to this ever-more-sophisticated environment? A number of ongoing developments are helping us integrate the repository in the global context. DataShare is being indexed in Thomson-Reuter’s Data Citation Index. We aspire to attain the Data Seal of Approval for DataShare, a badge that confers trustworthiness through peer review. It is listed in re3data.org and databib registries of data repositories. We offer via extension, peer review of datasets to our depositors by listing journals that publish ‘data papers’ such as F1000 Research. Locally, as Information Services builds new data services such as the Data Store, [private data] Vault and the [metadata-only] Register, we can focus DataShare on its named purpose.
B2SHARE - How to share and store research data using EUDAT’s B2SHARE | www.eu...EUDAT
| www.eudat.eu | B2SHARE is a user-friendly, reliable and trustworthy way for researchers, scientific communities and scientists to store and share small-scale research data from diverse contexts.
Overview of the world of geospatial metadata, and the role of the EDINA service GoGeo in creating, saving, and discovering it. Presented on 19 June 2014 by Tony Mathys in Aberdeen, Scotland.
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT
| www.eudat.eu | 2nd Session: July 14, 2016.
In this webinar, Sarah Jones (DCC) and Marjan Grootveld (DANS) talked through the aspects that Horizon 2020 requires from a DMP. They discussed examples from real DMPs and also touched upon the Software Management Plan, which for some projects can be a sensible addition
Research engagement in EUDAT| www.eudat.eu | EUDAT
| www.eudat.eu | EUDAT’s vision is to enable European researchers and practitioners from any research discipline to preserve, find, access, and process data in a trusted environment, as part of a Collaborative Data Infrastructure (CDI) conceived as a network of collaborating, cooperating centres, that combine community-specific data repositories with the permanence and persistence of some of Europe’s largest scientific data centres. EUDAT services are community driven solutions. This presentation describes the different ways EUDAT engages with the research communities
SDA (Survey Documentation and Analysis) is software that allows users to access and analyze numeric microdata from repositories without needing specialized statistical software. It generates descriptive and inferential statistics, and basic visualizations. SDA benefits researchers by providing statistical analysis capabilities and easy access to metadata. It benefits repositories by facilitating secondary use of data while protecting sensitive information. SDA shows the value of numeric data for teaching and research.
This document summarizes the COBWEB project, AIP-6, and how federated access management could help meet their goals. COBWEB aims to crowdsource environmental data while ensuring data quality and privacy. AIP-6 will set up a federation of organizations to enable single sign-on for the GEOSS system. The document discusses how federated access control could authenticate users while protecting sensitive data sources. COBWEB and AIP-6 plan to demonstrate how federations can help with these tasks and inform future work on authorization and commerce.
B2SHARE: Record lifecycle and HTTP API| www.eudat.eu | EUDAT
| www.eudat.eu | B2SHARE is a scientific data repository providing persistent storage and sharing data facilities. Building on the new Invenio 3.0 digital assets management platform, a new version of B2SHARE has been developed which is focused on an improved user experience. Answering the requests of the current user base, B2SHARE version 2 provides customizable metadata schemas and a simple but effective workflow for depositing user data, exposed in its RESTful HTTP API.
The presentation will introduce the B2SHARE service, its organizing principles and its basic operations. The metadata schemas and the dataset lifecycle, which are essentials in understanding the possibilities of the service, will be the main focus of the talk. The concrete output of the session can be a full paper expanding the presented topics.
Target Audience:Researchers of any scientific domain, which work with publishable data sets.
Presented by Tony Mathys at a Current Issues and Applications of the Geospatial Technologies Lecture, Department of Geography and Environment, Aberdeen University, 24 February 2012
How EUDAT services support FAIR data - IDCC 2017| www.eudat.eu | EUDAT
| www.eudat.eu | Welcome Overview of the EUDAT service suite and the FAIR principles.
Sarah Jones, Marjan Grootveld, Yann Le Franc - IDCC Conference, February 20, 2017
EUDAT Research Data Services for all | www.eudat.eu | EUDAT
| www.eudat.eu | EUDAT offers common data services, supporting multiple research communities as well as individuals, through a geographically distributed, resilient network of 35 European organisations.
January 2017 - A new presentation is available https://www.slideshare.net/EUDAT/eudat-b2service-suite-november-2017-wwweudateu
EUDAT B2Service Suite| - A new version is available at http://ow.ly/fsCi30grKHVEUDAT
EUDAT offers a complete set of research data services and storage resources distributed across 15 European countries. These services include B2DROP for personal cloud storage and synchronization, B2SHARE for storing and sharing small research datasets, B2SAFE for replicating data across multiple sites, B2STAGE for transferring data to high-performance computing resources, B2FIND as a metadata catalogue to find data, B2HANDLE for persistent identifiers, and B2ACCESS for authentication and authorization. The services are designed based on user community requirements to enable European researchers to access, preserve, and process research data in a trusted environment.
EUDAT B2Service Suite - November 2017 | www.eudat.eu |EUDAT
EUDAT offers a suite of shared research data services across 15 European countries funded by the European Union's Horizon 2020 programme. The services include personal cloud storage (B2DROP), a repository for publishing small research datasets (B2SHARE), large-scale data storage and preservation (B2SAFE), high-speed data transfer (B2STAGE), a metadata catalogue (B2FIND), and persistent identifiers (B2HANDLE). EUDAT was established as a pan-European initiative to build a sustainable cross-disciplinary data infrastructure through close collaboration with research communities.
Research Data Services: The EUDAT B2SERVICE SUITE | www.eudat.eu | EUDAT
| www.eudat.eu | EUDAT offers common data services, supporting multiple research communities as well as individuals, through a geographically distributed, resilient network connecting general purpose data centres and community-specific data repositories.
EUDAT is a cross-disciplinary data infrastructure project in Horizon 2020 that aims to provide a collaborative framework for managing the exponential growth of research data. It brings together several European research communities and over 25 user communities to develop shared services and solutions for storing, finding, accessing, and analyzing large amounts of complex research data. Some key services EUDAT provides include a metadata catalogue, persistent identifiers, data staging between storage and high performance computing, a simple store for uploading and sharing data, and safe replication of data across multiple sites. The goals of EUDAT in Horizon 2020 are to consolidate and improve its core services, ensure financial sustainability, enhance interoperability with other e-infrastructures, and help bridge national and European data solutions.
AIS data management and time series analytics on TileDB Cloud (Webinar, Feb 3...Stavros Papadopoulos
Slides used in the webinar TileDB hosted with participation from Spire Maritime, describing the use and accessibility of massive time series maritime data on TileDB Cloud.
B2SHARE is a service that allows researchers to store, publish, and share research data in a findable, accessible, interoperable, and reusable (FAIR) manner. It supports metadata descriptions, assigns persistent identifiers like DOIs and Handles, allows versioning and community domains with custom metadata and access rules. Data is stored securely in data centers in Finland and replicas may be made at other trusted centers to ensure long-term preservation.
B2STAGE- how to shift large amounts of data| www.eudat.eu | EUDAT
| www.eudat.eu | B2STAGE is a reliable, efficient, light-weight and easy-to-use service to transfer research data sets between EUDAT storage resources and high-performance computing (HPC) workspaces.
The document outlines the vision, mission, and strategy of the STFC (Science and Technology Facilities Council) in implementing e-Science technologies. The goals are to exploit data from STFC facilities through innovative infrastructure, integrate activities nationally and internationally, and improve computation and data management capabilities to enable new scientific discoveries.
Watch full webinar here: https://bit.ly/2Y0vudM
What is Data Virtualization and why do I care? In this webinar we intend to help you understand not only what Data Virtualization is but why it's a critical component of any organization's data fabric and how it fits. How data virtualization liberates and empowers your business users via data discovery, data wrangling to generation of reusable reporting objects and data services. Digital transformation demands that we empower all consumers of data within the organization, it also demands agility too. Data Virtualization gives you meaningful access to information that can be shared by a myriad of consumers.
Register to attend this session to learn:
- What is Data Virtualization?
- Why do I need Data Virtualization in my organization?
- How do I implement Data Virtualization in my enterprise?
Introduction to Modern Data Virtualization (US)Denodo
Watch full webinar here: https://bit.ly/3uyvxN5
“Through 2022, 60% of all organizations will implement data virtualization as one key delivery style in their data integration architecture," according to Gartner. What is data virtualization and why is its adoption growing so quickly? Modern data virtualization accelerates that time to insights and data services without copying or moving data.
Watch this webinar to learn:
- Why organizations across the world are adopting data virtualization
- What is modern data virtualization
- How data virtualization works and how it compares to alternative approaches to data integration and management
- How modern data virtualization can significantly increase agility while reducing costs
- How to easily get started with Denodo Standard 8.0
Data management plans – EUDAT Best practices and case study | www.eudat.euEUDAT
| www.eudat.eu | Presentation given by Stéphane Coutin during the PRACE 2017 Spring School joint training event with the EU H2020 VI-SEEM project (https://vi-seem.eu/) organised by CaSToRC at The Cyprus Institute. Science and more specifically projects using HPC is facing a digital data explosion. Instruments and simulations are producing more and more volume; data can be shared, mined, cited, preserved… They are a great asset, but they are facing risks: we can miss storage, we can lose them, they can be misused,… To start this session, we will review why it is important to manage research data and how to do this by maintaining a Data Management Plan. This will be based on the best practices from EUDAT H2020 project and European Commission recommendation. During the second part we will interactively draft a DMP for a given use case.
How to store, organize and use petabytes of heterogenous dataOVHcloud
In this session, you will learn how you can set up a scalable infrastructure for your business and make data covering thematic streams like marine, land, climate, emergency… valuable and useful for your business.
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
This document discusses analyzing big data in medicine using virtual research environments and microservices. It notes the vast amount of data being generated and challenges of data management, analysis and scaling. The European Open Science Cloud aims to enable access to shared scientific data across borders. Contemporary analysis uses high-performance computing but has limitations. Cloud computing, virtual machines, containers and microservices can help address these challenges by providing on-demand resources and decomposing functionality into independent services. The PhenoMeNal project is building a standardized e-infrastructure using these approaches to enable users to access tools and data. This improves sustainability, reliability, scalability and enables agile development and science.
The document discusses trends in data growth and computing. It notes that the amount of data being stored doubles every 18-24 months and provides examples of large data holdings from companies like AT&T, Google, and Walmart. It then summarizes key points about data growth from enterprises and digital lives. The rest of the document focuses on strategies and technologies for managing large and growing volumes of data, including parallel processing databases, new database architectures, and the QueryObject system.
Similar to EUDAT Service Suite Overview - EUDAT Summer School (Shaun de Witt, CCFE) (20)
With a network of more than 20 European research
organisations, data and computing centres in 14 countries,
the EUDAT Collaborative Data Infrastructure (CDI) is one of
the largest infrastructures of integrated data services and
resources supporting research in Europe.
The EUDAT Collaborative Data Infrastructure (CDI) provides a suite of integrated data services and resources to support research across Europe. It is sustained by over 20 European research organizations and data centers. The CDI includes services like B2SHARE for publishing and sharing research data, B2FIND for discovering data, B2DROP for syncing and exchanging data, B2SAFE for replicating data safely across administrative domains, and more. These services help researchers and communities manage the entire research data lifecycle in a FAIR and trusted manner.
The EUDAT Collaborative Data Infrastructure (CDI) is a growing European research data infrastructure that supports the full lifecycle of research data across borders and disciplines. It provides integrated data services through a collaboration between service providers and research communities. The CDI assists researchers with training and consultancy on research data management and helps build the backbone of the European Open Science Cloud.
B2HANDLE is a distributed service that maintains globally unique persistent identifiers (PIDs) to reliably identify and cite research data objects throughout their lifecycle. It is based on the Handle System and provides a Python library for minting, storing, managing, and resolving PIDs. In the EUDAT ecosystem, services like B2SAFE and B2SHARE use B2HANDLE to create and manage PIDs for hosted data objects, while B2FIND and B2STAGE use PIDs to retrieve and refer to objects.
B2SAFE is a robust service that allows repositories to implement data management policies across administrative domains in a trustworthy manner. It offers an abstraction layer for large-scale heterogeneous storage, protects against data loss, allows optimized access, and enables compute-intensive analysis. B2SAFE is designed to execute auditable policy rules and use persistent identifiers to increase trust in data reuse by ensuring ownership rights and replicating data across sites for safekeeping.
B2FIND is an interdisciplinary discovery portal that allows users to search for and find research data collections from various sources using faceted search options. It harvests and indexes metadata from EUDAT data centers, community repositories, and other sources using various protocols, and provides a unified search interface across data from different scientific domains and metadata standards. Search results can be filtered by location, time period, and other textual facets. Data providers can make their research data collections discoverable on B2FIND by defining mappings between their metadata and the B2FIND schema.
B2ACCESS is a federated authentication and authorization proxy that allows users to log in using credentials from their home institutions or social media accounts to access services in a secure way. It supports standards like SAML, OAuth, and OIDC for service integration and allows single sign-on access for users. B2ACCESS works by authenticating users through external identity providers like university login systems and then authorizes them for access to EUDAT services based on their group memberships.
This document discusses writing effective service documentation for the EUDAT B2 suite of services. It provides an overview of the EUDAT B2 services and research data lifecycle context. It describes the approach to documentation writing, including identifying the end users and leveraging documentation from core technologies. The format and locations of documentation for services like B2ACCESS, B2DROP, B2FIND, B2HANDLE, B2SAFE, and B2SHARE are outlined, including code repositories, descriptions, usage guides, and training materials.
This document summarizes a pilot project using B2NOTE to semantically annotate research data stored in EUDAT B2SHARE. The project was conducted by researchers at the University of Porto. B2NOTE allows researchers to directly annotate datasets in B2SHARE using lightweight ontologies. The annotations are stored along with the datasets. This helps promote semantic annotation of research data by those who create it.
The document discusses plans for a task force on research data management (RDM) as part of the OpenAIRE Advance project. The RDM task force will work to establish working groups to examine different elements of the digital research lifecycle. It will gather examples of good RDM practices and standards. The task force will also scope current and future needs of National Open Access Desks to support RDM. The goal is to increase knowledge of RDM among project partners and support open data and FAIR data activities. A separate section discusses plans for open science training through a helpdesk. The helpdesk will provide practical guidance on open science principles and develop training materials and case studies in collaboration with other projects and infrastructures.
European Open Science Cloud - Skills workshopEUDAT
This document outlines the agenda for a workshop on skills for the European Open Science Cloud (EOSC). The workshop will discuss the skills needed to use EOSC services, existing skills frameworks and training provisions from related projects like EOSCpilot, skills mapping between competencies and cloud capabilities, and applying FAIR principles to training resources. Participants will provide input through live polling and breakout groups. The goal is to validate existing work on skills and identify remaining questions, and discuss how EOSC can help close skills gaps for research institutions, libraries and infrastructures.
Linking service capabilities to data stweardship competences for professional...EUDAT
This document discusses the development of a skills framework for the European Open Science Cloud (EOSC) project. It outlines the goals of defining a data stewardship competency framework, analyzing current skills gaps, and making recommendations for training services in EOSC. The document discusses how stewardship ensures data and research objects are managed according to FAIR principles. It also summarizes the analysis of the current complex landscape of skills training providers and the skills gaps identified by EOSC demonstrators and other stakeholders. Next steps include additional workshops to gather feedback on missing competencies and how EOSC can enhance training.
This document discusses applying FAIR data principles to training materials and events to make them more findable, accessible, interoperable, and reusable. It suggests repositories use DOIs and versioning to make materials findable, use standard metadata and formats to make them accessible and interoperable, and provide rich descriptions and licenses to make them reusable. It also poses questions about how EOSC can support training providers and ensure a comprehensive catalogue of high quality training resources.
Training by EOSC-hub - Integrating and Managing services for the European Ope...EUDAT
EOSC-hub receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 777536. EOSC-hub will create a federated integration and management system for the European Open Science Cloud to provide services for data, applications and tools, baseline services, a marketplace, AAI, accounting, monitoring and more according to principles of engagement, security regulations, standards, and terms of use. EOSC-hub involves 20 European infrastructures and over 100 partners working to implement the Federated Service Management training and provide domain-specific and generic training services.
The document proposes a draft governance framework for the European Open Science Cloud (EOSC) consisting of three layers:
1) A Steering layer representing users and providers to determine requirements, policies, and evaluate EOSC effectiveness.
2) A Strategic layer representing funders and decision makers to set the strategic vision, objectives, and performance metrics.
3) An Executive layer ensuring EOSC meets objectives by channeling funding as needed.
The framework is intended to be flexible, leveraging existing structures, and applying the European Interoperability Framework to coordinate different stakeholders across borders. Feedback is sought on the proposed model and decision flow between layers.
The document discusses the AARC project, which aims to improve federated identity management (FIM) for researchers across Europe. The project works to address eScience requirements and offer support for global policies around FIM. It has developed a blueprint architecture and set of building blocks to enable authentication and authorization across research collaborations and infrastructures. The AARC project also focuses on engagement with research communities and infrastructure providers to promote adoption of harmonized solutions. It has established several working groups and frameworks to facilitate collaboration and address security, assurance, and policy aspects of integrated identity and access management.
This document summarizes the work of the ENVRIPLUS project, which developed reusable solutions to common challenges faced by environmental research infrastructures (RIs). It produced the ENVRI Reference Model (RM) and OIL-e to support interoperability. It established a knowledge base and service portfolio of reusable solutions. Current challenges include operational issues like authentication, deployment and maintenance, as well as enabling interdisciplinary science through effective virtual research environments, optimization of data processing, and ensuring long-term sustainability.
The document provides information on services developed by the Data for Science theme of the H2020 project related to environmental research infrastructures (ENVRIs).
It includes six sections that group services related to reference models, reusable solutions from use cases/RIs, software quality checking, and the four service pillars defined in the theme: common vocabulary, semantic linking, RI development, and provenance. Each service is described in terms of its relation to the research data lifecycle, technical specifications, how to use it, and any success stories.
The services aim to support all phases of the research data lifecycle for ENVRIs, from data acquisition to use, through areas like reference models, semantic linking of data, catalog
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
Codeless Generative AI Pipelines
(GenAI with Milvus)
https://ml.dssconf.pl/user.html#!/lecture/DSSML24-041a/rate
Discover the potential of real-time streaming in the context of GenAI as we delve into the intricacies of Apache NiFi and its capabilities. Learn how this tool can significantly simplify the data engineering workflow for GenAI applications, allowing you to focus on the creative aspects rather than the technical complexities. I will guide you through practical examples and use cases, showing the impact of automation on prompt building. From data ingestion to transformation and delivery, witness how Apache NiFi streamlines the entire pipeline, ensuring a smooth and hassle-free experience.
Timothy Spann
https://www.youtube.com/@FLaNK-Stack
https://medium.com/@tspann
https://www.datainmotion.dev/
milvus, unstructured data, vector database, zilliz, cloud, vectors, python, deep learning, generative ai, genai, nifi, kafka, flink, streaming, iot, edge
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
EUDAT Service Suite Overview - EUDAT Summer School (Shaun de Witt, CCFE)
1. www.eudat.eu
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065
The EUDAT Service Suite
Shaun de Witt
2. Learning Objective
Why Data Management is Important
Get an overview of the EUDAT services and how they link together
4. The Importance of Data Management
Research Infrastructure trends:
Internationalisation
Diversification
middle age 19th century 20th century 21st century
Large Scale Projects:
SKA (300PB/yr)
LHC (600PB/yr – run 4)
Human Brain Project (21PB)
IPCC/CMIP (10PB ->150PB)
ITER (600PB/yr)
XFEL (10PB/yr)
5. Data Scientist Core Skills (Research Oriented)
From EDISON Data Science Framework
7. Data Management (9000-1,000BC)
Good
Excellent long term preservation with
the right materials
Cheapish materials
Difficult to corrupt/overwrite
Bad
No key to interpretation
VERY slow write rates
Difficult to re-use in a global
environment
Main Uses
Laws, accounting and taxes
8. Data Management (2500BC-present)
Good
Easy to manufacture materials
Much finer details than previous effort
(better bit density)
OK long term preservation
Improved data movement
Bad
Needs right conditions
Easily lost/fragile
Cataloguing and indexing was initially a
problem
Main Uses
Laws, accounting and taxes
Sketching, writing, love letters, paper
airplanes…
9. Data Management (1AD-present)
Good
Easy to manufacture materials
Simpler to organise
OK long term preservation
Highly portable
Bad
Needs right conditions
Easily lost/fragile
Optical Character Recognition not
Invented
Loss of knowledge over time
Main Uses
Meeting minutes, meeting actions,
doodling, Remembering the date,
10. Data Management (1890s – 1990s)
Good
Excellent longevity
Very high data density (35mm frame is
equivalent to about 40 Mpixels)
Highly portable
Reproducible
Bad
Sometimes takes several attempts to
produce good data
Fragile
Subject to noise
Not very metadata rich (difficult to
index)
Main Uses
Pretty astronomical pictures, Family
albums, Selfies…
12. Data Management (1950s – 1990s)
Good
Really programmable
Proper digital.
Reproducible and Transportable
Bad
Tapes and disk were
Low data density
Not designed to any standard
Main Uses
Maths, Bad 1970’s movie backdrops,
Calculating taxes140MB
170MB
1.25MB/s
~ 1MW
~1MB memory
~16 MIPS
14. Data Management (1960s - present)
Good
Powerful
Scalable
Bad
MS-DOS, Windows 3.1, OS-2
Main Uses
Chuckie Egg, Solitaire, Manic Miner
~ 15MW
~1.3PB memory
~93TFLOPS
~900TB
>500PB
~50MB/s
15. Data Management – A Personal Perspective
Data Management is NOT just about data preservation
What data do I need to preserve
What data do I want to make visible
What legal frameworks do I need to adhere to
Who can access my data
Where do I need to move my data to
And when do I need to move it
17. EUDAT Service Suite
During this course you will learn:
How services link to data lifecycle
How services support the FAIR principle
How to use the Web Interface
Where available
How to use the APIs available to access services programmatically.
20. EUDAT Data Domain modeled on the ANDS1 Data Curation Continuum
1. Australian National Data Service organization – www.ands.org.au
Data Domains
21. Help desk
Monitoring
Collaboration Tools (restricted access)
Service Catalogue and registry
Input to requirements
Data project
co-ordination
More than Just Services
22. Secure Access to Services
b2access.eudat.eu
www.eudat.eu
B2ACCESS
B2ACCESS is an easy-to-use and secure authentication and
authorization platform which can be integrated with any
service and supports different methods of authentication.
23. An easy-to-use and secure
authentication and authorization
platform integrated with any
services
The user may log in by using
different methods of
authentication:
Home organisation identity
provider
Social ID
EUDAT ID
Allows group-, community- and
service managers to specify
authorisation decisions
Features:
Easy integration in any service
Reliable and light-weight
Powerful management interface
24. b2drop.eudat.eu
www.eudat.eu
Sync and Share Research Data
B2DROP
EUDAT’s Personal Cloud Storage Service
B2DROP is a secure and trusted data exchange service for
researchers and scientists to keep their research data synchronized
and up-to-date and to exchange with others.
25. Store and exchange data with
colleagues and team members,
including research data not
finalized for publishing
share data with fine-grained
access controls
synchronize multiple versions of
data across different devices
An ideal solution for researchers and scientists to:
Features:
20 GB storage per user
Living objects, so no PIDs
Versioning and offline use
Desktop synchronisation
26. Store and Publish Research Data
b2share.eudat.eu
www.eudat.eu
B2SHARE
B2SHARE is a user-friendly, reliable and trustworthy way for
researchers, scientific communities and scientists to store and share
small-scale research data from diverse contexts.
27. store data safely at a trusted
and certified data centre
preserve data to guarantee
long-term persistence
control access and share data
with colleagues and the world
A winning solution for researchers, scientists and communities
to:
Features:
Metadata management
Permanent PIDs
Open Access support
28. Replicate Research Data Safely
eudat.eu/b2safe
www.eudat.eu
B2SAFE
B2SAFE is a robust, safe and highly available service which allows
community and departmental repositories to implement data
management policies on research data across multiple administrative
domains in a trustworthy manner.
29. replicate research data into secure
data stores
archive and preserve research data
in the long-term
bring data close to powerful
compute resources
co-locate data with different
communities
benefit from economies of scale
The ideal solution for communities with no facility for archival
to:
Features:
Large-scale storage
Robust and highly available
Permanent PIDs
30. Get Data to Computation
eudat.eu/b2stage
www.eudat.eu
B2STAGE
B2STAGE is a reliable, efficient, light-weight and easy-to-use service
to transfer research data sets between EUDAT storage resources and
high-performance computing (HPC) workspaces
31. move large amounts of data
between data stores and high-
performance compute resources
re-ingest computational results
back into EUDAT
deposit large data sets onto EUDAT
resources for long-term preservation
Facilitating communities to:
Features:
High-speed transfer
Reliable and light-weight
Manages permanent PIDs
33. seek data objects and collections
using powerful metadata searches
catalogue community data by
means of selected metadata
browse through multi-disciplinary
data collections filtered by content,
provenance and temporal keywords
A metadata catalogue service to:
Features:
Simple to use
Standards-based
Comprehensive catalogue
34. Data Discovery and Identification
b2handle.eudat.eu
www.eudat.eu
B2HANDLE
B2HANDLE provides an abstraction layer between a globally
unique persistent identifier and a physical location of a data
object allowing researchers to reliably cite and refer in the
long term.
35. Provides abstraction layer between
a globally unique persistent
identifier and physical location
of data objects
Follows policies to register data
and make it long term refer- and
citable
Features:
Reliability through mutual PID mirroring
Machine readable via HTTP RESTful API
Simple integration with any service
Technology agnostic
Sensors increasing resolution (synchrotron and muon sources), genetic sequencing becoming quicker and sequencers becoming cheaper – superexponetial growth)
EUDAT cooperates with a wide variety of research communities, such as the medical and biomedical sciences, environmental sciences, materials and analytical facilities, social sciences and humanities and physical sciences and engineering.
EUDAT has concrete agreements with 7 core communities, an integral part of the initiative, namely:
CLARIN: Common Language Resources and Technology Infrastructure
ELIXIR: A distributed infrastructure for life-science information
ENES: European Network for Earth System Modelling
EPOS: European Plate Observing System
ICOS: Integrated Carbon Observation System
LTER Europe: European Long Term Ecological Research Network
VPH: Virtual Physiological Human
EUDAT works on a Collaborative Data Infrastructure conceived as a network of collaborating, cooperating centers, combining the richness of numerous community specific data repositories with the permanence and persistence of some of Europe’s largest scientific data centers.
B2ACCESS is an easy-to-use and secure authentication and authorization platform which can be integrated with any service and supports different methods of authentication.
B2ACCESS provides an easy-to-use and secure authentication and authorization platform integrated in all other services. It provides different methods of authentication through the home organisation identity provider, but also allows social IDs like Google and Facebook as well as the EUDAT ID. Managers can specify authorisation decisions in the dedicated interface.
For more information see: b2access.eudat.eu
The EUDAT Services Suite consists of seven services. Each service is presented in the following slides, starting with B2DROP.
The B2DROP service can be characterized as a personal cloud storage service. It is a secure and trusted data exchange service.
The B2DROP service is a cloud solution to store and share data in the early state of the research data life cycle. It is aimed at individual researchers and enables the storage and exchange of data with colleagues and team members. Data can be shared with fine-grained access controls. B2DROP synchronizes multiple versions of data across different devices and platforms. B2DROP users are offered up to 20 GB of storage space for their data.
The B2DROP service can be found at b2drop.eudat.eu.
The next service of the EUDAT Services Suite is the B2SHARE service to store and share small-scale research data form diverse contexts.
The B2SHARE service is aimed at individual researchers. It has been integrated in a number of research infrastructures and EUDAT defines custom made community based metadata schema templates to facilitate users.
B2SHARE facilitates data storage in a trusted and certified repository that guarantees long-term persistence of the data. Data objects get a persistent identifier. Depositors can document their data objects and give the data a usage license, preferably an open access license.
The B2SHARE service can be found at b2share.eudat.eu.
The third service of the EUDAT Services Suite is the B2SAFE service.
This service allows community and department repositories to implement data management policies on research data across multiple administrative domains.
The B2SAFE service is aimed at research communities that have no facilities for archival data storage. The service supports a number of procedures such as data replication and the co-location of data with different communities. B2SAFE facilitates high-scale petabytes storage.
More information on the B2SAFE service can be found at eudat.eu/b2safe.
The B2STAGE service enables the movement of large amounts of data between data stores and high-performance computing resources.
The B2STAGE service is aimed at research communities and infrastructures to move large amounts of data between data stores and high-performance computing resources, to re-ingest computational results back into EUDAT and to deposit large data sets onto EUDAT resources for long-term preservation.
More information on the B2STAGE service can be found at: eudat.eu/b2stage.
The last service of the EUDAT Service Suite is the B2FIND service. It can be characterized as a simple, user-friendly metadata catalogue of research data collections stored in EUDAT data centers and other repositories.
The B2FIND service enables the searching and browsing for data objects and collections and supports a number of metadata formats. B2FIND facilitates browsing through multi-disciplinary data collections.
More information on the B2FIND service can be found at: https://b2find.eudat.eu.
B2HANDLE provides an abstraction layer between a globally unique persistent identifier and a physical location of a data object allowing researchers to reliably cite and refer in the long term.
B2HANDLE provides an abstraction layer between a globally unique persistent identifier and a physical location of data objects. It follows policies to register data and make it long term referable and citable.
The service provides high reliability and availability and can be easily integrated using a HTTP RESTful API in any other service or application. The service is therefore technology-agnostic.
For more information see: eudat.eu/b2handle
Most of the EUDAT services have their dedicated service domain and documentation. For all services documentation, tutorials and training are in the making or already exist.
The service suite will be enhanced and expanded. So keep in touch. If you have any questions or remarks concerning the services or the EUDAT initiative, please use the contact form available at the EUDAT website: eudat.eu.