EUDAT B2SHARE: How to store and publish research data | www.eudat.eu
Jun. 28, 2017•0 likes•1,277 views
Download to read offline
Report
Data & Analytics
B2SHARE is a user-friendly, reliable and trustworthy way for researchers, scientific communities and scientists to store and share small-scale research data from diverse contexts.
EUDAT B2SHARE: How to store and publish research data | www.eudat.eu
1. Store and Publish Research Data
b2share.eudat.eu
www.eudat.euEUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065
B2SHARE
How to store and publish research data using EUDAT’s
repository service
This work is licensed under the Creative
Commons CC-BY 4.0 licence
Version 9
June 2017
2. b2share.eudat.eu
B2SHARE is...
a user-friendly, reliable and trustworthy way for
researchers, scientific communities and citizen
scientists to store and publish research data coming
from diverse contexts
B2SHARE Training 2
3. b2share.eudat.eu
The Who, What, Why of B2SHARE
B2SHARE Training
Who uses B2SHARE?
Researchers, students and
even citizen scientists are
creating “long tail” data
which is not stored safely or
easily publishable.
Why use it?
To have that peace of mind
that your data is stored safe
and sound while reaping the
rewards of sharing data with
others.
What is B2SHARE?
B2SHARE is a user-
friendly data repository
A place where data sets
can be safely stored and
published.
4
Who?
What?
Why?
4. b2share.eudat.eu
Who can use B2SHARE?
B2SHARE Training
Anyone!
From individual researchers, PhD and Postdoc students, to
project coordinators and collaborators, and even schools
and citizen scientists
Use the service as an individual user or as part of a specific
research community
B2SHARE bridges the gap between citizen scientists and
researchers to enable and stimulate research data
collaboration and sharing
A winning solution to …
Store: facilitates research data storing
Preserve: guarantees long-term persistence of data
Publish: allows publication of data, results or ideas
worldwide
5
5. b2share.eudat.eu
B2SHARE added-value
Your data is …
Hosted so there are no hardware or network worries on
the depositor side
Assigned a persistent identifier and therefore is always
retraceable to you
Stored alongside queryable & findable metadata and
automatically available via the B2FIND metadata
catalogue
Managed and stored by a data centre
B2SHARE Training 6
CC0 – pixabay.com
6. b2share.eudat.eu
Registration
B2SHARE Training
1. Visit our homepage
https://b2share.eudat.eu
2. Click on “Register”:
You will enter the B2ACCESS
domain!
3. Click “Register a new
account”:
Select an account type and
enter your details
7
8. b2share.eudat.eu
How do I deposit data? – step 1
Go to the homepage
and click on “Create a
new record”
B2SHARE Training 9
Next:
Enter a title
Select an appropriate
domain or project by
clicking on one of the
domain or project boxes
Click “Create Draft
Record”
Datasets will be annotated with the selected domain’s metadata schema where
applicable
9. b2share.eudat.eu
How do I deposit data? – step 2
B2SHARE Training
Select and upload one or several data resources
Drag and drop to
"Drag and drop
files here”
OR….. Hit “Select files” and select
from your pop-up window:
Files will automatically upload!
You can cancel uploads by clicking on the
cross:
OR….. Add files
directly from
B2DROP
10
10. b2share.eudat.eu
How do I deposit data? – step 3
B2SHARE Training
Add basic metadata fields.
The more information you add the
easier your data will be found
by others
Open Access:
Leave as “True” (default) and
everyone can see your data
Select “False” to make your data
accessible only by yourself and
the community administrator
License:
Add a license to your data to
encourage fair usage, use the
public license selector
In all cases the metadata will be
visible to everyone!
11
11. b2share.eudat.eu
How do I deposit data? -
Public license selector
B2SHARE Training
Choose a public license by
answering several
questions regarding
access to your dataset.
Suggestions depend on
several factors:
- Type of data
- Original licenses
- Data consumer access
and distribution rights
Or use the search
functionality.
12
12. b2share.eudat.eu
How do I deposit data? – step 4
B2SHARE Training
Add detailed metadata fields:
The more information you
add the easier your data
will be found by others
Detailed metadata will also be
visible to everyone!
13
Allows to uphold metadata
standards as required by the
community
Possibly some fields (starred) are
mandatory!
If the community has defined a community metadata schema:
13. b2share.eudat.eu
How do I deposit data? – step 5
Select “Submit draft for publication” to get your record
published immediately after save your draft
Hit the “Save draft” button to publish your data!
B2SHARE Training
Depending on the community settings, your record will be
immediately published, or it may need approval by a
community data manager
14
14. b2share.eudat.eu
How do I search for data?
B2SHARE Training
Basic search
Type in part of a title,
keyword, abstract or other
metadata and click on
“Search”
Advanced search
The Advanced search options are available after initial search
Specify community, sort order and page size
Again click on search to update the results
15
15. b2share.eudat.eu
How can I download data?
Once you have found the data you want, click on the dataset
title to show its details:
B2SHARE Training 16
17. b2share.eudat.eu
What data can you upload?
Research data
Primary data
Processed data
Data as basis for a publication
Empirical data
Theoretical data
In virtually any kind of
format….
Papers
Spreadsheets
Audio-visual media,
Data source or purpose of the
data has a scientific
background.
B2SHARE Training
Just a note….
Make sure that you are allowed to upload your data. Data protection laws exist to protect
sensitive data and restrictions apply also on where this data is stored. Always check if
your data has any such restrictions on where it can be stored.
Unrelated personal data should not be stored on B2SHARE
18
18. b2share.eudat.eu
What data can you upload?
Are there any limits on how much I can upload?
For the EUDAT-hosted B2SHARE service:
The number of files you can upload is unlimited
Maximum file size is currently 10 GB per file;
maximum record size is 20 GB
If you want to upload larger files contact the site’s
administrator for additional options
For the EUDAT B2SHARE instance contact
b2share@eudat.eu
B2SHARE Training 19
19. b2share.eudat.eu
Can I restrict access to my data?
You can choose if you want your data
to be open access or be restricted.
All metadata stored in B2SHARE,
except private information such as
email addresses and phone
numbers, are made publicly
available.
As the service is dedicated to
research data, the names and
affiliations of the data owner
and/or the data depositor are
publicly available.
B2SHARE Training 20
gement
CC0 – pixabay.com
data
mana
20. b2share.eudat.eu
Identifying data
The B2SHARE service provider will:
• Respect the user’s access restrictions
• Assign persistent identifiers to each object
using the B2HANDLE service
• Store the data according to the B2SHARE
service statements.
For the most common data types – such as
text, audio and video files – custom players
will be available so that authorized users
will be able to directly view the contents of
the data files.
B2SHARE Training 21
CC0 – pixabay.com
21. b2share.eudat.eu
What happens to my data once I have
deposited it?
EUDAT has no claim over the data
deposited in B2SHARE and
depositors remain entirely
responsible for the data they
deposit
Data is stored on state of the art
servers at the Finnish IT Center
for Science, Kajaani, Finland
EUDAT retains the right to archive,
i.e. create replicas at trusted
centres to take care of long-term
persistence
B2SHARE Training
For data stored in the EUDAT-hosted B2SHARE instance:
22
CSC
22. b2share.eudat.eu
B2SHARE REST API
Direct access to the B2SHARE service functionality using
the B2SHARE REST API
Integrate B2SHARE in your workflow and applications
Create, read, list, update and delete records
Annotate with metadata
Get community information and metadata schemas
Download files directly without using the browser
B2SHARE Training 23
23. b2share.eudat.eu
And how much does all this cost?
EUDAT’s B2SHARE instance services
Self-service registration
Free upload and registration of stable research
data
Data access policy defined by data owner
Metadata openly accessible and harvestable
Customized metadata handling and customized
user interfaces (e.g. for metadata acquisition)
Data integrity ensured by checksums which are
calculated during data ingest
Data is kept online
Storage usage based on fair share principle
Professionally managed service: data is stored
at Datacenter CSC Kajaani, certified by the
ISO/IEC 27001:2005 standard for its
information security management system
User support provided via EUDAT ticketing
system
Service availability and usage of storage is
monitored
Data and metadata remain accessible for two
years if instance is closed
B2SHARE Training 24
The EUDAT-hosted B2SHARE service is free of charge for European
scientists and researchers.
No comparable service currently exists to support EU research.
24. b2share.eudat.eu
What’s next for B2SHARE users?
We are always looking to improve B2SHARE and the following
additions will be made:
Provision of basic B2SHARE service by multiple service providers
Communities will be able to request a premium service providing
larger storage capacity
Service provisioning based on SLAs
Users will be able to choose their own trusted service provider
B2SAFE repositories will be connected to the safe replication
service
Sharing data with user groups
Introduction of social tagging
B2SHARE Training
CC0 – pixabay.com
25. b2share.eudat.eu
For more info: https://eudat.eu/services/b2share
B2SHARE User Documentation:
https://eudat.eu/services/userdoc/b2share
B2SHARE Training presentations:
https://www.eudat.eu/b2share-training-suite
B2SHARE hands-on training:
https://github.com/EUDAT-Training/B2SHARE-Training
26
26. b2share.eudat.eu
www.eudat.eu
This work is licensed under the Creative Commons CC-BY 4.0 licence
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures.
Contract No. 654065
Authors Contributors
Carl Johann Håkansson
Hans van Piggelen, SURFsara
Mark van de Sanden, SURFsara
Thank you!
Editor's Notes
This presentation discusses usage of the B2SHARE data store and publication service of EUDAT.
B2SHARE is a user-friendly, reliable and trustworthy way for researchers, scientific communities and citizen scientists to store and publish research data coming from diverse contexts.
B2SHARE is part of the CDI and directly connected to the B2DROP and B2SAFE services for data import and export
Several other services such as B2HANDLE and B2NOTE are connected to store persistent identifiers and add annotation like provenance data
B2ACCESS is used for authentication and in the future also authorization
B2FIND for metadata harvesting, so that your records and metadata can be found in more general publication search services
B2SHARE is a data repository service to safely store and publish research data sets for long term preservation.
B2SHARE is used by researchers, students and citizen scientists creating data sets which is not stored safely at their own premises or easily publishable. It is used to be assured of a safe place for storage and publication of your research data.
Many different people use B2SHARE, like researchers, PhD students or post-docs. Either as individuals or as members of a specific research community
B2SHARE bridges the gap between citizen scientists and researchers to enable and stimulate research data collaboration and shoring
B2SHARE facilitates storage of research data, guarantees long-term persistence of data and allows publication of data, results and ideas worldwide
The benefits of B2SHARE are as follows:
Data is hosted, so you don’t have to worry about hardware or network issues
All data objects are automatically assigned a persistent identifier, one for each version, so that these objects can be easily found and always be traced to the original depositor
Data is stored alongside queryable & findable metadata. All metadata is automatically harvested by the EUDAT B2FIND service and is therefore easily discoverable even outside EUDAT premises
Data is managed and stored by a EUDAT or external data centre to guarantee long-term preservation
Registration is easy and can be done by the user itself:
Click on Login or registration
B2ACCESS will be opened: click on ‘Register a new account’
You can either create a new account or connect through your institutional credentials if your institution supports it
Once your account is set up, click on your name and select ‘Profile’
The account page shows your roles, records, draft records and API tokens. You can create new tokens here for direct usage in your API-enabled applications.
How to deposit new data?
Make sure you are logged in!
On the homepage click on “Create a new record”
Please note that:
Your deposit will be known as a record once it is published
Unpublished records are known as draft records and can be edited
Files and metadata need to be added during the process
Add files to your new record by:
Select files by dragging them from your file browser to the drag ‘n’ drop box
Click on the same box and select your files
Add files directly from your B2DROP account by clicking on the “Add B2DROP files” box
Selected files are immediately uploaded to B2SHARE.
Continue by filling in the basic metadata fields like title, description and creators. Then choose whether your deposit is Open Access and select a license using the license selector tool. Optionally choose an embargo date, and other more detailed metadata
Click on the “Select License” button to open the public license selector
Public licenses can also be selected using the public license selector tool
With the tool you can select the appropriate license by answering a few questions which will finally suggest the right licenses covering your requirements
The suggestion depends a.o. on the type of data, original licenses of used data and data consumer access and distribution rights you want to allow
Click on “Show more details” to expand the form with more metadata field options
Things to note:
Remember: all metadata will be visible at all times even for restricted access datasets!
The more metadata you fill in the better your dataset will be found by others!
Community metadata schema:
If your chosen community has defined its own metadata schema these need to be filled in as well
Possibly fields are mandatory!
Select “Submit draft for publication” to immediately publish your draft record upon saving it. The button text will change to “Save and publish”
Click “Save draft” to save your filled form and allows changes to be made later on. Your record is not visible in the meantime.
Depending on the community settings, your record will be immediately published, or needs approval by a community data manager.
Search for data using the search functionality at the top of any page
For advanced options, click “Search” and use the extra options below the search box. You can select specific communities, sort order and the number of results per page. Click on a page button to see additional results for your search. You need to click on search again to update your results.
To download data of a dataset, make sure you are logged in.
Search for data and click on the title of one of the results, the landing page with the record’s details will be shown (next slide)
Every data set has its own record detail page showing name, data, abstract (description), keywords and PID
The included files are listed and can be downloaded. On the left the known PIDs are shown, while on the right side the metadata are visible.
You can upload all kinds of research data, there is no restriction on file format as long as the data source has a scientific background
Make sure you are allowed to upload the data and obey the data protection laws and restrictions of your country.
Unrelated personal data should not be stored on B2SHARE.
For the EUDAT-hosted B2SHARE Service there is no limit to the number of files to upload for a given record. The maximum file size is 10 GB per file while using the online deposit workflow. Every record is restricted to 20GB total.
To upload larger files contact the service administrator and ask what the possibilities are.
To restrict access to your data set make sure it is not Open Access during the deposit of your new record.
All metadata is publicly available on the B2SHARE instance, except for private information such as email addresses and phone numbers
Names and affiliations of the data owner or publisher will be visible publically
The B2SHARE service provider will:
Respect the user’s access restrictions
Assign persistent identifiers to each object using the B2HANDLE service
Store the data according to the B2SHARE service statements.
Custom players and viewers are available for specific file formats.
Using the EUDAT-hosted B2SHARE instance:
EUDAT has no claim of ownership over the data deposited in the service
Data is stored at CSC in Finland
EUDAT retains right to archive or create replicas
Access the B2SHARE service using the B2SHARE REST API:
Direct access
Integrate in your workflow or application
CRUDL operations
Metadata annotation
Get community information and metadata schemas
Download files
What are the costs? EUDAT B2SHARE is free of charge at the point of use!
It provides self-service registration through B2ACCESS, allows free upload and registration of stable research data. The data access policy is defined by data owner itself. By default, metadata is openly accessible and harvestable and the service allows customized metadata handling and customized user interfaces. Data integrity is ensured by checksums which are calculated during data ingest. Data is always kept online and storage usage is based on fair share principles. Data and metadata remain accessible for two years if instance is closed
EUDAT’s B2SHARE service is professionally managed by the Datacenter CSC Kajaani and certified by the ISO/IEC 27001:2005 standard for its information security management system. Service availability and usage of storage is monitored
User support provided via EUDAT ticketing system.
B2SHARE is continually improved and will be updated with new functionality:
Multiple service providers
Premium service for communities
Service provisioning based on SLAs
Choose your own trusted service provider
Direct connections to other EUDAT services, e.g. B2SAFE and B2DROP
Share data using groups
Social tagging
Have a look at our website for more information regarding B2SHARE
User documentation is also available here
B2SHARE hands-on training can be found on GitHub. Currently only API access using Python is covered. In the future more modules will be added.