CSC – Finnish ICT centre of expertise for research, education, culture and public administration
Research Data Management,
Challenges and Tools
Per Öster, CSC – IT Center for Science Ltd
• Drivers
o Number of devices
o Number of communicating apps
o Number of users
2.6.2017
10% of UK power consumption due to ICT: 1/3 network, 1/3 devices, 1/3 datacentres
Amount of data
Figure: the research cycle (Conceptualisation, Data gathering, Analysis, Publication, Review) surrounded by open science practices: open access, scientific blogs, collaborative bibliographies, alternative reputation systems, citizen science, open code, open workflows, open annotation, open data, pre-prints and data-intensive science.
Example services: Sci-starter.com, Runmycode.org, ArXiv, Roar.eprints.org, Impact Story, Altmetric.com, Mendeley.com, Academia.edu, Researchgate.com, Openannotation.org, Datadryad.org, Myexperiment.org, Figshare.com
An emerging ecosystem of services and standards
It's real!
The DCC Curation Lifecycle Model
The Curation Lifecycle
The DCC Curation Lifecycle Model provides a graphical high level overview of the stages required for successful curation and preservation of data from initial conceptualisation or receipt. The model can be used to plan activities within an organisation or consortium to ensure that all necessary stages are undertaken, each in the correct sequence. The model enables granular functionality to be mapped against it; to define roles and responsibilities, and build a framework of standards and technologies to implement. It can help with the process of identifying additional steps which may be required, or actions which are not required by certain situations or disciplines, and ensuring that processes and policies are adequately documented.
Data (Digital Objects or Databases)
Data, any information in binary digital form, is at the centre of the Curation Lifecycle. This includes:
Digital Objects
- Simple Digital Objects are discrete digital items, such as textual files, images or sound files, along with their related identifiers and metadata.
- Complex Digital Objects are discrete digital objects, made by combining a number of other digital objects, such as websites.
Databases
- Structured collections of records or data stored in a computer system.
Full Lifecycle Actions
- Description and Representation Information: Assign administrative, descriptive, technical, structural and preservation metadata, using appropriate standards, to ensure adequate description and control over the long-term. Collect and assign representation information required to understand and render both the digital material and the associated metadata.
- Preservation Planning: Plan for preservation throughout the curation lifecycle of digital material. This would include plans for management and administration of all curation lifecycle actions.
- Community Watch and Participation: Maintain a watch on appropriate community activities, and participate in the development of shared standards, tools and suitable software.
- Curate and Preserve: Be aware of, and undertake management and administrative actions planned to promote curation and preservation throughout the curation lifecycle.
Sequential Actions
- Conceptualise: Conceive and plan the creation of data, including capture method and storage options.
- Create or Receive: Create data including administrative, descriptive, structural and technical metadata. Preservation metadata may also be added at the time of creation. Receive data, in accordance with documented collecting policies, from data creators, other archives, repositories or data centres, and if required assign appropriate metadata.
- Appraise and Select: Evaluate data and select for long-term curation and preservation. Adhere to documented guidance, policies or legal requirements.
- Ingest: Transfer data to an archive, repository, data centre or other custodian. Adhere to documented guidance, policies or legal requirements.
- Preservation Action: Undertake actions to ensure long-term preservation and retention of the authoritative nature of data. Preservation actions should ensure that data remains authentic, reliable and usable while maintaining its integrity. Actions include data cleaning, validation, assigning preservation metadata, assigning representation information and ensuring acceptable data structures or file formats.
- Store: Store the data in a secure manner adhering to relevant standards.
- Access, Use and Reuse: Ensure that data is accessible to both designated users and reusers, on a day-to-day basis. This may be in the form of publicly available published information. Robust access controls and authentication procedures may be applicable.
- Transform: Create new data from the original, for example by migration into a different format, or by creating a subset, by selection or query, to create newly derived results, perhaps for publication.
Occasional Actions
- Dispose: Dispose of data, which has not been selected for long-term curation and preservation in accordance with documented policies, guidance or legal requirements. Typically data may be transferred to another archive, repository, data centre or other custodian. In some instances data is destroyed. The data’s nature may, for legal reasons, necessitate secure destruction.
- Reappraise: Return data which fails validation procedures for further appraisal and reselection.
- Migrate: Migrate data to a different format. This may be done to accord with the storage environment or to ensure the data’s immunity from hardware or software obsolescence.
www.dcc.ac.uk
info@dcc.ac.uk
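The model's claim that it helps "ensure that all necessary stages are undertaken, each in the correct sequence" can be illustrated with a small sketch. This is not part of the DCC model itself: the enum, the function name and the idea of checking a plain list of performed actions are assumptions made for illustration, and the ordering simply follows the Sequential Actions named on the slide. The model also notes that some actions are not required in every situation, so "missing" here means "worth a second look", not "error".

```python
from enum import IntEnum


class SequentialAction(IntEnum):
    """Sequential Actions of the lifecycle, numbered in slide order (illustrative)."""
    CONCEPTUALISE = 1
    CREATE_OR_RECEIVE = 2
    APPRAISE_AND_SELECT = 3
    INGEST = 4
    PRESERVATION_ACTION = 5
    STORE = 6
    ACCESS_USE_REUSE = 7
    TRANSFORM = 8


def review_plan(performed):
    """Return (missing, out_of_order) for a list of performed actions.

    missing: actions that never appear in the plan.
    out_of_order: actions that appear directly after a later-stage action.
    """
    missing = [a for a in SequentialAction if a not in performed]
    out_of_order = [
        later
        for earlier, later in zip(performed, performed[1:])
        if later < earlier
    ]
    return missing, out_of_order


# Hypothetical plan in which ingest happens before appraisal.
plan = [
    SequentialAction.CONCEPTUALISE,
    SequentialAction.CREATE_OR_RECEIVE,
    SequentialAction.INGEST,
    SequentialAction.APPRAISE_AND_SELECT,
    SequentialAction.STORE,
]
missing, out_of_order = review_plan(plan)
```

In this hypothetical plan, the review flags Appraise and Select as out of sequence and reports the three actions that were never planned.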
Secure Compute Clouds
Supporting sample logistics
• Federated Authentication
• Authorization
• Dataset registry
• Data transfer hub
• Policy and Legal Framework
Services and Coordination
High speed encrypted data transfer: GridFTP/Globus/Aspera
Secure data access remote API (GA4GH)
Sequencing centers
Data
Users
EGA
at
Data Archiving
Bringing users to data
Data Generation
Managing Access
Data Owner
Data Access Agreement
Data Access Committee
Data Request
Authorization Management Tools (EGA and CSC REMS)
From field measurements to open data
Questions:
Sensitive data?
Requirements on authentication and authorization?
SMEAR data flow
Figure: stages of the SMEAR data flow, with metadata attached at several stages:
- Instrument
- Measuring PCs: raw data at the stations
- File servers at stations: raw data and field diaries, calibration documents
- File servers in Helsinki: raw and intermediate data, documents, scripts
- SMEAR database: processed data in Helsinki
- ICOS, EBAS, ... databases: near real time and processed data outside UH
- IDA (CSC data service): raw data & document archive, database datasets
Other labels in the figure: A/D conversion, unit conversion; field documentation; researchers, data processing server; feedback on data quality.
Routine data processing =
(- unit conversion)
- calibration correction
- quality check, gapfilling
- averaging over space or time
https://avaa.tdata.fi/web/smart/smear
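The routine data processing steps named on the SMEAR slide (calibration correction, quality check, gapfilling, averaging over time) can be sketched as a small pipeline. This is an illustrative sketch only, not the SMEAR implementation: the function names, the linear calibration, the range-based quality check and the linear-interpolation gap filling are all assumptions, and unit conversion is left out because the slide lists it as optional.

```python
from statistics import mean


def calibrate(raw, gain=1.0, offset=0.0):
    """Apply an assumed linear calibration; missing samples (None) stay missing."""
    return [None if v is None else gain * v + offset for v in raw]


def quality_check(values, lo, hi):
    """Replace samples outside the plausible range [lo, hi] with None."""
    return [v if v is not None and lo <= v <= hi else None for v in values]


def fill_gaps(values):
    """Linearly interpolate interior gaps; gaps at either edge remain None."""
    filled = list(values)
    known = [i for i, v in enumerate(filled) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (filled[right] - filled[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = filled[left] + step * (i - left)
    return filled


def block_average(values, size):
    """Average consecutive blocks of `size` samples, skipping missing values."""
    averages = []
    for start in range(0, len(values), size):
        block = [v for v in values[start:start + size] if v is not None]
        averages.append(mean(block) if block else None)
    return averages


# Hypothetical raw readings: one missing sample and one obvious outlier.
raw = [10.0, 12.0, None, 16.0, 999.0, 20.0]
calibrated = calibrate(raw, gain=0.5, offset=1.0)
checked = quality_check(calibrated, lo=0.0, hi=50.0)
filled = fill_gaps(checked)
averaged = block_average(filled, size=3)
```

Keeping each step as a separate function mirrors the slide's list and makes it easy to store intermediate data alongside the raw data, as the file servers in the data flow do.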
Support in All Phases of Research Process
Plan: Customer Portal, Experts, Guides, Websites, Training, Service Desk
Produce & Collect Data: International resources, Modelling, Software, Supercomputers
Analyse: Cloud Services, Training, Data science, Computing, Software
Store: B2SAFE, B2SHARE, HPC Archive, IDA, Databases, Research long-term preservation (LTP)
Share & Publish: AVAA, B2DROP, B2SHARE, Databank, Etsin, Funet FileSender
Termination. Company may terminate your access to all or any part of the Service
at any time, with or without cause, with or without notice, effective immediately,
which may result in the forfeiture and destruction of all information associated with
your account, including User Submissions. If you wish to terminate your account,
you may do so by following instructions available on the Site. Any fees paid
hereunder are non-refundable. All provisions of the Terms of Use which by their
nature should survive termination shall survive termination, including, without
limitation, ownership provisions, warranty disclaimers, indemnity and limitations of
liability.
ARTICLE 2: DISCLAIMER
1. The Service is provided "as is" and the Provider disclaims any and all
representations and warranties, whether express or implied, including;- but
not limited to;- implied warranties of title, merchantability, fitness for any
particular purpose or non-infringement. The Provider does not promise any
specific results, effects or outcome from the use of the Service.
2. …
3. The Provider reserves the right to change, reduce, interrupt or discontinue
the Service or parts of it at any time.
4. No one has a right to use the Service; the Provider reserves the right to
exclude certain Users.
Are the commercial services sufficient?
• A nice complement, but they cannot serve as the fundamental infrastructure for research data of national and international interest
• Need for publicly funded and operated infrastructure
e-Science Data Factory
EUDAT CDI
“The EUDAT Collaborative Data
Infrastructure is a defined data model
and a set of technical standards and
policies adopted by European research
data centres and community data
repositories to create a single European
e-infrastructure of interoperable data
services.”
“To date, over 20 major European research organizations,
data and computing centres have signed an agreement to
sustain the EUDAT – pan European collaborative data
infrastructure for the next 10 years giving the birth to
the EUDAT Collaborative Data Infrastructure”
www.eudat.eu
Findable
– assign persistent IDs, provide rich metadata, register in a
searchable resource...
Accessible
– Retrievable by their ID using a standard protocol, metadata
remain accessible even if data aren’t...
Interoperable
– Use formal, broadly applicable languages, use standard
vocabularies, qualified references...
Reusable
– Rich, accurate metadata, clear licences, provenance, use of
community standards...
www.force11.org/group/fairgroup/fairprinciples
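As a rough illustration of how the four principles translate into something checkable, the sketch below tests a dataset description for fields that each principle asks for. The field names and the grouping per letter are hypothetical, chosen only to echo the bullet points above; they do not come from the FAIR principles text or from any EUDAT schema.

```python
# Hypothetical metadata fields per FAIR letter (assumed for illustration only).
REQUIRED_FIELDS = {
    "F": ["pid", "title", "description"],
    "A": ["access_url", "access_protocol"],
    "I": ["format", "vocabulary"],
    "R": ["licence", "provenance"],
}


def fair_gaps(record):
    """Return a dict mapping each FAIR letter to the fields the record lacks."""
    gaps = {}
    for letter, fields in REQUIRED_FIELDS.items():
        missing = [name for name in fields if not record.get(name)]
        if missing:
            gaps[letter] = missing
    return gaps


# Hypothetical dataset description with no vocabulary or provenance recorded.
record = {
    "pid": "12345/example-object",
    "title": "Example station measurements",
    "description": "Half-hourly averages from a hypothetical station.",
    "access_url": "https://example.org/data/example-object",
    "access_protocol": "https",
    "format": "text/csv",
    "licence": "CC BY",
}
gaps = fair_gaps(record)
```

A check like this only shows that fields are present; whether the metadata is rich and accurate enough still needs human review.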
EUDAT is FAIR
What is the EUDAT Service offer?
B2FIND: multi-disciplinary metadata catalogue
now: common metadata catalogue, harvesting across
all CDI data, single point for data discoverability;
in development: aim to improve with agreed basic
metadata for all data objects.
B2HANDLE: policy-based prefix & PID management
now: common PID mechanism across all CDI data;
in development: aim to improve with agreed common
schema and behaviour.
B2SHARE: research data repository
now: full, tailored metadata support for data deposits.
B2SAFE: policy-driven data management
in development: aim to introduce a common data
model promoting metadata extraction and processing.
EUDAT & Findable
F
B2STAGE: data staging service
B2SHARE: research data repository
B2SAFE: policy-driven data management
now: EUDAT presents data through common Internet protocols and
APIs, HTTP and GridFTP;
in development: aim to improve with a single HTTP API for all services
and data.
EUDAT & Accessible
A
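"Retrievable by their ID using a standard protocol" can be made concrete with persistent identifiers. The sketch below assumes a Handle-style PID of the form prefix/suffix and builds the HTTPS address at which the public Handle proxy (hdl.handle.net) would redirect to the object. The example identifier is made up, the function names are hypothetical, and nothing here is taken from the B2HANDLE software itself.

```python
from urllib.parse import quote

# Public Handle System proxy; resolving a handle here redirects to its target.
HANDLE_PROXY = "https://hdl.handle.net"


def split_handle(handle):
    """Split 'prefix/suffix' at the first slash, rejecting malformed input."""
    prefix, sep, suffix = handle.partition("/")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a prefix/suffix handle: {handle!r}")
    return prefix, suffix


def resolver_url(handle):
    """Build the HTTPS URL that resolves the handle through the proxy."""
    prefix, suffix = split_handle(handle)
    return f"{HANDLE_PROXY}/{quote(prefix)}/{quote(suffix, safe='/')}"


# Made-up identifier, used only to show the shape of the result.
url = resolver_url("12345/example-object")
```

Because the identifier resolves over plain HTTPS, any browser or script can follow it without knowing where the data is currently stored.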
B2HANDLE: policy-based prefix & PID management
now: common PID mechanism across all CDI data;
in development: aim to improve with agreed common
schema and behaviour.
B2STAGE: data staging service
B2SHARE: research data repository
B2SAFE: policy-driven data management
in development: single HTTP API for all services and data
(interoperability of data services, if not data!).
B2FIND: multi-disciplinary metadata catalogue
in development: agreed basic metadata for all data
objects (a degree of metadata interop).
EUDAT & Interoperable
I
B2SHARE: research data repository
B2SAFE: policy-driven data management
now: encourage use of CC BY v 3 as common open data licence;
encourage open formats where we have any influence.
EUDAT & Reusable
R
Acknowledgment
• Tommi Nyrönen, ELIXIR-Finland Head of Node
• Mikael Linden, CSC
• Timmo Vesala, INAR RI, Helsinki University
• Damien Lecarpentier, EUDAT Project Director
facebook.com/CSCfi
twitter.com/CSCfi
youtube.com/CSCfi
linkedin.com/company/csc---it-center-for-science
Images: CSC archive and Thinkstock
CSC – IT Center for Science Ltd
Per Öster
Director, Research Infrastructures
per.oster@csc.fi

More Related Content

What's hot

20160523 23 Research Data Things
20160523 23 Research Data Things20160523 23 Research Data Things
20160523 23 Research Data Things
Katina Toufexis
 
RDM policy and recovering costs
RDM policy and recovering costsRDM policy and recovering costs
RDM policy and recovering costs
Sarah Jones
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
Sarah Jones
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
National Information Standards Organization (NISO)
 
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
OAbooks
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
Sarah Jones
 
Open Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon HodsonOpen Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon Hodson
Academy of Science of South Africa (ASSAf)
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
Sarah Jones
 
Paul Jeffreys - Research Integrity: Institutional Responsibility
Paul Jeffreys - Research Integrity: Institutional ResponsibilityPaul Jeffreys - Research Integrity: Institutional Responsibility
Paul Jeffreys - Research Integrity: Institutional Responsibility
Jisc
 
EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasets
EDINA, University of Edinburgh
 
Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11
National Information Standards Organization (NISO)
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
EDINA, University of Edinburgh
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
Katina Toufexis
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and Practice
LizLyon
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of Stirling
Lisa Haddow
 
H2020 open-data-pilot
H2020 open-data-pilotH2020 open-data-pilot
H2020 open-data-pilot
Sarah Jones
 
LSHTM Research Data Management Policy: An Overview
LSHTM Research Data Management Policy: An OverviewLSHTM Research Data Management Policy: An Overview
LSHTM Research Data Management Policy: An Overview
London School of Hygiene and Tropical Medicine
 
How to elaborate a data management plan
How to elaborate a data management planHow to elaborate a data management plan
How to elaborate a data management plan
Biblioteca de la Universitat Jaume I
 
Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...
LIBER Europe
 
Open Science: What, why, how?
Open Science: What, why, how? Open Science: What, why, how?
Open Science: What, why, how?
Biblioteca de la Universitat Jaume I
 

What's hot (20)

20160523 23 Research Data Things
20160523 23 Research Data Things20160523 23 Research Data Things
20160523 23 Research Data Things
 
RDM policy and recovering costs
RDM policy and recovering costsRDM policy and recovering costs
RDM policy and recovering costs
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
 
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Open Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon HodsonOpen Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon Hodson
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
 
Paul Jeffreys - Research Integrity: Institutional Responsibility
Paul Jeffreys - Research Integrity: Institutional ResponsibilityPaul Jeffreys - Research Integrity: Institutional Responsibility
Paul Jeffreys - Research Integrity: Institutional Responsibility
 
EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasets
 
Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and Practice
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of Stirling
 
H2020 open-data-pilot
H2020 open-data-pilotH2020 open-data-pilot
H2020 open-data-pilot
 
LSHTM Research Data Management Policy: An Overview
LSHTM Research Data Management Policy: An OverviewLSHTM Research Data Management Policy: An Overview
LSHTM Research Data Management Policy: An Overview
 
How to elaborate a data management plan
How to elaborate a data management planHow to elaborate a data management plan
How to elaborate a data management plan
 
Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...
 
Open Science: What, why, how?
Open Science: What, why, how? Open Science: What, why, how?
Open Science: What, why, how?
 

Similar to Research Data Management, Challenges and Tools - Per Öster

Providing support and services for researchers in good data governance
Providing support and services for researchers in good data governanceProviding support and services for researchers in good data governance
Providing support and services for researchers in good data governance
Robin Rice
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
Digital Curation Centre (DCC)
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
EDINA, University of Edinburgh
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
Research Data Service geosciences 18oct2018
Research Data Service geosciences 18oct2018Research Data Service geosciences 18oct2018
Research Data Service geosciences 18oct2018
University of Edinburgh
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
Sarah Jones
 
A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)
Denodo
 
Lab Datareach Presentation V5
Lab Datareach Presentation V5Lab Datareach Presentation V5
Lab Datareach Presentation V5
damonhough
 
Moore RDAP11 Policy-based Data Management
Moore RDAP11 Policy-based Data ManagementMoore RDAP11 Policy-based Data Management
Moore RDAP11 Policy-based Data Management
ASIS&T
 
TDWI Checklist Report: Active Data Archiving
TDWI Checklist Report:  Active Data ArchivingTDWI Checklist Report:  Active Data Archiving
TDWI Checklist Report: Active Data Archiving
RainStor
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
EUDAT
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
World Agroforestry (ICRAF)
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
DataScienceConferenc1
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
Historic Environment Scotland
 
Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010
Sarah Jones
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
Denodo
 
Virtual Research Environments supporting tailor-made data management service...
Virtual Research Environments supporting tailor-made data management service...Virtual Research Environments supporting tailor-made data management service...
Virtual Research Environments supporting tailor-made data management service...
Blue BRIDGE
 
Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management
Gary Wilhelm
 
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
dri_ireland
 
Prototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional RepositoryPrototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional Repository
DMR (Directorate of Mushroom Research), ICAR, GOI
 

Similar to Research Data Management, Challenges and Tools - Per Öster (20)

Providing support and services for researchers in good data governance
Providing support and services for researchers in good data governanceProviding support and services for researchers in good data governance
Providing support and services for researchers in good data governance
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Research Data Service geosciences 18oct2018
Research Data Service geosciences 18oct2018Research Data Service geosciences 18oct2018
Research Data Service geosciences 18oct2018
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)
 
Lab Datareach Presentation V5
Lab Datareach Presentation V5Lab Datareach Presentation V5
Lab Datareach Presentation V5
 
Moore RDAP11 Policy-based Data Management
Moore RDAP11 Policy-based Data ManagementMoore RDAP11 Policy-based Data Management
Moore RDAP11 Policy-based Data Management
 
TDWI Checklist Report: Active Data Archiving
TDWI Checklist Report:  Active Data ArchivingTDWI Checklist Report:  Active Data Archiving
TDWI Checklist Report: Active Data Archiving
 
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
Linking HPC to Data Management - EUDAT Summer School (Giuseppe Fiameni, CINECA)
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
 
Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
 
Virtual Research Environments supporting tailor-made data management service...
Virtual Research Environments supporting tailor-made data management service...Virtual Research Environments supporting tailor-made data management service...
Virtual Research Environments supporting tailor-made data management service...
 
Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management
 
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
Rebecca Grant - Archiving and Digital Preservation (Figshare Fest)
 
Prototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional RepositoryPrototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional Repository
 

More from LEARN Project

LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM PolicyLEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
LEARN Project
 
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3mResearch Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
Research Data in an Open Science World - Prof. Dr. Eva Mendez, uc3m
LEARN Project
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of Chile
LEARN Project
 
LEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Final Conference: Tutorial Group | How To Engage Early Career ResearchersLEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Final Conference: Tutorial Group | How To Engage Early Career Researchers
LEARN Project
 
LEARN Final Conference: Tutorial Group | Costing RDM
LEARN Final Conference: Tutorial Group | Costing RDMLEARN Final Conference: Tutorial Group | Costing RDM
LEARN Final Conference: Tutorial Group | Costing RDM
LEARN Project
 
Paolo Budroni at COAR Annual Meeting
Research Data Management, Challenges and Tools - Per Öster

  • 1. CSC – Finnish ICT centre of expertise for research, education, culture and public administration. Research Data Management, Challenges and Tools. Per Öster, CSC – IT Center for Science Ltd
  • 2. Amount of data. Drivers: number of devices, number of communicating apps, number of users. 10% of UK power consumption is due to ICT: 1/3 network, 1/3 devices, 1/3 datacentres. (2.6.2017)
  • 3. An emerging ecosystem of services and standards. It's real! Research stages: conceptualisation, data gathering, analysis, publication, review. Practices: open access, scientific blogs, collaborative bibliographies, alternative reputation systems, citizen science, open code, open workflows, open annotation, open data, pre-print, data-intensive. Services: Scistarter.com, Runmycode.org, ArXiv, Roar.eprints.org, Impact Story, Altmetric.com, Mendeley.com, Academia.edu, Researchgate.com, Openannotation.org, Datadryad.org, Myexperiment.org, Figshare.com.
The DCC Curation Lifecycle Model (www.dcc.ac.uk, info@dcc.ac.uk)
The Curation Lifecycle: The DCC Curation Lifecycle Model provides a graphical high level overview of the stages required for successful curation and preservation of data from initial conceptualisation or receipt. The model can be used to plan activities within an organisation or consortium to ensure that all necessary stages are undertaken, each in the correct sequence. The model enables granular functionality to be mapped against it; to define roles and responsibilities, and build a framework of standards and technologies to implement. It can help with the process of identifying additional steps which may be required, or actions which are not required by certain situations or disciplines, and ensuring that processes and policies are adequately documented.
Data (Digital Objects or Databases): Data, any information in binary digital form, is at the centre of the Curation Lifecycle. This includes:
- Digital Objects: Simple Digital Objects are discrete digital items, such as textual files, images or sound files, along with their related identifiers and metadata. Complex Digital Objects are discrete digital objects, made by combining a number of other digital objects, such as websites.
- Databases: Structured collections of records or data stored in a computer system.
Full Lifecycle Actions:
- Description and Representation Information: Assign administrative, descriptive, technical, structural and preservation metadata, using appropriate standards, to ensure adequate description and control over the long-term. Collect and assign representation information required to understand and render both the digital material and the associated metadata.
- Preservation Planning: Plan for preservation throughout the curation lifecycle of digital material. This would include plans for management and administration of all curation lifecycle actions.
- Community Watch and Participation: Maintain a watch on appropriate community activities, and participate in the development of shared standards, tools and suitable software.
- Curate and Preserve: Be aware of, and undertake management and administrative actions planned to promote curation and preservation throughout the curation lifecycle.
Sequential Actions:
- Conceptualise: Conceive and plan the creation of data, including capture method and storage options.
- Create or Receive: Create data including administrative, descriptive, structural and technical metadata. Preservation metadata may also be added at the time of creation. Receive data, in accordance with documented collecting policies, from data creators, other archives, repositories or data centres, and if required assign appropriate metadata.
- Appraise and Select: Evaluate data and select for long-term curation and preservation. Adhere to documented guidance, policies or legal requirements.
- Ingest: Transfer data to an archive, repository, data centre or other custodian. Adhere to documented guidance, policies or legal requirements.
- Preservation Action: Undertake actions to ensure long-term preservation and retention of the authoritative nature of data. Preservation actions should ensure that data remains authentic, reliable and usable while maintaining its integrity. Actions include data cleaning, validation, assigning preservation metadata, assigning representation information and ensuring acceptable data structures or file formats.
- Store: Store the data in a secure manner adhering to relevant standards.
- Access, Use and Reuse: Ensure that data is accessible to both designated users and reusers, on a day-to-day basis. This may be in the form of publicly available published information. Robust access controls and authentication procedures may be applicable.
- Transform: Create new data from the original, for example by migration into a different format, or by creating a subset, by selection or query, to create newly derived results, perhaps for publication.
Occasional Actions:
- Dispose: Dispose of data which has not been selected for long-term curation and preservation, in accordance with documented policies, guidance or legal requirements. Typically data may be transferred to another archive, repository, data centre or other custodian. In some instances data is destroyed. The data's nature may, for legal reasons, necessitate secure destruction.
- Reappraise: Return data which fails validation procedures for further appraisal and reselection.
- Migrate: Migrate data to a different format. This may be done to accord with the storage environment or to ensure the data's immunity from hardware or software obsolescence.
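The Sequential Actions and two of the Occasional Actions (Dispose, Reappraise) can be read as a small state machine. The sketch below is only an illustration of that reading: the `Action` enum, `next_action` and its `selected`/`valid` flags are hypothetical names, and stopping after Transform is a simplifying assumption rather than something the model text states.

```python
from enum import Enum
from typing import Optional


class Action(Enum):
    CONCEPTUALISE = "Conceptualise"
    CREATE_OR_RECEIVE = "Create or Receive"
    APPRAISE_AND_SELECT = "Appraise and Select"
    INGEST = "Ingest"
    PRESERVATION_ACTION = "Preservation Action"
    STORE = "Store"
    ACCESS_USE_REUSE = "Access, Use and Reuse"
    TRANSFORM = "Transform"
    DISPOSE = "Dispose"  # occasional action, not part of the sequence


# The eight Sequential Actions in the order the model lists them.
SEQUENCE = [
    Action.CONCEPTUALISE,
    Action.CREATE_OR_RECEIVE,
    Action.APPRAISE_AND_SELECT,
    Action.INGEST,
    Action.PRESERVATION_ACTION,
    Action.STORE,
    Action.ACCESS_USE_REUSE,
    Action.TRANSFORM,
]


def next_action(current: Action, selected: bool = True,
                valid: bool = True) -> Optional[Action]:
    """Return the action that follows `current`, or None when the sketch ends."""
    if current is Action.DISPOSE:
        return None
    # Dispose: data not selected for long-term curation leaves the lifecycle.
    if current is Action.APPRAISE_AND_SELECT and not selected:
        return Action.DISPOSE
    # Reappraise: data failing validation goes back for appraisal.
    if current is Action.PRESERVATION_ACTION and not valid:
        return Action.APPRAISE_AND_SELECT
    position = SEQUENCE.index(current)
    if position + 1 < len(SEQUENCE):
        return SEQUENCE[position + 1]
    return None  # assumption: the sketch stops after Transform
```

The Full Lifecycle Actions are deliberately left out, since the model describes them as applying throughout rather than at one step.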
  • 5. Bringing users to data (EGA). Stages: Data Generation (sequencing centers); Data Archiving (EGA); Managing Access (Data Owner, Data Access Agreement, Data Access Committee, Data Request, Authorization Management Tools: EGA and CSC REMS); Data Users (Secure Compute Clouds). Transfer and access: high speed encrypted data transfer (GridFTP/Globus/Aspera); secure data access through a remote API (GA4GH). Supporting sample logistics. Services and Coordination: Federated Authentication, Authorization, Dataset registry, Data transfer hub, Policy and Legal Framework.
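One way to picture the access-management step on this slide is as a check that a data request carries both a signed Data Access Agreement and an approval from the dataset's Data Access Committee. The sketch below is a hypothetical illustration only: `Dataset`, `AccessRequest` and `is_authorized` are invented names and do not reflect the EGA or CSC REMS interfaces.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    dataset_id: str
    committee: set = field(default_factory=set)  # Data Access Committee members


@dataclass
class AccessRequest:
    user: str
    dataset_id: str
    agreement_signed: bool = False               # Data Access Agreement accepted
    approved_by: set = field(default_factory=set)


def is_authorized(request: AccessRequest, dataset: Dataset) -> bool:
    """Grant access only with a signed agreement and a committee approval."""
    if request.dataset_id != dataset.dataset_id:
        return False
    if not request.agreement_signed:
        return False
    # At least one approver must actually sit on this dataset's committee.
    return bool(request.approved_by & dataset.committee)
```

Requiring a single committee approval is an assumption made for brevity; a real policy might demand a quorum or an expiry date.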
  • 6. From field measurements to open data. Questions: Sensitive data? Requirements on authentication and authorization?
  • 7. SMEAR data flow. Instrument (A/D conversion, unit conversion). Measuring PCs: raw data at the stations. File servers at stations: raw data, field diaries and calibration documents. File servers in Helsinki: raw and intermediate data, documents, scripts. SMEAR database: processed data in Helsinki. ICOS, EBAS, ... databases: near real time and processed data outside UH. IDA (CSC data service): raw data and document archive, database datasets. Routine data processing = (unit conversion), calibration correction, quality check and gapfilling, averaging over space or time. Also shown: field documentation; researchers and the data processing server; feedback on data quality; metadata at several stages. https://avaa.tdata.fi/web/smart/smear
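The routine data processing steps named on this slide (unit conversion, calibration correction, quality check, gapfilling, averaging over time) can be sketched as a chain of small functions. This is a minimal illustration under stated assumptions, not the SMEAR implementation: all function names, the linear calibration, the range-based quality check and the linear-interpolation gap filling are choices made here for the example.

```python
from statistics import mean
from typing import List, Optional

Series = List[Optional[float]]  # None marks a missing sample


def convert_units(raw: Series, scale: float) -> Series:
    return [None if v is None else v * scale for v in raw]


def calibrate(values: Series, offset: float, gain: float) -> Series:
    # Assumed linear calibration correction.
    return [None if v is None else (v - offset) * gain for v in values]


def quality_check(values: Series, low: float, high: float) -> Series:
    # Samples outside the plausible range are flagged as missing.
    return [v if v is not None and low <= v <= high else None for v in values]


def fill_gaps(values: Series) -> Series:
    # Linear interpolation between valid neighbours; edge gaps stay missing.
    filled = list(values)
    known = [i for i, v in enumerate(values) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (values[right] - values[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = values[left] + step * (i - left)
    return filled


def block_average(values: Series, size: int) -> Series:
    averaged: Series = []
    for start in range(0, len(values), size):
        block = [v for v in values[start:start + size] if v is not None]
        averaged.append(mean(block) if block else None)
    return averaged


def process(raw: Series, scale: float, offset: float, gain: float,
            low: float, high: float, size: int) -> Series:
    values = convert_units(raw, scale)
    values = calibrate(values, offset, gain)
    values = quality_check(values, low, high)
    values = fill_gaps(values)
    return block_average(values, size)
```

Keeping each step as its own function mirrors the slide's list and makes it easy to archive intermediate data between steps, as the data flow suggests.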
  • 8. Support in All Phases of Research Process (2016). Plan: Customer Portal, Experts, Guides, Websites, Training, Service Desk. Produce & Collect Data: International resources, Modelling, Software, Supercomputers. Analyse: Cloud Services, Training, Data science, Computing, Software. Store: B2SAFE, B2SHARE, HPC Archive, IDA, Databases, Research long-term preservation (LTP). Share & Publish: AVAA, B2DROP, B2SHARE, Databank, Etsin, Funet FileSender.
  • 9.
  • 11. ARTICLE 2: DISCLAIMER 1. The Service is provided "as is" and the Provider disclaims any and all representations and warranties, whether express or implied, including, but not limited to, implied warranties of title, merchantability, fitness for any particular purpose or non-infringement. The Provider does not promise any specific results, effects or outcome from the use of the Service. 2. … 3. The Provider reserves the right to change, reduce, interrupt or discontinue the Service or parts of it at any time. 4. No one has a right to use the Service; the Provider reserves the right to exclude certain Users.
  • 12. Are the commercial services sufficient? • A nice complement, but they cannot serve as the fundamental infrastructure for research data of national and international interest • Need for publicly funded and operated infrastructure. e-Science Data Factory
  • 13. EUDAT CDI “The EUDAT Collaborative Data Infrastructure is a defined data model and a set of technical standards and policies adopted by European research data centres and community data repositories to create a single European e-infrastructure of interoperable data services.” “To date, over 20 major European research organizations, data and computing centres have signed an agreement to sustain the EUDAT – pan European collaborative data infrastructure for the next 10 years giving the birth to the EUDAT Collaborative Data Infrastructure” www.eudat.eu
  • 14.
  • 15. EUDAT is FAIR. Findable – assign persistent IDs, provide rich metadata, register in a searchable resource... Accessible – retrievable by their ID using a standard protocol, metadata remain accessible even if data aren't... Interoperable – use formal, broadly applicable languages, use standard vocabularies, qualified references... Reusable – rich, accurate metadata, clear licences, provenance, use of community standards... www.force11.org/group/fairgroup/fairprinciples
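As a rough illustration of how the four principles translate into a check on a metadata record, the sketch below reports which principles a record does not yet satisfy. The field names (`persistent_id`, `metadata`, `access_protocol`, `vocabulary`, `licence`, `provenance`) are hypothetical and chosen only for this example; they are not an EUDAT or FORCE11 schema.

```python
from typing import Dict, List

# Hypothetical mapping from each FAIR principle to the record fields it needs.
FAIR_CHECKS: Dict[str, List[str]] = {
    "Findable": ["persistent_id", "metadata"],
    "Accessible": ["access_protocol"],
    "Interoperable": ["vocabulary"],
    "Reusable": ["licence", "provenance"],
}


def fair_gaps(record: dict) -> Dict[str, List[str]]:
    """Return, per principle, the fields that are missing or empty."""
    gaps: Dict[str, List[str]] = {}
    for principle, fields in FAIR_CHECKS.items():
        missing = [name for name in fields if not record.get(name)]
        if missing:
            gaps[principle] = missing
    return gaps
```

An empty result means every field in this simplified checklist is present; it says nothing about the quality of the metadata itself.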
  • 16. What is the EUDAT Service offer?
  • 17. EUDAT & Findable. B2FIND (multi-disciplinary metadata catalogue): now, a common metadata catalogue harvesting across all CDI data, a single point for data discoverability; in development, agreed basic metadata for all data objects. B2HANDLE (policy-based prefix & PID management): now, a common PID mechanism across all CDI data; in development, an agreed common schema and behaviour. B2SHARE (research data repository): now, full, tailored metadata support for data deposits. B2SAFE (policy-driven data management): in development, a common data model promoting metadata extraction and processing.
  • 18. EUDAT & Accessible. Services: B2STAGE (data staging service), B2SHARE (research data repository), B2SAFE (policy-driven data management). Now: EUDAT presents data through common Internet protocols and APIs, HTTP and GridFTP. In development: a single HTTP API for all services and data.
  • 19. EUDAT & Interoperable. B2HANDLE (policy-based prefix & PID management): now, a common PID mechanism across all CDI data; in development, an agreed common schema and behaviour. B2STAGE (data staging service), B2SHARE (research data repository), B2SAFE (policy-driven data management): in development, a single HTTP API for all services and data (interoperability of data services, if not data!). B2FIND (multi-disciplinary metadata catalogue): in development, agreed basic metadata for all data objects (a degree of metadata interoperability).
  • 20. EUDAT & Reusable. B2SHARE (research data repository) and B2SAFE (policy-driven data management): now, encourage use of CC BY v 3 as a common open data licence; encourage open formats where we have any influence.
  • 21. Acknowledgment • Tommi Nyrönen, ELIXIR-Finland Head of Node • Mikael Linden, CSC • Timmo Vesala, INAR RI, Helsinki University • Damien Lecarpentier, EUDAT Project Director
  • 22. CSC – IT Center for Science Ltd. Per Öster, Director, Research Infrastructures, per.oster@csc.fi. facebook.com/CSCfi, twitter.com/CSCfi, youtube.com/CSCfi, linkedin.com/company/csc---it-center-for-science. Images: CSC archive and Thinkstock.