Research Data Management from 
a disciplinary perspective 
Sarah Jones 
Digital Curation Centre 
sarah.jones@glasgow.ac.uk 
Twitter: @sjDCC 
Stéphane Goldstein 
Research Information Network 
stephane.goldstein@researchinfonet.org 
Twitter: @stephgold7
Disclaimer 
Practice varies greatly by 
discipline and sub-discipline 
so it’s hard to generalise 
Apologies for any sweeping 
statements and groupings 
that don’t fit your model 
Image credit: Sweep by Judy Van der Velden CC-BY-NC-ND 
www.flickr.com/photos/judy-van-der-velden/6757403261
Case studies on disciplinary practice 
RIN Information Seeking and Sharing Behaviour 
www.rin.ac.uk/our-work/using-and-accessing-information-resources 
– Life sciences 
– Humanities 
– Physical sciences 
RIN Open Science Case studies 
www.rin.ac.uk/our-work/data-management-and-curation/open-science-case-studies 
SCARP case studies www.dcc.ac.uk/resources/case-studies/scarp 
Knowledge Exchange Incentives and motivations for sharing research 
data (forthcoming) 
RLUK research data typology (more from Stéphane)
Groups and disciplines 
Arts & Humanities 
– Creative arts, languages, philosophy, archaeology… 
Social Science 
– Economics, history, politics, business, psychology... 
Sciences & Engineering 
– Physics, astronomy, earth sciences, computing… 
Life Sciences 
– Biology, ecology, medical and veterinary science…
Arts & Humanities 
Outputs may not be termed 
‘data’ e.g. sketches, writing, 
performance, artefacts, ‘work’ 
Focus on literary outputs & 
manuscripts in some disciplines 
More use of standard tools e.g. 
Word, Excel – less likely to 
adapt technologies to fit 
Arguably lower awareness and 
uptake of RDM overall
Creative Arts 
Several RDM projects in the creative arts e.g. Kultivate, 
KAPTUR, VADS4R, CAiRO training... 
Resistance to term ‘data’ – too scientific 
Importance of personal websites for profile as work is 
also conducted outside of academia 
Visual Arts Data Service - www.vads.ac.uk 
Institutional repositories at arts schools accept a broader 
range of outputs and display content more visually to fill 
the void e.g. http://research.gold.ac.uk
Sonic Arts Research Unit 
Collaboration with IR as a 
result of losing data 
Tension between providing 
access in a visual / usable 
way and preserving data 
Still use SoundCloud and 
personal websites for 
access, but these link to 
‘master’ copy of data held 
in IR for preservation 
www.dcc.ac.uk/resources/developing-rdm-services/repository-radar
Digital Humanities 
Intentional creation of resources rather than just data as 
by-product of research process 
More use of standards e.g. XML & TEI in language 
resources, image standards and capture quality for 
digitisation, Dublin Core metadata… 
Often include technical experts in project team 
Links with cultural heritage collections 
Negotiating copyright often a major issue 
Sustainability a big challenge
Mapping Edinburgh’s Social History 
Overlays historical maps with all kinds of 
open data to chart how the town has changed 
through time 
Uses open source tools 
Allows you to overlay maps 
Picks up on common themes 
www.mesh.ed.ac.uk
Social Sciences 
• Greater awareness and acceptance 
of RDM by community 
• Methodology is as much a factor in 
determining difference as discipline 
• Nature of data often poses 
challenges for sharing 
• Lots of reuse of large survey data 
• Established metadata standards e.g. 
Data Documentation Initiative (DDI) 
• Strong international data centre 
infrastructure
Public health 
Ethics predominant concern 
– How to negotiate consent 
– How to store, transfer & handle data securely 
– How to anonymise and share data 
Data integration / linking and curation of longitudinal studies are a 
major concern, as data is added to over decades 
Need for data havens to help control access to data – role for 
unis e.g. Grampian Data Safe Haven 
UK Data Service - http://ukdataservice.ac.uk
Twenty-07: Public health study 
Longitudinal study following 4510 people from West of Scotland 
over 20 years to investigate the reasons for differences in health 
Undertook interviews, questionnaires, physical measurements, 
blood samples etc 
Strict access controls and guidelines for data collection 
Data managed within the MRC Social and Public Health Sciences 
Unit and accessible under a data sharing agreement - 
http://2007study.sphsu.mrc.ac.uk/Revised-Data-Sharing-Policy-has-been-
Life Sciences 
• Funders arguably more demanding 
in terms of data sharing policy 
• Sharing can be problematic / resisted 
given the nature of the data, fear of 
misuse or loss of control over IPR 
• Data sharing agreements and access 
committees more common 
• Data integration & mining key 
drivers 
• Research is well-resourced so greater 
capacity to fund local solutions and 
tools for RDM during projects
Genetics 
Vast quantities of data and rapid growth 
– DNA sequence data is doubling every 6-8 months 
Well established public databases for gene sequences e.g. 
GenBank www.ncbi.nlm.nih.gov/genbank 
– However even this is on short-term project funding! 
Need accession number to publish so driver for sharing and 
established workflow 
European Data Infrastructure projects too e.g. ELIXIR
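To put the doubling time quoted above in annual terms, here is a minimal sketch. The function name is illustrative and the calculation assumes steady exponential growth, which the slide does not state.

```python
def annual_growth_factor(doubling_months: float) -> float:
    """Factor by which a quantity grows in 12 months, given its doubling time.

    Assumes steady exponential growth (a simplification for illustration).
    """
    return 2 ** (12 / doubling_months)


# The slide quotes a doubling time of 6-8 months for DNA sequence data.
for months in (6, 8):
    factor = annual_growth_factor(months)
    print(f"doubling every {months} months -> x{factor:.2f} per year")
```

Under that assumption, the quoted range corresponds to roughly a three- to four-fold increase each year.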
Neuroscience 
• Large data volumes due to use of medical imaging 
• Moving towards larger cohort studies integrating wider range of data types, 
which strains the balance with ethical requirements around personal data 
• Costs of data gathering and advances in analysis technology are making field 
more data intensive - computational methods 
• Small interdisciplinary teams provide the human infrastructure for RDM, but 
historically low funder investment in data management at lab level 
• Disciplinary archives are immature, which has encouraged a tendency for labs to 
treat longitudinal datasets as intellectual capital
OMERO – Open Microscopy 
Environment 
Monash e-Research Centre 
helps groups to adopt (and if 
needed adapt) existing 
technological solutions 
Partnered a research group to 
implement OMERO, a secure 
central repository to help 
researchers organise, analyze 
and share images 
Resulting tool more 
sustainable as tailored to 
specific community need 
www.dcc.ac.uk/resources/developing-rdm-services/improving-rdm-monash
Science & Engineering 
• Large scale can mean RDM is built in 
as standard and sharing part of 
workflow e.g. facilities science 
• Often early adopters and advocates 
of new technologies e.g. the Grid, 
wikis & arXiv in particle physics 
• Archiving established in some cases 
as data can’t be recreated e.g. NERC 
data centres for Earth Sciences 
• Commercial sensitivities can place 
restrictions on sharing in some fields 
Mechanical Engineering 
Several RDM projects at Bath e.g. ERIM, REDm-MED 
Concept of repository well established in industrial engineering 
– Product Lifecycle Management (PLM) systems 
Preservation issues as data is challenging e.g. CAD files 
Less information sharing than other disciplines 
– Commercial sensitivities preclude sharing 
– Consultancy-style research can lead to internal-only results 
– Data generated from private systems, so less applicable to others
Crystallography 
X-ray examinations, images and videos of crystal structures, 
chemical crystallography diffraction images 
Established metadata standards e.g. Crystallographic 
Information Framework (CIF) 
Advocates of open science and use of related tools 
– UsefulChem - http://usefulchem.wikispaces.com 
– LabTrove - www.labtrove.org 
eCrystals Archive and Crystallography Open Database (COD) 
National Crystallography Service - www.ncs.ac.uk
Astronomy 
Established data standards (e.g. FITS and NOA) maintained by 
community 
Access to facilities requires the deposit of raw data, although 
this can be embargoed 
International data centres e.g. Sloan Digital Sky Survey - 
www.sdss.org 
Large volumes of data so transfer can be difficult 
Few IPR issues compared to other disciplines 
Data products are not always shared
Galaxy Zoo 
Citizen Science project started to 
classify a million galaxies imaged by 
the Sloan Digital Sky Survey 
Over 50 million classifications in the 
first year, contributed by more than 
150,000 people 
Classifications were as good as those 
from professional astronomers 
Further projects in astronomy, 
climatology, biology, humanities… www.galaxyzoo.org
Research data typology 
Commissioned by RLUK 
Aim: to help librarians improve their ability to 
engage with researchers on RDM matters; and 
to enable them to acquire a better 
understanding of the needs of researchers 
A resource structured around a suggested 
typology of research data, looking at different 
ways in which data might be categorised
Broad data types 
1. How do researchers generate and process data, and 
for what purpose? 
1.1 Method of creation and collection of research data: 
where the data comes from 
1.2 Readiness of research data: extent to which data 
has been processed 
1.3 Use of research data: researchers' main purpose for 
accessing and using data 
2. In what file formats, media and volumes do researchers 
generate data? 
2.1 Medium and format for research data: objects in which 
data is captured and recorded, electronic storage and file 
types 
2.2 Electronic data volumes: size of files (this is subjective, 
and based largely on the perception of researchers) 
3. How do researchers manage and store their data? 
3.1 Storage of research data: where and how data is kept 
3.2 Types of metadata: not an exhaustive list, but these are 
widely-recognised metadata standards 
3.3 Metadata standards 
3.4 Degree of openness: founded on Royal Society's 
categorisation of 'intelligent openness' 
3.5 Licensing of research data: legal rights appertaining to the 
use of the data
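The categories above could be held in a simple structure to which disciplinary examples are attached. This is a minimal sketch only: the `Category` class, the `TYPOLOGY` mapping and the `add_example` helper are hypothetical names, not part of the RLUK resource. The examples attached are standards named elsewhere in these slides (DDI, CIF, TEI).

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Category:
    """One numbered category of the typology (hypothetical representation)."""
    code: str
    title: str
    # Maps a discipline name to the examples recorded for it.
    examples: Dict[str, List[str]] = field(default_factory=dict)

    def add_example(self, discipline: str, example: str) -> None:
        self.examples.setdefault(discipline, []).append(example)


# Category codes and titles as listed on the slide.
TYPOLOGY = {c.code: c for c in [
    Category("1.1", "Method of creation and collection of research data"),
    Category("1.2", "Readiness of research data"),
    Category("1.3", "Use of research data"),
    Category("2.1", "Medium and format for research data"),
    Category("2.2", "Electronic data volumes"),
    Category("3.1", "Storage of research data"),
    Category("3.2", "Types of metadata"),
    Category("3.3", "Metadata standards"),
    Category("3.4", "Degree of openness"),
    Category("3.5", "Licensing of research data"),
]}

# Hang disciplinary examples on the scaffold.
TYPOLOGY["3.3"].add_example("Social sciences", "DDI")
TYPOLOGY["3.3"].add_example("Crystallography", "CIF")
TYPOLOGY["3.3"].add_example("Digital humanities", "TEI")

for discipline, items in TYPOLOGY["3.3"].examples.items():
    print(discipline, "->", ", ".join(items))
```

Keying the categories by their printed codes keeps the structure aligned with the numbering on the slide, so community contributions can be filed against a code without changing the scaffold itself.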
An expandable resource 
A scaffold onto which disciplinary examples can be 
hung 
Dynamic resource: community input (from librarians, 
but maybe others too?), crowdsourcing 
Turning it into an online interactive tool 
Refreshing, curating, adapting the resource 
Basic introduction at 
http://www.powtoon.com/show/fZDm1s0W6TI/research-data-typology-for-rluk-draft/
Conclusions 
Lots of work still to do! 
Domains differ in all respects: data, methods, key 
RDM concerns, level of infrastructure and support… 
Differences exist at sub-discipline level 
Need to understand the area 
– Developing and using RLUK’s typology
How to plug the gaps? 
Dozens of different repositories or databases 
specialising in sub-domains or data types, but still major 
gaps 
– Shared services? 
– Institutional services – specialising rather than generic? 
– Role of publishers and learned societies? 
– Funder calls for domain specific infrastructure? 
– Unis to support ground-up development of tools / services? 
• How can the sector help domain-specific solutions to 
mature and thrive?

More Related Content

What's hot

Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...LIBER Europe
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...Amanda Whitmire
 
From policy to practice with DMP Online
From policy to practice with DMP OnlineFrom policy to practice with DMP Online
From policy to practice with DMP OnlineSarah Jones
 
Martin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineMartin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineFuture Perfect 2012
 
Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Amanda Whitmire
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Robin Rice
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation HeidornBryan Heidorn
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceRobert H. McDonald
 
EPSRC research data expectations and research software management
EPSRC research data expectations and research software managementEPSRC research data expectations and research software management
EPSRC research data expectations and research software managementHistoric Environment Scotland
 
LEARN Conference - How to cost
LEARN Conference - How to costLEARN Conference - How to cost
LEARN Conference - How to costJisc RDM
 
Jeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional ResponsibilityJeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional ResponsibilityJisc
 
Sharing scientific data: Ethics and consent
Sharing scientific data: Ethics and consentSharing scientific data: Ethics and consent
Sharing scientific data: Ethics and consentAboul Ella Hassanien
 
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Jisc
 

What's hot (20)

Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
 
From policy to practice with DMP Online
From policy to practice with DMP OnlineFrom policy to practice with DMP Online
From policy to practice with DMP Online
 
Martin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP OnlineMartin Donnelly Sarah Jones DMP Online
Martin Donnelly Sarah Jones DMP Online
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
 
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation Heidorn
 
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability Science
 
EPSRC research data expectations and research software management
EPSRC research data expectations and research software managementEPSRC research data expectations and research software management
EPSRC research data expectations and research software management
 
LEARN Conference - How to cost
LEARN Conference - How to costLEARN Conference - How to cost
LEARN Conference - How to cost
 
Jeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional ResponsibilityJeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional Responsibility
 
Sharing scientific data: Ethics and consent
Sharing scientific data: Ethics and consentSharing scientific data: Ethics and consent
Sharing scientific data: Ethics and consent
 
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
 

Viewers also liked

H2020 data pilot openaire
H2020 data pilot openaireH2020 data pilot openaire
H2020 data pilot openaireSarah Jones
 
Use and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfedUse and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfedKevin Ashley
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...Kevin Ashley
 
20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data ThingsKatina Toufexis
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 
JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009Kevin Ashley
 
basic research versus applied research
basic research versus applied researchbasic research versus applied research
basic research versus applied researchChristian Orsolino
 
Basic vs Applied Research
Basic vs Applied ResearchBasic vs Applied Research
Basic vs Applied ResearchAnupama Saini
 

Viewers also liked (11)

H2020 data pilot openaire
H2020 data pilot openaireH2020 data pilot openaire
H2020 data pilot openaire
 
Use and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfedUse and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfed
 
DMPonline demo
DMPonline demoDMPonline demo
DMPonline demo
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...
 
20160719 23 Research Data Things
20160719 23 Research Data Things20160719 23 Research Data Things
20160719 23 Research Data Things
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009
 
Open Science
Open ScienceOpen Science
Open Science
 
basic research versus applied research
basic research versus applied researchbasic research versus applied research
basic research versus applied research
 
Basic vs Applied Research
Basic vs Applied ResearchBasic vs Applied Research
Basic vs Applied Research
 

Similar to Disciplinary RDM

Sarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspectiveSarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspectiveJisc
 
Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Sarah Jones
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research DataMichael Day
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...EDINA, University of Edinburgh
 
Current and emerging scientific data curation practices
Current and emerging scientific data curation practicesCurrent and emerging scientific data curation practices
Current and emerging scientific data curation practicesMichael Day
 
Data curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreData curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreMichael Day
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineMartin Donnelly
 
Developing institutional RDM services
Developing institutional RDM servicesDeveloping institutional RDM services
Developing institutional RDM servicesMichael Day
 
RDM for Librarians
RDM for LibrariansRDM for Librarians
RDM for LibrariansMarieke Guy
 
Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...Sarah Shreeves
 
An introduction to the Digital Curation Centre
An introduction to the Digital Curation CentreAn introduction to the Digital Curation Centre
An introduction to the Digital Curation CentreMichael Day
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curationMichael Day
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeLizLyon
 
User Engagement in Research Data Curation
User Engagement in Research Data CurationUser Engagement in Research Data Curation
User Engagement in Research Data CurationUniversity of Edinburgh
 

Similar to Disciplinary RDM (20)

Sarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspectiveSarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspective
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research Data
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
 
DAF methodology
DAF methodologyDAF methodology
DAF methodology
 
Current and emerging scientific data curation practices
Current and emerging scientific data curation practicesCurrent and emerging scientific data curation practices
Current and emerging scientific data curation practices
 
Data curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreData curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation Centre
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP Online
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Developing institutional RDM services
Developing institutional RDM servicesDeveloping institutional RDM services
Developing institutional RDM services
 
Resources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of OxfordResources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of Oxford
 
RDM for Librarians
RDM for LibrariansRDM for Librarians
RDM for Librarians
 
Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...
 
An introduction to the Digital Curation Centre
An introduction to the Digital Curation CentreAn introduction to the Digital Curation Centre
An introduction to the Digital Curation Centre
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curation
 
Mind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and PracticeMind the Gap: Reflections on Data Policies and Practice
Mind the Gap: Reflections on Data Policies and Practice
 
Open Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon HodsonOpen Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon Hodson
 
User Engagement in Research Data Curation
User Engagement in Research Data CurationUser Engagement in Research Data Curation
User Engagement in Research Data Curation
 

More from Sarah Jones

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricksSarah Jones
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and librariesSarah Jones
 
EOSC Association priorities and activities

Disciplinary RDM

  • 1. Research Data Management from a disciplinary perspective Sarah Jones Digital Curation Centre sarah.jones@glasgow.ac.uk Twitter: @sjDCC Stéphane Goldstein Research Information Network stephane.goldstein@researchinfonet.org Twitter: @stephgold7
  • 2. Disclaimer Practice varies greatly by discipline and sub-discipline so it’s hard to generalise Apologies for any sweeping statements and groupings that don’t fit your model Image credit: Sweep by Judy Van der Velden CC-BY-NC-ND www.flickr.com/photos/judy-van-der-velden/6757403261
  • 3. Case studies on disciplinary practice RIN Information Seeking and Sharing Behaviour www.rin.ac.uk/our-work/using-and-accessing-information-resources – Life sciences – Humanities – Physical sciences RIN Open Science Case studies www.rin.ac.uk/our-work/data-management-and-curation/open-science-case-studies SCARP case studies www.dcc.ac.uk/resources/case-studies/scarp Knowledge Exchange Incentives and motivations for sharing research data (forthcoming) RLUK research data typology (more from Stephane)
  • 4. Groups and disciplines Arts & Humanities – Creative arts, languages, philosophy, archaeology… Social Science – Economics, history, politics, business, psychology... Sciences & Engineering – Physics, astronomy, earth sciences, computing… Life Sciences – Biology, ecology, medical and veterinary science…
  • 5. Arts & Humanities Outputs may not be termed ‘data’ e.g. sketches, writing, performance, artefacts, ‘work’ Focus on literary outputs & manuscripts in some disciplines More use of standard tools e.g. Word, Excel – less likely to adapt technologies to fit Arguably lower awareness and uptake of RDM overall
  • 6. Creative Arts Several RDM projects in the creative arts e.g. Kultivate, KAPTUR, VADS4R, CAiRO training... Resistance to term ‘data’ – too scientific Importance of personal websites for profile as work is also conducted outside of academia Visual Arts Data Service - www.vads.ac.uk Institutional repositories at arts schools accept a broader range of outputs and display content more visually to fill the void e.g. http://research.gold.ac.uk
  • 7. Sonic Arts Research Unit Collaboration with IR as a result of losing data Tension between providing access in a visual / usable way and preserving data Still use SoundCloud and personal websites for access, but these link to ‘master’ copy of data held in IR for preservation www.dcc.ac.uk/resources/developing-rdm-services/repository-radar
  • 8. Digital Humanities Intentional creation of resources rather than just data as by-product of research process More use of standards e.g. XML & TEI in language resources, image standards and capture quality for digitisation, Dublin Core metadata… Often include technical experts in project team Links with cultural heritage collections Negotiating copyright often a major issue Sustainability a big challenge
  • 9. Mapping Edinburgh’s Social History Historical maps overlaid with all kinds of open data to chart how the town has changed through time Uses open source tools Allows you to overlay maps Picks up on common themes www.mesh.ed.ac.uk
  • 10. Social Sciences – Greater awareness and acceptance of RDM by community – Methodology is as much a factor in determining difference as discipline – Nature of data often poses challenges for sharing – Lots of reuse of large survey data – Established metadata standards e.g. Data Documentation Initiative (DDI) – Strong international data centre infrastructure
  • 11. Public health Ethics predominant concern – How to negotiate consent – How to store, transfer & handle data securely – How to anonymise and share data Data integration / linking and curation of longitudinal studies is major concern as data added to over decades Need for data havens to help control access to data – role for unis e.g. Grampian Data Safe Haven UK Data Service - http://ukdataservice.ac.uk
  • 12. Twenty-07: Public health study Longitudinal study following 4510 people from West of Scotland over 20 years to investigate the reasons for differences in health Undertook interviews, questionnaires, physical measurements, blood samples etc Strict access controls and guidelines for data collection Data managed within the MRC Social and Public Health Sciences Unit and accessible under a data sharing agreement - http://2007study.sphsu.mrc.ac.uk/Revised-Data-Sharing-Policy-has-been-
  • 13. Life Sciences – Funders arguably more demanding in terms of data sharing policy – Sharing can be problematic / resisted given the nature of the data, fear of misuse or loss of control over IPR – Data sharing agreements and access committees more common – Data integration & mining key drivers – Research is well-resourced so greater capacity to fund local solutions and tools for RDM during projects
  • 14. Genetics Vast quantities of data and rapid growth – DNA sequence data is doubling every 6-8 months Well established public databases for gene sequences e.g. GenBank www.ncbi.nlm.nih.gov/genbank – However even this is on short-term project funding! Need accession number to publish so driver for sharing and established workflow European Data Infrastructure projects too e.g. ELIXIR
  • 15. Neuroscience – Large data volumes due to use of medical imaging – Moving towards larger cohort studies integrating wider range of data types, which strains the balance with ethical requirements around personal data – Costs of data gathering and advances in analysis technology are making field more data intensive - computational methods – Small interdisciplinary teams provide the human infrastructure for RDM, but historically low funder investment in data management at lab level – Disciplinary archives are immature, which has encouraged a tendency for labs to treat longitudinal datasets as intellectual capital
  • 16. OMERO – Open Microscopy Environment Monash e-Research Centre helps groups to adopt (and if needed adapt) existing technological solutions Partnered with a research group to implement OMERO, a secure central repository to help researchers organise, analyze and share images Resulting tool more sustainable as tailored to specific community need www.dcc.ac.uk/resources/developing-rdm-services/improving-rdm-monash
  • 17. Science & Engineering – Large scale can mean RDM is built in as standard and sharing part of workflow e.g. facilities science – Often early adopters and advocates of new technologies e.g. the Grid, wikis & arXiv in particle physics – Archiving established in some cases as data can’t be recreated e.g. NERC data centres for Earth Sciences – Commercial sensitivities can place restrictions on sharing in some fields Industry partners
  • 18. Mechanical Engineering Several RDM projects at Bath e.g. ERIM, REDm-MED Concept of repository well established in industrial engineering – Product Lifecycle Management (PLM) systems Preservation issues as data is challenging e.g. CAD files Less information sharing than other disciplines – Commercial sensitivities preclude sharing – Consultancy-style research can lead to internal-only results – Data generated from private systems, so less applicable to others
  • 19. Crystallography X-ray examinations, images and videos of crystal structures, chemical crystallography diffraction images Established metadata standards e.g. Crystallographic Information Framework (CIF) Advocates of open science and use of related tools – UsefulChem - http://usefulchem.wikispaces.com – LabTrove - www.labtrove.org eCrystals Archive and Crystallography Open Database (COD) National Crystallography Service - www.ncs.ac.uk
  • 20. Astronomy Established data standards (e.g. FITS and NOA) maintained by community Access to facilities requires the deposit of raw data, although this can be embargoed International data centres e.g. Sloan Digital Sky Survey - www.sdss.org Large volumes of data so transfer can be difficult Few IPR issues compared to other disciplines Data products are not always shared
  • 21. Galaxy Zoo Citizen Science project started to classify a million galaxies imaged by the Sloan Digital Sky Survey Over 50 million classifications in the first year, contributed by more than 150,000 people Classifications were as good as those from professional astronomers Further projects in astronomy, climatology, biology, humanities… www.galaxyzoo.org
  • 22. Research data typology Commissioned by RLUK Aim: to help librarians improve their ability to engage with researchers on RDM matters; and to enable them to acquire a better understanding of the needs of researchers A resource structured around a suggested typology of research data, looking at different ways in which data might be categorised
  • 23. Broad data types 1. How do researchers generate and process data, and for what purpose? 1.1 Method of creation and collection of research data: where the data comes from 1.2 Readiness of research data: extent to which data has been processed 1.3 Use of research data: researchers' main purpose for accessing and using data 2. In what file formats, media and volumes do researchers generate data? 2.1 Medium and format for research data: objects in which data is captured and recorded, electronic storage and file types 2.2 Electronic data volumes: size of files (this is subjective, and based largely on the perception of researchers) 3. How do researchers manage and store their data? 3.1 Storage of research data: where and how data is kept 3.2 Types of metadata: not an exhaustive list, but these are widely-recognised metadata standards 3.3 Metadata standards 3.4 Degree of openness: founded on Royal Society's categorisation of 'intelligent openness' 3.5 Licensing of research data: legal rights appertaining to the use of the data
  • 24. An expandable resource A scaffold onto which disciplinary examples can be hung Dynamic resource: community input (from librarians, but maybe others too?), crowdsourcing Turning it into an online interactive tool Refreshing, curating, adapting the resource Basic introduction at http://www.powtoon.com/show/fZDm1s0W6TI/research-data-typology-for-rluk-draft/
  • 25. Conclusions Lots of work still to do! Domains different in all respects: data, methods, key RDM concerns, level of infrastructure and support… Differences exist at sub-discipline level Need to understand the area – Developing and using RLUK’s typology
  • 26. How to plug the gaps? Dozens of different repositories or databases specialising in sub-domains or data types, but still major gaps – Shared services? – Institutional services – specialising rather than generic? – Role of publishers and learned societies? – Funder calls for domain specific infrastructure? – Unis to support ground-up development of tools / services? • How can the sector help domain-specific solutions to mature and thrive?
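
Slide 14 states that DNA sequence data is doubling every 6-8 months. To give a rough sense of what that rate implies for storage planning, the short sketch below computes the growth factor over a chosen period. The function name and the 2-year and 5-year horizons are illustrative assumptions, not figures from the slides, and the calculation assumes the doubling rate stays constant.

```python
def growth_factor(months: float, doubling_months: float) -> float:
    """Return how many times larger a dataset becomes after `months`
    if its volume doubles every `doubling_months` (assumed constant)."""
    return 2 ** (months / doubling_months)


if __name__ == "__main__":
    # Doubling periods of 6 and 8 months come from slide 14;
    # the 24- and 60-month horizons are hypothetical planning windows.
    for doubling in (6, 8):
        two_years = growth_factor(24, doubling)
        five_years = growth_factor(60, doubling)
        print(f"doubling every {doubling} months: "
              f"x{two_years:.0f} after 2 years, x{five_years:.0f} after 5 years")
```

Even at the slower end of the quoted range, volume grows eightfold in two years, which is one way to read the slide's concern that the public databases rely on short-term project funding.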

Editor's Notes

  1. Flexible Image Transport System (FITS)