SlideShare a Scribd company logo
1 of 19
Research data management:
Benefits for the researcher,
Benefits for Society
Kevin Ashley
Digital Curation Centre
www.dcc.ac.uk
@kevingashley
Kevin.ashley@ed.ac.uk
Reusable with attribution: CC-BY The DCC is supported by Jisc
A summary
• Some benefits:
– Citation & impact
– Compliance with funders & regulation
– Improving your research
• What stops us ?
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 2
An alternative summary
Being Selfish
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 3
What’s
possible now
… and still
benefiting others
Being Just Good
Enough
Thanks to:
Neil Chue Hong (@npch), Software Sustainability
Institute
ORCID: 0000-0002-8876-7606
David Flanders (@dfflanders), Dr Steven Manos
(DrStevenManos)
University of Melbourne.
All my colleagues at the DCC
Cameron Neylon (@CameronNeylon)
“the active management and appraisal of
data over the lifecycle of scholarly
and scientific interest”
Data management is part of
good research practice
What is Research Data Management?
Plan
Create
Document
Use
Publish
Share
Slide by Sarah Jones, DCC
Should all data be open?
• NO
• Many reasons – most to do with human
subjects
• But data existence should always be open
• Allows discovery & negotiation on use
• Avoids pointless replication
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 5
http://www.flickr.com/photos/sethw/113073189/
95% of research
results are
never published
Slide: Cameron Neylon2015-06-22
Kevin Ashley – Warsaw data workshop -
CC-BY
6
But if you could publish just data…
• You could gain benefit even from the
experiments that fail – as long you got good
data
• ‘Data papers’ are one way to achieve this
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 7
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 8
Findable, citable data has value
• Important to link publications to data (and vice versa)
• Increases citations – of data & publication
• Increases reuse (hence value)
• But effects exist even without publication, if data is:
– Archived
– Citable
– Discoverable
• All benefit – researcher; institution; publisher
Citability
• Making data available increases citations
• Everyone – academic, funder, institution –
loves citations
• Want evidence?
– Alter, Pienta, Lyle – 240%, social sciences *
– Piwowar, Vision – 9% (microarray data)†
– Henneken, Accomazzi – 20% (astronomy) #
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 9
† Piwowar H, Vision TJ. (2013) Data reuse & the open data citation advantage. PeerJ PrePrints 1:e1v1
http://dx.doi.org/10.7287/peerj.preprints.1v1
* Amy Pienta, George Alter, Jared Lyle, (2010) The Enduring Value of Social Science Research: The Use and Reuse of Primary Research Data.
http://hdl.handle.net/2027.42/78307
# Edwin Henneken, Alberto Accomazzi, (2011) Linking to Data - Effect on Citation Rates in Astronomy. http://arxiv.org/abs/1111.3618
Funders are making demands
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 10
Funder requirements
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 11
Regulatory requirements
• Data protection, freedom of information,
research ethics – all apply to data
• If your data is badly managed:
– Compliance is hard
• Know what you deleted (and why) as well as
what you have
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 12
Because it’s good practice
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 13
“Data management is
essential to
excellence in
research”
Professor Charlotte Clarke, Associate
Dean for Research, School of Health,
Community and Education Studies
Apart from the benefits for
research, good data
management is vital for
many reasons:
accountability, security,
appropriate data-sharing,
re-use protocols and
preservation for example
Prof Julie McLeod, School
of Computing, Engineering
& Information Sciences
www.northumbria.ac.uk/browse/ne/uninews/datamanagement?view=Standard&news=archive
Finally…
• Well-managed data makes your research
easier, now and in future
• Well-managed data is easier to share, more
likely to be re-used
• ISharing data is good for you
• It’s good for all of us
• It isn’t as hard as you think – we’re here to
show you how!
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 14
2015-06-22
Kevin Ashley – Warsaw data workshop -
CC-BY
15
Roles and
Responsibilities
What data to keep
2015-06-22
Kevin Ashley – Warsaw data workshop -
CC-BY
16
2015-06-22
Kevin Ashley – Warsaw data workshop -
CC-BY
17
How to cite data
What data to keep
Acquire research data skills
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 18
Data reuse - messages
2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 19
Often your data tells
stories that your
publications do not
Not all data comes from
other researchers
One person’s noise is
another person’s signal
Discipline-bounded data
discovery doesn’t give us
all we need or want

More Related Content

What's hot

Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyRobin Rice
 
Educause 2015 RDM Maturity
Educause 2015 RDM Maturity Educause 2015 RDM Maturity
Educause 2015 RDM Maturity ResearchSpace
 
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for Librarians
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for LibrariansRDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for Librarians
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for LibrariansEDINA, University of Edinburgh
 
RDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthRDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthASIS&T
 
The Rise of the Data Journal
The Rise of the Data JournalThe Rise of the Data Journal
The Rise of the Data JournalMarieke Guy
 
Building Confidence: Training Librarians in Research Data Management
Building Confidence: Training Librarians in Research Data ManagementBuilding Confidence: Training Librarians in Research Data Management
Building Confidence: Training Librarians in Research Data ManagementRobin Rice
 
Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...EDINA, University of Edinburgh
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchUniversity of California Curation Center
 

What's hot (20)

Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional Policy
 
RDM Programme at University of Edinburgh
RDM Programme at University of EdinburghRDM Programme at University of Edinburgh
RDM Programme at University of Edinburgh
 
RDM Programme@Edinburgh
RDM Programme@EdinburghRDM Programme@Edinburgh
RDM Programme@Edinburgh
 
Library Support of Identification and Discovery of Scholarly Output - Cross- ...
Library Support of Identification and Discovery of Scholarly Output - Cross- ...Library Support of Identification and Discovery of Scholarly Output - Cross- ...
Library Support of Identification and Discovery of Scholarly Output - Cross- ...
 
Research Data Management Roadmap@Edinburgh
Research Data Management Roadmap@EdinburghResearch Data Management Roadmap@Edinburgh
Research Data Management Roadmap@Edinburgh
 
Educause 2015 RDM Maturity
Educause 2015 RDM Maturity Educause 2015 RDM Maturity
Educause 2015 RDM Maturity
 
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
 
What's So Special about the Social Sciences
What's So Special about the Social SciencesWhat's So Special about the Social Sciences
What's So Special about the Social Sciences
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for Librarians
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for LibrariansRDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for Librarians
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for Librarians
 
RDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthRDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for Earth
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? Part 1: ...
NISO Two Part Webinar:   Is Granularity the Next Discovery Frontier? Part 1: ...NISO Two Part Webinar:   Is Granularity the Next Discovery Frontier? Part 1: ...
NISO Two Part Webinar: Is Granularity the Next Discovery Frontier? Part 1: ...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
The Rise of the Data Journal
The Rise of the Data JournalThe Rise of the Data Journal
The Rise of the Data Journal
 
Building Confidence: Training Librarians in Research Data Management
Building Confidence: Training Librarians in Research Data ManagementBuilding Confidence: Training Librarians in Research Data Management
Building Confidence: Training Librarians in Research Data Management
 
Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 

Similar to University of Northumbria Research

Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...
Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...
Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...Platforma Otwartej Nauki
 
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...Kevin Ashley
 
My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...Kevin Ashley
 
Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer carolelynnpalmer
 
Supporting open research - how to help your researchers - Vitae15
Supporting open research - how to help your researchers - Vitae15Supporting open research - how to help your researchers - Vitae15
Supporting open research - how to help your researchers - Vitae15Kevin Ashley
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data ThingsKatina Toufexis
 
Incentives for modern research
Incentives for modern researchIncentives for modern research
Incentives for modern researchJisc
 
Is democracy the right system? Building an engaged RDM community - Marta Tepe...
Is democracy the right system? Building an engaged RDM community - Marta Tepe...Is democracy the right system? Building an engaged RDM community - Marta Tepe...
Is democracy the right system? Building an engaged RDM community - Marta Tepe...Mari Tinnemans
 
Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6ARDC
 
Use and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfedUse and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfedKevin Ashley
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceJian Qin
 
Magle data curation in libraries
Magle data curation in librariesMagle data curation in libraries
Magle data curation in librariesC. Tobin Magle
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Managementaaroncollie
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ LibraryARDC
 
Final data presentation_clir_july2014
Final data presentation_clir_july2014Final data presentation_clir_july2014
Final data presentation_clir_july2014Patricia Hswe
 
Workshop intro090314
Workshop intro090314Workshop intro090314
Workshop intro090314Philip Bourne
 

Similar to University of Northumbria Research (20)

Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...
Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...
Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...
 
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...
 
My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...My data, your data, our data - increasing data value through reuse (Eurocris2...
My data, your data, our data - increasing data value through reuse (Eurocris2...
 
Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer
 
Supporting open research - how to help your researchers - Vitae15
Supporting open research - how to help your researchers - Vitae15Supporting open research - how to help your researchers - Vitae15
Supporting open research - how to help your researchers - Vitae15
 
Open data for open scholarship - where we are
Open data for open scholarship - where we areOpen data for open scholarship - where we are
Open data for open scholarship - where we are
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
 
Incentives for modern research
Incentives for modern researchIncentives for modern research
Incentives for modern research
 
Is democracy the right system? Building an engaged RDM community - Marta Tepe...
Is democracy the right system? Building an engaged RDM community - Marta Tepe...Is democracy the right system? Building an engaged RDM community - Marta Tepe...
Is democracy the right system? Building an engaged RDM community - Marta Tepe...
 
Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6
 
Use and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfedUse and reuse: research data locally & globally #esipfed
Use and reuse: research data locally & globally #esipfed
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information Science
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
Magle data curation in libraries
Magle data curation in librariesMagle data curation in libraries
Magle data curation in libraries
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ Library
 
Final data presentation_clir_july2014
Final data presentation_clir_july2014Final data presentation_clir_july2014
Final data presentation_clir_july2014
 
Yale Day of Data
Yale Day of Data Yale Day of Data
Yale Day of Data
 
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
 
Workshop intro090314
Workshop intro090314Workshop intro090314
Workshop intro090314
 

More from Kevin Ashley

RISE - the DCC's Research Infrastructure Self-Evaluation Framework
RISE - the DCC's Research Infrastructure Self-Evaluation FrameworkRISE - the DCC's Research Infrastructure Self-Evaluation Framework
RISE - the DCC's Research Infrastructure Self-Evaluation FrameworkKevin Ashley
 
An analysis of open data and open science policies in Europe - a SPARCEurope ...
An analysis of open data and open science policies in Europe - a SPARCEurope ...An analysis of open data and open science policies in Europe - a SPARCEurope ...
An analysis of open data and open science policies in Europe - a SPARCEurope ...Kevin Ashley
 
National Research Data Services in the UK and elsewhere (#confdados)
National Research Data Services in the UK and elsewhere (#confdados)National Research Data Services in the UK and elsewhere (#confdados)
National Research Data Services in the UK and elsewhere (#confdados)Kevin Ashley
 
Research data: burden or treasure? (Talk from #fote13)
Research data: burden or treasure? (Talk from #fote13)Research data: burden or treasure? (Talk from #fote13)
Research data: burden or treasure? (Talk from #fote13)Kevin Ashley
 
Data Quality and Data Curation - a personal view
Data Quality and Data Curation - a personal viewData Quality and Data Curation - a personal view
Data Quality and Data Curation - a personal viewKevin Ashley
 
Data and the webmanager
Data and the webmanagerData and the webmanager
Data and the webmanagerKevin Ashley
 
Research data for repository managers
Research data for repository managers Research data for repository managers
Research data for repository managers Kevin Ashley
 
Research Data Management: the UK national change programme (Nordbib)
Research Data Management: the UK national change programme (Nordbib)Research Data Management: the UK national change programme (Nordbib)
Research Data Management: the UK national change programme (Nordbib)Kevin Ashley
 
Trust: when we need it and how to get it
Trust: when we need it and how to get itTrust: when we need it and how to get it
Trust: when we need it and how to get itKevin Ashley
 
Missing links closing talk - with notes
Missing links closing talk - with notesMissing links closing talk - with notes
Missing links closing talk - with notesKevin Ashley
 
What can the DCC do for you? Sheffield Roadshow
What can the DCC do for you? Sheffield RoadshowWhat can the DCC do for you? Sheffield Roadshow
What can the DCC do for you? Sheffield RoadshowKevin Ashley
 
Audit and outsourcing: their role in creating interoperable repository infras...
Audit and outsourcing: their role in creating interoperable repository infras...Audit and outsourcing: their role in creating interoperable repository infras...
Audit and outsourcing: their role in creating interoperable repository infras...Kevin Ashley
 
JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009Kevin Ashley
 
Digital Curation: gaps and challenges
Digital Curation: gaps and challengesDigital Curation: gaps and challenges
Digital Curation: gaps and challengesKevin Ashley
 
ipres2008: the Digital Preservation Training Programme
ipres2008: the Digital Preservation Training Programmeipres2008: the Digital Preservation Training Programme
ipres2008: the Digital Preservation Training ProgrammeKevin Ashley
 

More from Kevin Ashley (15)

RISE - the DCC's Research Infrastructure Self-Evaluation Framework
RISE - the DCC's Research Infrastructure Self-Evaluation FrameworkRISE - the DCC's Research Infrastructure Self-Evaluation Framework
RISE - the DCC's Research Infrastructure Self-Evaluation Framework
 
An analysis of open data and open science policies in Europe - a SPARCEurope ...
An analysis of open data and open science policies in Europe - a SPARCEurope ...An analysis of open data and open science policies in Europe - a SPARCEurope ...
An analysis of open data and open science policies in Europe - a SPARCEurope ...
 
National Research Data Services in the UK and elsewhere (#confdados)
National Research Data Services in the UK and elsewhere (#confdados)National Research Data Services in the UK and elsewhere (#confdados)
National Research Data Services in the UK and elsewhere (#confdados)
 
Research data: burden or treasure? (Talk from #fote13)
Research data: burden or treasure? (Talk from #fote13)Research data: burden or treasure? (Talk from #fote13)
Research data: burden or treasure? (Talk from #fote13)
 
Data Quality and Data Curation - a personal view
Data Quality and Data Curation - a personal viewData Quality and Data Curation - a personal view
Data Quality and Data Curation - a personal view
 
Data and the webmanager
Data and the webmanagerData and the webmanager
Data and the webmanager
 
Research data for repository managers
Research data for repository managers Research data for repository managers
Research data for repository managers
 
Research Data Management: the UK national change programme (Nordbib)
Research Data Management: the UK national change programme (Nordbib)Research Data Management: the UK national change programme (Nordbib)
Research Data Management: the UK national change programme (Nordbib)
 
Trust: when we need it and how to get it
Trust: when we need it and how to get itTrust: when we need it and how to get it
Trust: when we need it and how to get it
 
Missing links closing talk - with notes
Missing links closing talk - with notesMissing links closing talk - with notes
Missing links closing talk - with notes
 
What can the DCC do for you? Sheffield Roadshow
What can the DCC do for you? Sheffield RoadshowWhat can the DCC do for you? Sheffield Roadshow
What can the DCC do for you? Sheffield Roadshow
 
Audit and outsourcing: their role in creating interoperable repository infras...
Audit and outsourcing: their role in creating interoperable repository infras...Audit and outsourcing: their role in creating interoperable repository infras...
Audit and outsourcing: their role in creating interoperable repository infras...
 
JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009JISC repositories and preservation programme: Plenary presentation 2009
JISC repositories and preservation programme: Plenary presentation 2009
 
Digital Curation: gaps and challenges
Digital Curation: gaps and challengesDigital Curation: gaps and challenges
Digital Curation: gaps and challenges
 
ipres2008: the Digital Preservation Training Programme
ipres2008: the Digital Preservation Training Programmeipres2008: the Digital Preservation Training Programme
ipres2008: the Digital Preservation Training Programme
 

University of Northumbria Research

  • 1. Research data management: Benefits for the researcher, Benefits for Society Kevin Ashley Digital Curation Centre www.dcc.ac.uk @kevingashley Kevin.ashley@ed.ac.uk Reusable with attribution: CC-BY The DCC is supported by Jisc
  • 2. A summary • Some benefits: – Citation & impact – Compliance with funders & regulation – Improving your research • What stops us ? 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 2
  • 3. An alternative summary Being Selfish 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 3 What’s possible now … and still benefiting others Being Just Good Enough Thanks to: Neil Chue Hong (@npch), Software Sustainability Institute ORCID: 0000-0002-8876-7606 David Flanders (@dfflanders), Dr Steven Manos (DrStevenManos) University of Melbourne. All my colleagues at the DCC Cameron Neylon (@CameronNeylon)
  • 4. “the active management and appraisal of data over the lifecycle of scholarly and scientific interest” Data management is part of good research practice What is Research Data Management? Plan Create Document Use Publish Share Slide by Sarah Jones, DCC
  • 5. Should all data be open? • NO • Many reasons – most to do with human subjects • But data existence should always be open • Allows discovery & negotiation on use • Avoids pointless replication 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 5
  • 6. http://www.flickr.com/photos/sethw/113073189/ 95% of research results are never published Slide: Cameron Neylon2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 6
  • 7. But if you could publish just data… • You could gain benefit even from the experiments that fail – as long you got good data • ‘Data papers’ are one way to achieve this 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 7
  • 8. 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 8 Findable, citable data has value • Important to link publications to data (and vice versa) • Increases citations – of data & publication • Increases reuse (hence value) • But effects exist even without publication, if data is: – Archived – Citable – Discoverable • All benefit – researcher; institution; publisher
  • 9. Citability • Making data available increases citations • Everyone – academic, funder, institution – loves citations • Want evidence? – Alter, Pienta, Lyle – 240%, social sciences * – Piwowar, Vision – 9% (microarray data)† – Henneken, Accomazzi – 20% (astronomy) # 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 9 † Piwowar H, Vision TJ. (2013) Data reuse & the open data citation advantage. PeerJ PrePrints 1:e1v1 http://dx.doi.org/10.7287/peerj.preprints.1v1 * Amy Pienta, George Alter, Jared Lyle, (2010) The Enduring Value of Social Science Research: The Use and Reuse of Primary Research Data. http://hdl.handle.net/2027.42/78307 # Edwin Henneken, Alberto Accomazzi, (2011) Linking to Data - Effect on Citation Rates in Astronomy. http://arxiv.org/abs/1111.3618
  • 10. Funders are making demands 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 10
  • 11. Funder requirements 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 11
  • 12. Regulatory requirements • Data protection, freedom of information, research ethics – all apply to data • If your data is badly managed: – Compliance is hard • Know what you deleted (and why) as well as what you have 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 12
  • 13. Because it’s good practice 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 13 “Data management is essential to excellence in research” Professor Charlotte Clarke, Associate Dean for Research, School of Health, Community and Education Studies Apart from the benefits for research, good data management is vital for many reasons: accountability, security, appropriate data-sharing, re-use protocols and preservation for example Prof Julie McLeod, School of Computing, Engineering & Information Sciences www.northumbria.ac.uk/browse/ne/uninews/datamanagement?view=Standard&news=archive
  • 14. Finally… • Well-managed data makes your research easier, now and in future • Well-managed data is easier to share, more likely to be re-used • ISharing data is good for you • It’s good for all of us • It isn’t as hard as you think – we’re here to show you how! 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 14
  • 15. 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 15
  • 16. Roles and Responsibilities What data to keep 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 16
  • 17. 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 17 How to cite data What data to keep
  • 18. Acquire research data skills 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 18
  • 19. Data reuse - messages 2015-06-22 Kevin Ashley – Warsaw data workshop - CC-BY 19 Often your data tells stories that your publications do not Not all data comes from other researchers One person’s noise is another person’s signal Discipline-bounded data discovery doesn’t give us all we need or want

Editor's Notes

  1. The DCC defines data management as the active management and appraisal of data over the lifecycle of scholarly interest. RDM is ‘active’ – digital materials can’t just be left and opened again in 10-20 years time. Lots of things change (hardware, software, operating systems…) so you need to proactively manage data. You also need to select what can and should be kept. Not everything can be retained legally and only some data are valuable to share. Research data management is about all activities in the lifecycle from initially planning (writing DMPs), through creating data and documentation, processing / analysing data and then publishing results and sharing with others.
  2. Medicine does, however, provide some clear reasons why we can’t just stick all research data on the internet for anyone to trawl through. When human subjects are involved there are real concerns about confidentiality. Yet what alltrials.net and other initiatives make clear is that the *existence* of the data should never be hidden. That allows it to be discovered and for negotiations to take place about its use. It avoids costly replication, which can delay scientific discovery and involve human suffering when the replication takes the form of a clinical trial.
  3. Did I mention that making data available increases citations? This is a win all round. If you don’t believe me, here are three studies from three different areas that all show robust, positive correlations. The effect size varies with discipline, but we have enough evidence now that anyone who says that their area is different needs to come up with evidence to show why.
  4. There are many such stories of unexpected data reuse; these are a few examples. The last, exemplified in the Old Weather project, is seeing the original data being reused for at least the third time and in doing so is helping both climatologist and family historians through a single piece of transcription work. An impressive result.