This document summarizes a presentation on open data strategies and research data management best practices. It discusses the importance of open data as part of the broader open science movement. The presenter outlines good practices for research data management, including planning, documentation, storage, and deposition. Benefits of good research data management include increased impact, accessibility, transparency, efficiency and data durability. Risks of poor management include legal issues, financial penalties, lost scientific opportunities and reputational harm. The presentation provides a step-by-step approach to research data management and discusses roles and responsibilities of different stakeholders.
"Open Science, Open Data" training for participants of Software Writing Skills for Your Research - Workshop for Proficient, Helmholtz Centre Potsdam - GFZ German Research Centre for Geosciences, Telegrafenberg, December 16, 2015
Keynote talk to LEARN (LERU/H2020 project) for research data management. Emphasizes that problems are cultural not technical. Promotes modern approaches such as Git / continuousIntegration, announces DAT. Asserts that the Right to Read in the Right to Mine. Calls for widespread development of contentmining (TDM)
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
1st LEARN Workshop. Embedding Research Data as part of the research cycle. 29 Jan 2016. Presentation by Sabina Leonelli, Exeter Centre for the Study of Life Sciences (Egenis) & Department of Sociology, Philosophy and Anthropology, University of Exeter
From Open Data to Open Science, by Geoffrey BoultonLEARN Project
1st LEARN Workshop. Embedding Research Data as part of the research cycle. 29 Jan 2016. Presentation by Geoffrey Boulton, University of Edinburgh & CODATA
"Open Science, Open Data" training for participants of Software Writing Skills for Your Research - Workshop for Proficient, Helmholtz Centre Potsdam - GFZ German Research Centre for Geosciences, Telegrafenberg, December 16, 2015
Keynote talk to LEARN (LERU/H2020 project) for research data management. Emphasizes that problems are cultural not technical. Promotes modern approaches such as Git / continuousIntegration, announces DAT. Asserts that the Right to Read in the Right to Mine. Calls for widespread development of contentmining (TDM)
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
1st LEARN Workshop. Embedding Research Data as part of the research cycle. 29 Jan 2016. Presentation by Sabina Leonelli, Exeter Centre for the Study of Life Sciences (Egenis) & Department of Sociology, Philosophy and Anthropology, University of Exeter
From Open Data to Open Science, by Geoffrey BoultonLEARN Project
1st LEARN Workshop. Embedding Research Data as part of the research cycle. 29 Jan 2016. Presentation by Geoffrey Boulton, University of Edinburgh & CODATA
What is e-research?
Enhancing research practice
e-Research Methods, Strategies, and Issues
Tips For Finding Useful Information
Some Search Tools for doing e-research
Research Design
Quantitative Research
Qualitative Research
Ethics & The e-Researcher
How The Net Complicates Ethics?
Privacy, Confidentiality, Autonomy, And The Respect For Persons
Tips For Ethical e-Research
Collaboration Tools
Why Consensus?
Net-based dissemination of E-research results
Dissemination through peer-reviewed articles
Advantages of a peer-reviewed article
Dissemination through email lists or Usenet groups
Dissemination through a virtual conference
A presentation offering an introduction to managing and sharing research data given at the Czech Open Science days as part of the EC-funded FOSTER project.
OU Library Research Support webinar: Data sharingDaniel Crane
Slides from a webinar delivered on 06th February 2018 for OU research staff and students. Covers data sharing policies; Benefits of data sharing; Data repositories; Preparing data for sharing; and Re-using data.
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds
http://dlab.berkeley.edu/event/open-research-challenge-peer-review-and-publication-research-data
A talk by Dr. Jonathan Tedds, Senior Research Fellow, D2K Data to Knowledge, Dept of Health Sciences, University of Leicester.
PI: #BRISSKit www.brisskit.le.ac.uk
PI: #PREPARDE www.le.ac.uk/projects/preparde
The Peer REview for Publication & Accreditation of Research data in the Earth sciences (PREPARDE) project seeks to capture the processes and procedures required to publish a scientific dataset, ranging from ingestion into a data repository, through to formal publication in a data journal. It will also address key issues arising in the data publication paradigm, namely, how does one peer-review a dataset, what criteria are needed for a repository to be considered objectively trustworthy, and how can datasets and journal publications be effectively cross-linked for the benefit of the wider research community.
I will discuss this and alternative approaches to research data management and publishing through examples in astronomy, biomedical and interdisciplinary research including the arts and humanities. Who can help in the long tail of research if lacking established data centers, archives or adequate institutional support? How much can we transfer from the so called “big data” sciences to other settings and where does the institution fit in with all this? What about software?
Publishing research data brings a wide and differing range of challenges for all involved, whatever the discipline. In PREPARDE we also considered the pre and post publication peer review paradigm, as implemented in the F1000 Research Publishing Model for the life sciences. Finally, in an era of truly international research how might we coordinate the many institutional, regional, national and international initiatives – has the time come for an international Research Data Alliance?
Data management: The new frontier for librariesLEARN Project
Presentation at 3rd LEARN workshop on Research Data Management, “Make research data management policies work”, by Kathleen Shearer, COAR, CARL/ABCR, RDC/DCR, ARL, SSHRC/CSRH.
OPEN DATA. The researcher perspective
Preface
Paul Wouters
Professor of Scientometrics,
Director of CWTS,
Leiden University
Wouter Haak
Vice President,
Research Data Management,
Elsevier
A year ago, in April 2016, Leiden University’s Centre for
Science and Technology Studies (CWTS) and Elsevier
embarked on a project to investigate open data practices
at the workbench in academic research. Knowledge
knows no borders, so to understand open data practices
comprehensively the project has been framed from the
outset as a global study. That said, both the European
Union and the Dutch government have formulated the
transformation of the scientific system into an open
innovation system as a formal policy goal. At the time
we started the project, the Amsterdam Call for Action on
Open Science had just been published under the Dutch
presidency of the Council of the European Union. However,
how are policy initiatives for open science related to the
day-to-day practices of researchers and scholars?
Introduction to research data managementMichael Day
Slides from a presentation given at the JIBS User Group / RLUK joint event "Demystifying research data: don't be scared, be prepared" held at the SOAS Brunei Gallery, London, 17 July 2012.
Presentation given by Sarah Jones and Joy Davidson to a group of South African librarians at a webinar organised by LIASA HELIG. http://www.liasa.org.za/node/977
What is e-research?
Enhancing research practice
e-Research Methods, Strategies, and Issues
Tips For Finding Useful Information
Some Search Tools for doing e-research
Research Design
Quantitative Research
Qualitative Research
Ethics & The e-Researcher
How The Net Complicates Ethics?
Privacy, Confidentiality, Autonomy, And The Respect For Persons
Tips For Ethical e-Research
Collaboration Tools
Why Consensus?
Net-based dissemination of E-research results
Dissemination through peer-reviewed articles
Advantages of a peer-reviewed article
Dissemination through email lists or Usenet groups
Dissemination through a virtual conference
A presentation offering an introduction to managing and sharing research data given at the Czech Open Science days as part of the EC-funded FOSTER project.
OU Library Research Support webinar: Data sharingDaniel Crane
Slides from a webinar delivered on 06th February 2018 for OU research staff and students. Covers data sharing policies; Benefits of data sharing; Data repositories; Preparing data for sharing; and Re-using data.
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds
http://dlab.berkeley.edu/event/open-research-challenge-peer-review-and-publication-research-data
A talk by Dr. Jonathan Tedds, Senior Research Fellow, D2K Data to Knowledge, Dept of Health Sciences, University of Leicester.
PI: #BRISSKit www.brisskit.le.ac.uk
PI: #PREPARDE www.le.ac.uk/projects/preparde
The Peer REview for Publication & Accreditation of Research data in the Earth sciences (PREPARDE) project seeks to capture the processes and procedures required to publish a scientific dataset, ranging from ingestion into a data repository, through to formal publication in a data journal. It will also address key issues arising in the data publication paradigm, namely, how does one peer-review a dataset, what criteria are needed for a repository to be considered objectively trustworthy, and how can datasets and journal publications be effectively cross-linked for the benefit of the wider research community.
I will discuss this and alternative approaches to research data management and publishing through examples in astronomy, biomedical and interdisciplinary research including the arts and humanities. Who can help in the long tail of research if lacking established data centers, archives or adequate institutional support? How much can we transfer from the so called “big data” sciences to other settings and where does the institution fit in with all this? What about software?
Publishing research data brings a wide and differing range of challenges for all involved, whatever the discipline. In PREPARDE we also considered the pre and post publication peer review paradigm, as implemented in the F1000 Research Publishing Model for the life sciences. Finally, in an era of truly international research how might we coordinate the many institutional, regional, national and international initiatives – has the time come for an international Research Data Alliance?
Data management: The new frontier for librariesLEARN Project
Presentation at 3rd LEARN workshop on Research Data Management, “Make research data management policies work”, by Kathleen Shearer, COAR, CARL/ABCR, RDC/DCR, ARL, SSHRC/CSRH.
OPEN DATA. The researcher perspective
Preface
Paul Wouters
Professor of Scientometrics,
Director of CWTS,
Leiden University
Wouter Haak
Vice President,
Research Data Management,
Elsevier
A year ago, in April 2016, Leiden University’s Centre for
Science and Technology Studies (CWTS) and Elsevier
embarked on a project to investigate open data practices
at the workbench in academic research. Knowledge
knows no borders, so to understand open data practices
comprehensively the project has been framed from the
outset as a global study. That said, both the European
Union and the Dutch government have formulated the
transformation of the scientific system into an open
innovation system as a formal policy goal. At the time
we started the project, the Amsterdam Call for Action on
Open Science had just been published under the Dutch
presidency of the Council of the European Union. However,
how are policy initiatives for open science related to the
day-to-day practices of researchers and scholars?
Introduction to research data managementMichael Day
Slides from a presentation given at the JIBS User Group / RLUK joint event "Demystifying research data: don't be scared, be prepared" held at the SOAS Brunei Gallery, London, 17 July 2012.
Presentation given by Sarah Jones and Joy Davidson to a group of South African librarians at a webinar organised by LIASA HELIG. http://www.liasa.org.za/node/977
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)dri_ireland
Presentation given by Martin Donnelly, Senior Institutional Support Officer at the Digital Curation Centre (DCC), as part of the panel session “Digital data sharing: the opportunities and challenges of opening research” at the Digital Humanities conference, Krakow, 15 July 2016. The presentation looks at digital data curation at the DCC.
Immersive informatics - research data management at Pitt iSchool and Carnegie...Keith Webster
A joint presentation by Liz Lyon and Keith Webster on providing education for librarians engaged in research data management. This was delivered at Library Research Seminar VI, at the University of Illinois Urbana Champaign in September 2014. The presentation looks at a class delivered by Lyon at the University of Pittsburgh's iSchool in 2014, and the related needs for immersive training opportunities amongst experienced practicing librarians, using Carnegie Mellon University's library, led by Webster, as a case study.
Research process and research data management. Many universities are looking at how they can better serve the needs of researchers. Ken Chad Consulting worked with the University of Westminster to look the needs and attitudes of researchers and admin staff in terms of research data management (RDM). The result led the University to look first at the whole lifecycle and workflows of research administration. This in turn led to the innovative, rapid development of a system to support researchers and admin staff. Presented by Suzanne Enright (University of Westminster) and Ken Chad at the annual UKSG conference in April 2014
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
Paper was presented at European Survey Research Association 2013, in the session Research Data Management for Re-use: Bringing Researchers and Archivists closer.
A short, retrospective presentation given as part of the #10yearsDMPonline celebrations in November 2020. I product-managed the first few iterations of this free software tool.
Research data management: a tale of two paradigms: Martin Donnelly
Presentation I was supposed to give at "Scotland’s Collections and the Digital Humanities" workshop in Edinburgh on May 2nd 2014. Illness prevented it, but my heroic DCC colleague Jonathan Rans stepped up and delivered the presentation on my behalf.
'Found' and 'after' - a short history of data reuse in the artsMartin Donnelly
A presentation prepared as emergency backup for RDMF10 (http://www.dcc.ac.uk/events/research-data-management-forum-rdmf/rdmf10-research-data-management-arts-and-humanities), while we were struggling to secure a replacement keynote speaker. It was fun to prepare, though, so here it is, minus the multimedia bits such as the sound files on the 'sampling' slide.
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
Embracing GenAI - A Strategic ImperativePeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Open Data - strategies for research data management & impact of best practices
1. Facilitate Open Science Training for European Research
Open Data: Strategies for Research Data Management,
and impact of best practices?
Martin Donnelly
Digital Curation Centre
University of Edinburgh
NCP Academy Webinar
16 June 2017
2. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
3. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
4. Background (me)
• Academic background in cultural heritage computing…
• Which led me to work in digital preservation…
• Which led to my current involvement in research data
management and the broader topic of Open Science
• I’ve been involved to various degrees in the development
of early DMP resources (DCC Checklist, DMPonline,
DMPTool, book chapter on DMP…)
• Member of the original FOSTER consortium
• Also involved in consultancy, advocacy, events, training
etc, e.g. as external expert reviewer of Horizon 2020
DMPs
5. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source)
= Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
6. Open Access + Open Data = Open Science
• Openness in research is situated within a context of ever
greater transparency, accessibility and accountability
• As Open Access to publications became normal (if not yet
ubiquitous), the scholarly community turned its attention to the
data which underpins the research outputs, and eventually to
consider it a first-class output in its own right. The development
of the OA and research data management (RDM) agendas are
closely linked as part of a broader trend in research, sometimes
termed ‘Open Science’ or ‘Open Research’
• “The European Commission is now moving beyond open access towards
the more inclusive area of open science. Elements of open science will
gradually feed into the shaping of a policy for Responsible Research and
Innovation and will contribute to the realisation of the European
Research Area and the Innovation Union, the two main flagship
initiatives for research and innovation”
http://ec.europa.eu/research/swafs/index.cfm?pg=policy&lib=science
7. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
8. Good practice in RDM
RDM is “the active
management and appraisal
of data over the lifecycle of
scholarly and scientific
interest”
What sorts of activities?
- Planning and describing data-
related work before it takes
place
- Documenting your data so that
others can find and understand
it
- Storing it safely during the
project
- Depositing it in a trusted
archive at the end of the
project
- Linking publications to the
datasets that underpin them
9. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
10. Benefits
• IMPACT and LONGEVITY: Open data (and publications) receive
more citations, over longer periods
• SPEED: The research process becomes faster
• ACCESSIBILITY: Interested third parties can (where
appropriate) access and build upon publicly-funded research
outputs with minimal barriers to access
• EFFICIENCY: Data collection can be funded once, and used
many times for a variety of purposes
• TRANSPARENCY and QUALITY: The evidence that underpins
research can be made open for anyone to scrutinise, and
attempt to replicate findings. This leads to a more robust
scholarly record, and reduces academic fraud for example
• DURABILITY: simply put, fewer important datasets will be lost
11. “In genomics research, a large-scale
analysis of data sharing shows that
studies that made data available in
repositories received 9% more
citations, when controlling for other
variables; and that whilst self-reuse
citation declines steeply after two
years, reuse by third parties
increases even after six years.”
(Piwowar and Vision, 2013)
Van den Eynden, V. and Bishop, L.
(2014). Incentives and motivations for
sharing research data, a researcher’s
perspective. A Knowledge Exchange
Report,
http://repository.jisc.ac.uk/5662/1/KE
_report-incentives-for-sharing-
researchdata.pdf
Benefits: Impact and Longevity
12. “Data is necessary for
reproducibility of
computational research”
Victoria Stodden, “Innovation and Growth
through Open Access to Scientific Research:
Three Ideas for High-Impact Rule Changes” in
Litan, Robert E. et al. Rules for Growth:
Promoting Innovation and Growth Through Legal
Reform. SSRN Scholarly Paper. Rochester, NY:
Social Science Research Network, February 8,
2011. http://papers.ssrn.com/abstract=1757982.
Benefits: Quality
13. Baker, M. (2016)
“1,500 scientists
lift the lid on
reproducibility”,
Nature,
533:7604,
http://www.nat
ure.com/news/1
-500-scientists-
lift-the-lid-on-
reproducibility-
1.19970
14. “Conservatively, we estimate that the value of data in
Australia’s public research to be at least $1.9 billion
and possibly up to $6 billion a year at current levels of
expenditure and activity. Research data curation and
sharing might be worth at least $1.8 billion and possibly
up to $5.5 billion a year, of which perhaps $1.4 billion to
$4.9 billion annually is yet to be realized.”
• “Open Research Data”, Report to the Australian National Data Service (ANDS),
November 2014 - John Houghton, Victoria Institute of Strategic Economic
Studies & Nicholas Gruen, Lateral Economics
Benefits: Financial
15. J. Manyika et al. "Open data: Unlocking innovation
and performance with liquid information" McKinsey
Global Institute, October 2013
16. “If we are going to wait
five years for data to
be released, the Arctic
is going to be a very
different place.”
Bryn Nelson, Nature, 10 Sept 2009
http://www.nature.com/nature/jour
nal/v461/n7261/index.html
Benefits: Speed
https://www.flickr.com/photos/gsfc/7348953774/
- CC-BY
17. Benefits: Durability
Vines et al. “examined the availability of data from 516 studies between 2 and 22 years
old”
- The odds of a data set being reported as extant fell by 17% per year
- Broken e-mails and obsolete storage devices were the main obstacles to data sharing
- Policies mandating data archiving at publication are clearly needed
“The current system of leaving data with authors means that almost all of it is lost
over time, unavailable for validation of the original results or to use for entirely new
purposes” according to Timothy Vines, one of the researchers. This underscores the
need for intentional management of data from all disciplines and opened our
conversation on potential roles for librarians in this arena. (“80 Percent of Scientific
Data Gone in 20 Years” HNGN, Dec. 20, 2013,
http://www.hngn.com/articles/20083/20131220/80-percent-of-scientific-data-gone-in-
20-years.htm.)
Vines et al., The Availability of Research Data Declines Rapidly with Article Age,
Current Biology (2014), http://dx.doi.org/10.1016/j.cub.2013.11.014
18. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
19. Risks of getting this wrong
• Legal – sensitive data is protected by law (and contracts)
and needs to be protected
• Financial – non-compliance with funder policies can lead
to reduced access to income streams
• Scientific – potential discoveries may be hidden away in
drawers, on USB
• Opportunity cost – reduced visibility for research > lost
opportunities for collaboration
• Quality – the scholarly record becomes less robust
• Reputational – responsible data management is
increasingly considered a core element of good scholarly
practice in the 21st century
20. Growing momentum and ubiquity…
Data management
is a part of good
research practice.
- RCUK Policy and Code of
Conduct on the
Governance of Good
Research Conduct
21. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
22. Step 1. Be clear about who is involved
• RDM is a hybrid activity, involving multiple stakeholder groups…
• The researchers themselves
• Research support personnel
• Partners based in other institutions, funders, data centres, commercial
partners, etc
• No single person does everything, and it makes no sense to duplicate
effort or reinvent wheels
• Data Management Planning (DMP) underpins and pulls together
different strands of data management activities. DMP is the process
of planning, describing and communicating the activities carried
out during the research lifecycle in order to…
• Keep sensitive data safe
• Maximise data’s re-use potential
• Support longer-term preservation
• Data Management Plans are a means of communication, with
contemporaries and future re-users alike
23. Step 2. Write things down
• In a data management plan / record
• In metadata to describe the data and help others to
understand it
• In workflows and README files
• In version management
• In justifying decisions re. access, embargo, selection
and appraisal… the list can be very long…
Communication is crucial!
24. Step 3. Don’t try to do everything yourself
• See Step 1 ;)
25. RDM / Open Data in practice: key points
1. Understand your funder’s policies (and perhaps national policy
initiatives – see recent SPARC-Europe reports)
2. Create a data management plan (e.g. with DMPonline)
3. Decide which data to preserve (e.g. using the DCC How-To
guide and checklist, “Five Steps to Decide what Data to Keep”)
4. Identify a long-term home for your data (e.g. via re3data.org)
5. Link your data to your publications with a persistent identifier
(e.g. via DataCite)
• N.B. Many archives, including Zenodo, will do this for you
6. Investigate EU infrastructure services and resources
26. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
27. A few do’s and don’ts for RDM
DO DON’T
Have a plan for your data Make it up as you go along
Keep backups. Make this easy with automated
syncing services like Dropbox, provided your
data isn’t too sensitive
Carry the only copy around on a memory
card, your laptop, your phone, etc
Describe your data as you collect it. This
makes it possible for others to interpret it,
and for you to do the same a few years down
the line
Leave this till the end. The quality of
metadata decreases with time, and the
best metadata is created at the moment of
data capture
Save your work in open file formats, where
possible, and use accepted metadata
standards to enable like-with-like comparison
Invent new ‘standards’ where community
norms already exist
Deposit your data in a data centre or
repository, and link it to your publications
Be afraid to ask for help. This will exist
both within your institution, and via
national / European support organisations
28. Rules of thumb
• Without intervention, data + time = no data
• See Vines, above
• Prioritise: could anyone die or go to jail?
• Legal issues (e.g. protecting vulnerable subjects) are the most
important
• Storage is not the same as management
• Think of data as plants and the servers as a greenhouse
• The plants still need to be fed, watered, pruned, etc… and
sometimes disposed of
• Management is not the same as sharing
• Not all data should be shared
• Approach: “As open as possible, as closed as necessary”
29. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
30. • Phase 1 (2014-2016): Spread
the Seeds of Open Science and
Open Access
• Creation of Open Science
Taxonomy
• 2000+ training materials,
categorized in the FOSTER
Portal
• More than 100 f2f training
events in 28 countries and 25
online courses, totalling more
than 6300 participants
FacilitateOpenScienceTrainingforEuropeanResearch
The project
http://fosteropenscience.eu
31. • Phase 2 (2017-2019): Let the Flowers of Open Science Bloom
• Focus on:
• Training for the practical implementation of Open Science (face to face
and online) including RDM and Open Data
• Developing intermediate/advanced level/discipline-specific training
resources in collaboration with three disciplinary communities (and
related RIs): Life Sciences (ELIXIR), Social Sciences (CESSDA) and
Humanities (DARIAH)
• Update the FOSTER Portal to support moderated learning, badges and
gamification
• In concrete terms:
• 150 new training resources
• Over 50 training events (outcome-oriented, providing participants with
tangible skills) and 20 e-learning courses
• Multi-module Open Science Toolkit
• Trainers Network, Open Science Bootcamp, Open Science Training
Handbook, and more…
FacilitateOpenScienceTrainingforEuropeanResearch
The project
http://fosteropenscience.eu
32. Overview
1. Background
2. Context: Open Access + Open Data (+ Open Source) =
Open Science (or Open Research)
3. What is good RDM practice?
4. What are the benefits of good RDM?
5. What are the risks of poor RDM?
6. A step by step approach
7. Do’s and don’ts / Rules of thumb
8. About the FOSTER project
9. About the DCC / contact details
33. The Digital Curation Centre (DCC)
• UK national centre of expertise in digital preservation
and data management, est. 2004
• Principal audience is the UK higher education sector, but
we increasingly work further afield (continental Europe,
North America, South Africa, Asia…)
• Provide guidance, training, tools (e.g. DMPonline) and
other services on all aspects of research data
management and Open Science
• Tailored consultancy/training
• Organise national and international events and webinars
(International Digital Curation Conference, Research
Data Management Forum)
34. Contact details
• For more information about the
FOSTER project:
• Website: www.fosteropenscience.eu
• Principal investigator: Eloy Rodrigues
(eloy@sdum.uminho.pt)
• General enquiries: Gwen Franck
(gwen.franck@eifl.net)
• Twitter: @fosterscience
• My contact details:
• Email: martin.donnelly@ed.ac.uk
• Twitter: @mkdDCC
• Slideshare:
http://www.slideshare.net/martindo
nnelly
This work is licensed under the
Creative Commons Attribution
2.5 UK: Scotland License.