The University of Edinburgh has taken several steps to improve research data management:
1. They developed the first research data management policy in the UK to provide guidelines for storing, sharing, and preserving research data.
2. They created online training and guidance materials called MANTRA to teach researchers best practices for data organization, documentation, and long-term access.
3. They are developing research data services including a data library, repository, and storage strategies to support researchers in managing their data throughout the research lifecycle and ensuring access over time.
Presented by Peter Burnhill, Director of EDINA, Beyond Books: What STM & Social Science publishing should learn from each other, London. Conference programme. 22 April 2010.
Presented by Peter Burnhill, Director of EDINA, at PARSE.insight workshop on Preservation, Access and Re-use of Scientific Data, Darmstadt, Germany, 22 September 2009.
The Go-Geo! Spatial Data Portal provides a discovery and research tool for UK academics to find geospatial resources. It includes over 2,820 searchable geospatial datasets and metadata records. Go-Geo! also provides geospatial metadata best practices and guidelines, metadata editing and publishing tools, workshops and training to support the use and sharing of spatial data across UK academia.
presented by Stuart Macdonald at the College of Science and Engineering - "What's new for you in the Library“, Murray Library, Kings Buildings, University of Edinburgh. 28 May 2014
Covers research data, research data management, funder policies and the University's RDM policy, RDM services and support, awareness raising, training, progress so far.
The University of Edinburgh has taken several steps to improve research data management:
1. They developed the first research data management policy in the UK to provide guidelines for storing, sharing, and preserving research data.
2. They created online training and guidance materials called MANTRA to teach researchers best practices for data organization, documentation, and long-term access.
3. They are developing research data services including a data library, repository, and storage strategies to support researchers in managing their data throughout the research lifecycle and ensuring access over time.
Presented by Peter Burnhill, Director of EDINA, Beyond Books: What STM & Social Science publishing should learn from each other, London. Conference programme. 22 April 2010.
Presented by Peter Burnhill, Director of EDINA, at PARSE.insight workshop on Preservation, Access and Re-use of Scientific Data, Darmstadt, Germany, 22 September 2009.
The Go-Geo! Spatial Data Portal provides a discovery and research tool for UK academics to find geospatial resources. It includes over 2,820 searchable geospatial datasets and metadata records. Go-Geo! also provides geospatial metadata best practices and guidelines, metadata editing and publishing tools, workshops and training to support the use and sharing of spatial data across UK academia.
presented by Stuart Macdonald at the College of Science and Engineering - "What's new for you in the Library“, Murray Library, Kings Buildings, University of Edinburgh. 28 May 2014
Covers research data, research data management, funder policies and the University's RDM policy, RDM services and support, awareness raising, training, progress so far.
Management of research data specifically for Engineering and Physical Science. Delivered by Stuart Macdonald at the "Support for Enhancing Research Impact" meeting at the University of Edinburgh on 22 June 2016.
The Research Data MANTRA (MANagementTRAining) project at the University of Edinburgh created open online learning materials for research data management. The materials were developed for postgraduate students and early career researchers, grounded in best practices for specific disciplines like social science and geosciences. The course includes video interviews, data exercises, and will be embedded in university graduate programs and available openly online. Key to the project's success will be positive user feedback and increased advocacy for research data management practices across the university. The university also approved a new research data policy to provide guidelines and support for proper data management.
The University of Edinburgh implemented a research data management policy and programme to provide services and support for researchers. Key services include DataStore for active data storage, DataShare for publishing data, and DataVault for long-term preservation. Training, guidance on data management planning, and support staff help researchers comply with funder requirements and best practices. The multi-phase programme establishes critical services while pursuing interoperability and engaging the research community.
Presentation given by Chris Higgens at the Annual Infrastructure for Spatial Information in European (INSPIRE) Conference Krakow, Poland. 22 June 2010.
Research Data Management: Approaches to Institutional PolicyRobin Rice
This document summarizes research data management policies from several universities. It discusses the purpose statements, tones, roles and responsibilities outlined in the policies of universities in the UK, Australia, and US. The University of Edinburgh policy takes a partnership approach, sharing responsibilities between the university and researchers. It aims to support research excellence through managing data to high standards across the research lifecycle.
This document discusses the importance of preserving both research data and literature for future use. It quotes two scientists emphasizing the value of original data. It then makes three recommendations: 1) Include research literature as part of the record of science; 2) Make data available for future unknown uses; 3) Regard assured access to digital content as a grand challenge. Several organizations are working to archive e-journals and digital content to ensure long-term preservation and access.
The document summarizes a presentation about Jorum, a UK national repository for sharing learning and teaching materials. It discusses how Jorum has been reshaped to support open educational resources and open courseware by adopting the DSpace platform and adding features like Creative Commons licensing, content packaging support, and metadata registration via RSS feeds. The goal is for Jorum to be a place where open content from UK higher education projects can be stored, managed, and made available worldwide.
The document summarizes a pilot project at the University of Edinburgh to support the development of a UK Research Data Discovery Service. PhD interns engaged with researchers from various schools to describe and deposit research datasets in the university's systems to be harvested by the discovery service. Observations found mixed results across schools, with humanities researchers less comfortable sharing data due to copyright and reluctance to share interpretations. Other schools had established data repositories causing less interest in the university's system. Building research data management practices will require tailored approaches and more training over time.
The University of Edinburgh approved a research data management policy in May 2011 to address growing pressures around research data. The Vice-Principal, Jeff Haywood, championed the development of the first research data management policy in the UK. The policy aimed to comply with funder requirements for open access to research data and address reputational risks around responding to Freedom of Information requests. In developing the policy, the university sought broad discussion, identified champions at various levels, and addressed gaps in research data services to support retention and access to data underlying published research.
Addy Pope demonstrates how a suite of EDINA and Edinburgh University Data Library tools and apps can make curating your spatial data a breeze. Presented at the Open Repositories 2014, June 9-13, Helsinki, Finland http://or2014.helsinki.fi
The document summarizes discussions from a meeting about ensuring long-term access to scholarly works in electronic formats. It describes the governance and activities of the UK LOCKSS Alliance, including comparison of different e-journal archiving initiatives, the PECAN project to build an entitlement registry, and recommendations from a white paper on e-journal archiving. It also discusses the newly formed JARVIG committee tasked with determining the most effective national e-journal archiving infrastructure for UK higher education.
Presentation given by Peter Burnhill, director of EDINA, at #ReCon_15 : Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
Peter Burnhill
http://reconevent.com/
This document summarizes outcomes from an interoperability experiment on using Shibboleth for access management of geospatial web services. It discusses how Shibboleth can allow single sign-on access to protected spatial data resources across administrative domains. The academic sector helped test and demonstrate modified open source clients that work with Shibboleth-secured services to improve access for research and education. Going forward, expanding the use of Shibboleth and strengthening connections between access management federations could maximize the benefits.
EDINA is a national data center based at the University of Edinburgh that provides open access to scholarly resources and supports the UK education sector. It runs several geospatial services including GeoTagger for geotagging images, Cartogrammar for creating cartograms, GoGeo for discovering GIS resources and metadata, Unlock for georeferencing text, and ShareGeo and Openstream for mapping and sharing open geospatial data. EDINA also promotes open events and groups for geospatial topics.
1. The document discusses the development of JISC Mediahub, a new platform that will aggregate and provide access to various digital media collections from different sources.
2. JISC Mediahub will allow users to search across collections in one place and stream or download content depending on licensing.
3. An initial preview version of JISC Mediahub has been launched, allowing users to search metadata and view some content. The full service is planned for launch in August.
Management of research data specifically for Engineering and Physical Science. Delivered by Stuart Macdonald at the "Support for Enhancing Research Impact" meeting at the University of Edinburgh on 22 June 2016.
The Research Data MANTRA (MANagementTRAining) project at the University of Edinburgh created open online learning materials for research data management. The materials were developed for postgraduate students and early career researchers, grounded in best practices for specific disciplines like social science and geosciences. The course includes video interviews, data exercises, and will be embedded in university graduate programs and available openly online. Key to the project's success will be positive user feedback and increased advocacy for research data management practices across the university. The university also approved a new research data policy to provide guidelines and support for proper data management.
The University of Edinburgh implemented a research data management policy and programme to provide services and support for researchers. Key services include DataStore for active data storage, DataShare for publishing data, and DataVault for long-term preservation. Training, guidance on data management planning, and support staff help researchers comply with funder requirements and best practices. The multi-phase programme establishes critical services while pursuing interoperability and engaging the research community.
Presentation given by Chris Higgens at the Annual Infrastructure for Spatial Information in European (INSPIRE) Conference Krakow, Poland. 22 June 2010.
Research Data Management: Approaches to Institutional PolicyRobin Rice
This document summarizes research data management policies from several universities. It discusses the purpose statements, tones, roles and responsibilities outlined in the policies of universities in the UK, Australia, and US. The University of Edinburgh policy takes a partnership approach, sharing responsibilities between the university and researchers. It aims to support research excellence through managing data to high standards across the research lifecycle.
This document discusses the importance of preserving both research data and literature for future use. It quotes two scientists emphasizing the value of original data. It then makes three recommendations: 1) Include research literature as part of the record of science; 2) Make data available for future unknown uses; 3) Regard assured access to digital content as a grand challenge. Several organizations are working to archive e-journals and digital content to ensure long-term preservation and access.
The document summarizes a presentation about Jorum, a UK national repository for sharing learning and teaching materials. It discusses how Jorum has been reshaped to support open educational resources and open courseware by adopting the DSpace platform and adding features like Creative Commons licensing, content packaging support, and metadata registration via RSS feeds. The goal is for Jorum to be a place where open content from UK higher education projects can be stored, managed, and made available worldwide.
The document summarizes a pilot project at the University of Edinburgh to support the development of a UK Research Data Discovery Service. PhD interns engaged with researchers from various schools to describe and deposit research datasets in the university's systems to be harvested by the discovery service. Observations found mixed results across schools, with humanities researchers less comfortable sharing data due to copyright and reluctance to share interpretations. Other schools had established data repositories causing less interest in the university's system. Building research data management practices will require tailored approaches and more training over time.
The University of Edinburgh approved a research data management policy in May 2011 to address growing pressures around research data. The Vice-Principal, Jeff Haywood, championed the development of the first research data management policy in the UK. The policy aimed to comply with funder requirements for open access to research data and address reputational risks around responding to Freedom of Information requests. In developing the policy, the university sought broad discussion, identified champions at various levels, and addressed gaps in research data services to support retention and access to data underlying published research.
Addy Pope demonstrates how a suite of EDINA and Edinburgh University Data Library tools and apps can make curating your spatial data a breeze. Presented at the Open Repositories 2014, June 9-13, Helsinki, Finland http://or2014.helsinki.fi
The document summarizes discussions from a meeting about ensuring long-term access to scholarly works in electronic formats. It describes the governance and activities of the UK LOCKSS Alliance, including comparison of different e-journal archiving initiatives, the PECAN project to build an entitlement registry, and recommendations from a white paper on e-journal archiving. It also discusses the newly formed JARVIG committee tasked with determining the most effective national e-journal archiving infrastructure for UK higher education.
Presentation given by Peter Burnhill, director of EDINA, at #ReCon_15 : Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
Peter Burnhill
http://reconevent.com/
This document summarizes outcomes from an interoperability experiment on using Shibboleth for access management of geospatial web services. It discusses how Shibboleth can allow single sign-on access to protected spatial data resources across administrative domains. The academic sector helped test and demonstrate modified open source clients that work with Shibboleth-secured services to improve access for research and education. Going forward, expanding the use of Shibboleth and strengthening connections between access management federations could maximize the benefits.
EDINA is a national data center based at the University of Edinburgh that provides open access to scholarly resources and supports the UK education sector. It runs several geospatial services including GeoTagger for geotagging images, Cartogrammar for creating cartograms, GoGeo for discovering GIS resources and metadata, Unlock for georeferencing text, and ShareGeo and Openstream for mapping and sharing open geospatial data. EDINA also promotes open events and groups for geospatial topics.
1. The document discusses the development of JISC Mediahub, a new platform that will aggregate and provide access to various digital media collections from different sources.
2. JISC Mediahub will allow users to search across collections in one place and stream or download content depending on licensing.
3. An initial preview version of JISC Mediahub has been launched, allowing users to search metadata and view some content. The full service is planned for launch in August.
A webinar delivered by EDINA on 7 November 2012. How to view, customise, print and download Ordnance Survey maps from Roam, a service in Digimap's OS Collection.
Slides used in a Digimap webinar in February 2013. Covers available map data in the Digimap Collections (subscription services for UK further and higher edcation) and its use in ArcGIS. Information on data formats, data conversion tools and data styling.
This webinar covered tools from the UK Data Service Census Support for working with UK census data, boundaries, and postcodes. It demonstrated how to use the Boundary Data Selector to download census boundaries, the Thematic Mapper to create choropleth maps from census data, and the Postcode Data Selector to extract postcode data and add lookups to other geographies. The webinar provided an overview of the UK census and types of data available, and explained how these online tools can be used to access and visualize UK census and geographic data.
This document discusses the preservation of e-journal content by archiving organizations called "The Keepers." It provides examples of organizations that serve as keepers, such as the National Science Library of the Chinese Academy of Sciences. It also mentions The Keepers Registry, which allows users to search for e-journal content preserved by keepers based on title, ISSN, or publisher. The document suggests that users can search The Keepers Registry to discover which volumes of the journal Folklore have been preserved.
The document discusses managing research data and digital repositories in difficult economic times. It provides an overview of policies, strategies, technologies and infrastructure used to manage research and teaching materials. It also discusses funding from JISC and other organizations for repository services and projects in the UK.
1) The document discusses roles and responsibilities in ensuring permanent access to scholarly works.
2) It notes that while access to works has improved online, continuity of access is challenged as content can disappear from the web.
3) The document reports on measured progress in archiving journal content through organizations like CLOCKSS and Portico, but notes that only 19% of identified online journals are currently being preserved.
1) The document outlines the PECAN Phase 2 project which developed a prototype entitlement registry to match up title information with institutional subscriptions and post-cancellation entitlement.
2) Key components of the prototype included designing an entitlement registry demonstrator to ingest and display data, assessing methods for automating data ingestion and maintaining record accuracy over time.
3) Challenges identified included the dynamic nature of deals and titles, defining packages, and developing standard data formats and workflows for publisher data supply to minimize manual intervention.
The document discusses the UK LOCKSS Alliance program which aims to help libraries preserve electronic journals and books through local archiving. It describes how the collaborative organization coordinates support for members to build and maintain their own archives of web published content. The UK LOCKSS Alliance works to ensure long-term sustainable access to scholarly works.
Presented by Adam Rusbridge at e-Journals are forever? Preservation and Continuing Access to e-journal Content. A DPC, EDINA and JISC joint initiative, British Library, London, 26 April 2010.
The document reports on the progress of the IASSIST Latin Engagement Strategic Action Group. It summarizes the group's findings from surveying data professionals in Spain. It found that while data library roles are not prominent, interest in research data management is growing. The document recommends that IASSIST provide multilingual resources, training events in Spain, and opportunities for Latin American members to attend conferences to further engage Latin members.
The document discusses the evolution of digital library services at EDINA from the 1990s to present day. It covers:
1. Early services like SALSER, a union catalog of serials in Scotland, and knowledge gained from projects like JOIN-UP on distributed architectures.
2. Key projects and services over time including SUNCAT, the Keepers Registry for e-journal preservation, and work on entitlement registries.
3. The central role of identifiers like ISSN in enhancing records and enabling services across these systems.
4. A vision for further integrating print and digital content and moving to semantic web approaches by 2020.
The document proposes the AGILE Data Access Initiative to address issues researchers face in accessing core geospatial data from National Mapping and Cadastral Agencies across Europe. Surveys of NMCAs and academic users found that while most NMCAs make data available, barriers include cost, licensing restrictions, and difficulties obtaining cross-border data. The initiative seeks to negotiate national agreements for academic access and develop reciprocal licenses and access controls to enable easier transnational research.
Stuart Macdonald reviews what researchers need to do to comply with the new EPSRC framework concerning the management and provision of access to publicly-funded research data. Presented at the Mobility, Mood and Place Research Committee Meeting workshop at the Edinburgh College of Art, 16 June, 2015.
This document provides an overview of making research data open and preparing it for sharing. It discusses why data should be shared, including benefits like innovation, transparency and increased citations. It covers funder and publisher policies requiring data sharing. Key points on preparing data for sharing include adding metadata and documentation, using open file formats, and considering intellectual property rights and licensing. The document also discusses ethical issues around informing participants and seeking consent, as well as new GDPR requirements.
This document provides an overview of research data sharing, including why data should be shared, how to prepare data for sharing, considerations around rights and ethics, and reusing shared data. The key points covered are the benefits of sharing data, funder and publisher policies requiring data plans and sharing, preparing data by adding documentation and using open formats, obtaining informed consent, and where to find shared data for reuse.
Presentation given by Sarah Jones at a seminar run by LSHTM on 6th November 2012. http://www.lshtm.ac.uk/newsevents/events/2012/11/developing-data-management-expertise-in-research---half-day-event
An overview of the LSHTM Research Data Management Policy, outlining the motivations for its introduction, obligations that need to be met and the support available
Policies from funders, publishers, and universities increasingly require researchers to share their data. Sharing data brings benefits like enabling replication and innovation by other researchers, safeguarding research integrity, and potentially increasing citations. Researchers should select what data to share, prepare it with good documentation and open file formats, and consider using repositories. The library provides support for data management plans, preparation, and sharing through services like Open Research Data Online.
OU Library Research Support webinar: Data sharingDaniel Crane
Slides from a webinar delivered on 06th February 2018 for OU research staff and students. Covers data sharing policies; Benefits of data sharing; Data repositories; Preparing data for sharing; and Re-using data.
Stuart Macdonald steps through the process of creating a robust data management plan for researchers. Presented at the European Association for Health Information and Libraries (EAHIL) 2015 workshop, Edinburgh, 11 June 2015.
The document provides information on creating a data management plan (DMP) for grant applications. It discusses what a DMP is, why they are important, and what funders require in a DMP. A DMP outlines how research data will be collected, documented, stored, shared, and preserved. The document recommends addressing six key themes in a DMP: data types and standards; ethics and intellectual property; data access, sharing and reuse; short-term storage and management; long-term preservation; and resourcing. Developing a strong DMP helps researchers manage data effectively and makes data available and reusable by others.
Presentación de Joy Davidson, Digital Curation Centre (UK) en FOSTER event: Data Management Plan and Social Impact of Research. Universitat Jaume I, 27 mayo 2016
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
OpenAIRE and EUDAT co-present this webinar which aims to introduce researchers and others to the concept of research data management (RDM). As well as presenting the benefits of taking an active approach to research data management – including increased speed and ease of access, efficiency (fund once, reuse many times), and improved quality and transparency of research – the webinar will advise on strategies for successful RDM, resources to help manage data effectively, choosing where to store and deposit data, the EC H2020 Open Data Pilot and the basics of data management, stewardship and archiving.
Webinar recording available: http://www.instantpresenter.com/eifl/EB57D6888147
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATOpenAIRE
OpenAIRE and EUDAT co-present this webinar which aims to introduce researchers and others to the concept of research data management (RDM). As well as presenting the benefits of taking an active approach to research data management – including increased speed and ease of access, efficiency (fund once, reuse many times), and improved quality and transparency of research – the webinar will advise on strategies for successful RDM, resources to help manage data effectively, choosing where to store and deposit data, the EC H2020 Open Data Pilot and the basics of data management, stewardship and archiving.
Webinar recording available: http://www.instantpresenter.com/eifl/EB57D6888147
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | EUDAT
This webinar discusses research data management. It explains why managing data is important for reproducibility, avoiding data loss, and meeting funder requirements. It outlines Horizon 2020's requirements for open data and describes services from EUDAT and OpenAIRE that can help with the entire data lifecycle from creation to long-term preservation and sharing. The webinar covers best practices like creating data management plans, metadata, using standards, licensing, and selecting repositories to archive and share research data.
Paper was presented at European Survey Research Association 2013, in the session Research Data Management for Re-use: Bringing Researchers and Archivists closer.
This document provides an introduction to research data management for geoscience PhD students. It defines research data and different data types. It discusses the importance of managing data throughout its lifecycle for efficient and valid research. It outlines funder requirements, university policies, and activities involved in good research data management like data planning, documentation, storage, sharing and preservation.
Research data management at TU EindhovenLeon Osinski
The document discusses research data management at TU Eindhoven. It outlines the long process of developing RDM practices since 2008. It describes the current organization and governance structure for RDM. Key external requirements for RDM from funders, regulations, and integrity standards are also summarized. The document concludes by outlining RDM support services available and the benefits of good RDM practices.
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT
| www.eudat.eu | 2nd Session: July 14, 2016.
In this webinar, Sarah Jones (DCC) and Marjan Grootveld (DANS) talked through the aspects that Horizon 2020 requires from a DMP. They discussed examples from real DMPs and also touched upon the Software Management Plan, which for some projects can be a sensible addition
Similar to EPSRC research data expectations and research software management (20)
Historic Environment Scotland (HES) manages Scotland's historic environment and archives. The presentation discusses HES's digital archiving processes, which involve quarantining and virus checking digital materials, cataloging them in an Oracle database according to international standards, and making them accessible on Canmore. Future plans include digitizing over 1 million images, applying for digital repository accreditation, migrating formats for long-term preservation, and formalizing policies and procedures.
The document summarizes a presentation given by Stuart Macdonald, Digital Archivist at Historic Environment Scotland, about digital archiving practices at HES. HES manages archival collections including aerial photographs, historic images, and Canmore, an online catalogue of Scotland's historic environment. Macdonald discussed HES's digital archiving processes, which include virus checking, cataloguing, and storing digital assets according to archival standards. Future plans include seeking accreditation as a trusted digital repository, integrating a digital preservation system, and formalizing policies and procedures.
The document discusses integrating the RSpace electronic lab notebook (ELN) with the University of Edinburgh's research data management services. It describes how RSpace can link to files stored in Edinburgh's DataStore storage system, export data and metadata to the DataShare research data repository, and archive data long-term in the future DataVault archive. The integration helps researchers manage and share their data across different projects and institutions while complying with the university's RDM policy. RSpace provides a convenient interface for researchers, while the services help institutions meet requirements for data storage, publication, and preservation.
The document provides background information on RDM services at the University of Edinburgh. It summarizes that EDINA and the University Data Library provide research data management support and online resources. It then overviews key RDM services including DataStore for active research data storage, DataShare for open data publication, and plans for a long-term DataVault archive. The document also discusses RDM training and the university's RDM policy implemented through a multi-phase roadmap.
This document provides an overview of a workshop on good practice in research data management held at the University of Tartu, Estonia. The workshop covered various topics including defining research data, research data management and data management plans, organizing and documenting data, file formats and storage, metadata, security, and sharing and preserving data. The workshop was led by Stuart Macdonald from the University of Edinburgh and included presentations, introductions, and discussions around each of these research data management topics.
The Edinburgh DataShare is an institutional data repository hosted by the University of Edinburgh Data Library to provide open access to research datasets. It uses a customized DSpace platform to allow discovery of datasets and provides persistent identifiers, metadata harvesting, and quality assurance checks. Enhancements are being made to streamline deposit workflows and improve usability, and future plans include pursuing a Data Seal of Approval and integrating with other systems like GitHub and electronic lab notebooks.
The document summarizes the research data management program at the University of Edinburgh. It discusses the services provided, including a data management planning tool, a data repository for publication and preservation, and a data storage system. Training and support are also offered to help researchers with best practices in organizing, documenting, sharing, and preserving their research data over its entire lifecycle. The program aims to implement the University's research data policy and support funder requirements by establishing these research data management services.
Presentation made at the 'Towards linked science - Open Data and DataCite Esrtonia seminar as part of the Estonian Open Access Week at University of Tartu
The document provides an overview of the CISER Data Archive at Cornell University and introduces key concepts of research data management (RDM).
The CISER Data Archive is a collection of over 27,000 numeric datasets to support quantitative research in various social science fields. It provides consulting services to help users find, access, and use data. It also maintains the Cornell research data repository.
The document defines research data and outlines the research data lifecycle. It discusses best practices for organizing, documenting, storing, and securing research data. Key aspects of RDM include developing data management plans, using appropriate file formats, and ensuring long-term preservation and sharing of research data.
This document provides an overview of research data management (RDM) priorities, stakeholders, and practices from the perspective of the University of Edinburgh. It discusses the university's RDM roadmap, which aims to implement RDM services and support over multiple phases by April 2015. Key services discussed include general RDM support and consultancy, support for data management planning, storage and collaboration facilities, and tools for long-term data management and deposit. The roles of key university committees in overseeing the RDM program are also outlined. Finally, the document discusses the university's communications plan to raise awareness of RDM among researchers and support staff.
The document provides information about research data management (RDM) services and initiatives at the University of Edinburgh. It describes the EDINA National Data Centre and Data Library, which provide online resources and data management support. It outlines several JISC-funded RDM projects undertaken by the Data Library, including building the Edinburgh DataShare repository. It also summarizes the Research Data MANTRA training module and the university's RDM roadmap, which lays out a multi-phase plan to improve RDM support and services by 2015 in line with funder requirements.
The document summarizes the activities of EDINA and the Data Library at the University of Edinburgh related to research data management. It describes EDINA as a national data center that provides online resources for education and research. The Data Library assists university researchers with discovering, accessing, using and managing research datasets. It also outlines several projects the Data Library is involved in to develop training, policies and services to support best practices in research data management according to funder requirements. This includes developing an institutional research data management roadmap to help the university meet funder expectations by 2015.
The Cornell University CISER Data Archive contains over 27,000 numeric datasets covering topics such as demography, economics, health, labor, and surveys. It provides consulting services to help users find, access, and use appropriate data for their research needs. Cornell researchers can download publicly available datasets or access restricted data within the CISER computing environment. The archive also maintains a restricted data center for Cornell researchers to preserve and share their own research data.
Slides presented at the Spanish Agency of Science and Technology (FECYT) and the network of Spanish repositories (RECOLECTA) Research Data Management Webinar Series - see url:
http://www.recolecta.net/buscador/webminars.jsp
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
Open Source Contributions to Postgres: The Basics POSETTE 2024ElizabethGarrettChri
Postgres is the most advanced open-source database in the world and it's supported by a community, not a single company. So how does this work? How does code actually get into Postgres? I recently had a patch submitted and committed and I want to share what I learned in that process. I’ll give you an overview of Postgres versions and how the underlying project codebase functions. I’ll also show you the process for submitting a patch and getting that tested and committed.
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
Codeless Generative AI Pipelines
(GenAI with Milvus)
https://ml.dssconf.pl/user.html#!/lecture/DSSML24-041a/rate
Discover the potential of real-time streaming in the context of GenAI as we delve into the intricacies of Apache NiFi and its capabilities. Learn how this tool can significantly simplify the data engineering workflow for GenAI applications, allowing you to focus on the creative aspects rather than the technical complexities. I will guide you through practical examples and use cases, showing the impact of automation on prompt building. From data ingestion to transformation and delivery, witness how Apache NiFi streamlines the entire pipeline, ensuring a smooth and hassle-free experience.
Timothy Spann
https://www.youtube.com/@FLaNK-Stack
https://medium.com/@tspann
https://www.datainmotion.dev/
milvus, unstructured data, vector database, zilliz, cloud, vectors, python, deep learning, generative ai, genai, nifi, kafka, flink, streaming, iot, edge
EPSRC research data expectations and research software management
1. EPSRC research data expectations and
research software management
Stuart Macdonald
Associate Data Librarian
University of Edinburgh
stuart.macdonald@ed.ac.uk
Research Committee Meeting, School of Mathematics, University of Edinburgh, 25 Jan. 2016
2. What is research data?
Research data is defined by EPSRC as recorded factual material commonly used in
the scientific community as necessary to validate research findings
Although the majority of such data is created in digital format, all research data is
included irrespective of the format in which it is created.
Note that EPSRC does not expect every piece of data produced during a project to
be retained – decisions about what to keep should be taken on a case by case
basis.
There is however a clear expectation that data which underpins published research
outputs will be retained and managed.
3. • EPSRC have introduced a policy framework concerning the management
and provision of access to publicly-funded research data.
• EPSRC Principal Investigators and the University must demonstrate to
EPSRC that their expectations are being met. The 9 expectations are
detailed at: http://www.epsrc.ac.uk/about/standards/researchdata/
• EPSRC began monitoring compliance on 1st May 2015 on a case-by-case
basis. If it judges sharing of research data is being obstructed then it
reserves the right to impose sanctions.
5. The expectations arise from 7 core principles which align with
the core RCUK principles on data sharing, namely:
• EPSRC-funded research data is a public good produced in the public
interest and should be made freely and openly available with as few
restrictions as possible in a timely and responsible manner.
• EPSRC recognises that there are legal, ethical and commercial constraints
on release of research data
• Sharing research data is an important contributor to the impact of publicly
funded research.
6. • EPSRC-funded researchers should be entitled to a limited period of
privileged access to the data they collect to allow them to work on and
publish their results.
• Data management policies and plans should be in accordance with
relevant standards and community best practice and should exist for all
data
• Sufficient metadata should be recorded and made openly available to
enable other researchers to understand the potential for further research
and re-use of the data
• It is appropriate to use public funds to support the preservation and
management of publicly-funded research data.
7. What do PIs and researchers need to
know?
• All researchers or research students funded by EPSRC will be required to
comply with these expectations.
• Data that is not generated in digital format will be stored in a manner to
facilitate it being shared in the event of a valid request for access.
• A link to digital research data is expected to be included in the metadata.
• Where access to data is restricted published metadata should give the
reason and summarise the conditions which must be satisfied for access
to be granted.
8. • Key expectation 1: The data should be securely stored for at least 10 years
• Key expectation 2: An online record should be created within 12 months of
the data being generated that describes the research data and how to
access it.
• Key expectation 3: Published research papers should include a short
statement describing how and on what terms any supporting research
data may be accessed.
What do PIs and researchers need to do?
9. • Research data that underpins a publication must be stored safely and securely, and
made accessible.
• Data may already be managed by a trusted domain archive outside of the university,
in which case data may not need to be stored locally.
• If not then data must be stored in a suitable University of Edinburgh storage
solution. Minimal compliance is achieved by having your data on DataStore and
then making a secure copy of it into the Data Vault (this service is currently in
development).
• For those who wish to openly publish data (and a snapshot of their research
software), Edinburgh DataShare is the university’s open online digital repository of
data produced by local researchers (policies, DOI, licence, citation).
• Datasets added to DataShare will be allocated persistent identifiers (DOIs) for
citations.
Key expectation 1: store data securely
10. • The University is using PURE to record descriptive data (metadata) about
the research data in order to meet this expectation. Research staff are
therefore expected to add a metadata record for any EPSRC-funded
research data into PURE, normally within 12 months of the data being
generated.
• To enter a new dataset description in PURE, click on the green ‘Add new’
button, and select ‘Dataset’.
• Once added to PURE via the dataset content type, the resulting record
should link to the funding source and also link to any associated
publications.
Key expectation 2: a record describing
the data must be freely available online
12. • If the dataset is available online, for example in DataShare, the URL (or
DOI) of that dataset should also be added.
• Where access to the data is to be restricted, the published dataset
metadata in PURE should give the reason and summarise the conditions
which must be satisfied to grant access.
• Dataset metadata added to PURE will ultimately be publicly accessible via
the Edinburgh Research Explorer subject to confidentiality and other such
restrictions.
13. • EPSRC state that this expectation ‘could be satisfied by citing such data in the
published research and including in such citations direct links to the data or to
supporting documentation that describes the data in detail, how it may be
accessed and any constraints that may apply.’ Such links should be persistent URLs
such as DOIs.
• An example of a basic data citation would be of the form: ‘Creator (Publication
Year): Title. Publisher. DOI’ Further details can be found at:
https://www.datacite.org/services/cite-your-data.html
• If commercial, legal or ethical reasons exist to protect access to the data these
should be noted in a statement included in the published research paper. A simple
direction to interested parties to ‘contact the author for access’ may not be
considered sufficient.
• The paper must also be made Open Access in PURE.
Key expectation 3: include a statement in
published papers under-pinned by EPSRC-funded data
14. Does research data include software?
This depends on the research which is being carried out.
As noted in the definition of research data, the deciding factor is whether the software is
necessary to validate research findings, such as those published in a journal paper.
As “rule of thumb”, if your research can’t be replicated without your code then the code
should be included and shared as part of the research data
Often the software, script or simulation may be the research output and the data
produced considered ancillary content. In this case it is more important to store and share
your code than the actual data.
Additionally, even if you don’t need to preserve software, it is good practice to make
software available with adequate documentation to enable others to validate your
research findings, and to access and reuse your research data.
15. Who should make the decision about
what research data should be
preserved?
Each research organisation will have specific policies and associated processes
to determine what and how publicly-funded research data will be stored and
managed.
Normally it will be the PI of the research project and/or Head of
Department/School who will make the decision about what research data
should be preserved and made available.
It is important to recognise that not all research data can or should be freely
shared – ethical, legal or confidentiality issues may constrain who may have
access.
16. What about software not produced by
my project, but is required to validate my
research results?
It is prudent, in terms of providing access to your research data and of enabling
your own future research, to take reasonable steps to assure continued availability
of the software you use.
This may include taking a copy of open-source software and preserving it if the
licence allows, or using commercial software where a multi-year support
agreement is available.
Given the requirement to preserve research data for 10 years from the date of last
access by a third party, this provides a compelling reason to use open-data
formats and open-source software
17. What licence should I choose for my
data and software?
Following the principle that publicly funded research data should be made
openly available with as few restrictions as possible, you should consider
applying an appropriate open licence to the data and software generated by
your research (GNU General Public Licence, MIT, CC, ODC).
The Digital Curation Centre and Software Sustainability Institute have written
guides to help you license research data and choose an open-source licence for
your software.
How to licence research data (DCC) - http://www.dcc.ac.uk/resources/how-guides/license-research-
data
Choosing an open-source licence - http://www.software.ac.uk/resources/guides/adopting-open-source-
licence
18. Where should I deposit my research data
and software?
EPSRC does not provide a publications repository, research data repository, or
software repository.
Researchers are expected to use institutional or disciplinary/domain repositories
available to them (e.g. GitHub, SubVersion).
It is important that deposited objects can be referenced and accessed via a persistent
identifier (e.g. a DOI) and that appropriately structured metadata describing the
objects is easily discoverable.
A good way to make data and software discoverable is to cite it in research
publications, and to include the persistent identifier (DOI) in the citation.
19. Writing and using a software
management plan
At present software management plans are relatively new for research
proposals.
The EPSRC Software for the Future call explicitly requires software
management plans as part of the Pathways to Impact.
NSF SI2 funding requires software to be addressed as part of the mandatory
data management plan.
A software management plan is a way of formalising a set of structures and
goals to ensure your software is accessible and reusable in the short, medium
and long term.
20. Software management plans should
minimally include:
• information on what software outputs (including documentation and other related
material) are expected to be produced
• who is responsible for releasing the software (e.g. PI / lead developer)
• the revision control process to be used [Note: it is important to choose a revision
control / configuration management system that all members of the team will use]
• what license will be used for each output
A software management plan could also:
• identify the software development model
• identify external software and any associated licences
• dependencies and risks
21. Support
Implementation of the EPSRC Policy at Edinburgh is being supported by the Research
Data Service delivered by Information Services
For help about meeting this policy requirement contact:
• Email: IS.Helpline@ed.ac.uk with “Help with EPSRC data policy framework” in
your subject line.
• Email: PURE@ed.ac.uk if you have questions about PURE.
For help about research data management in general contact:
• Email: IS.Helpline@ed.ac.uk with “Help with Research Data Management in
general” in your subject line.
• Email: IS.Helpline@ed.ac.uk if you would like to arrange an RDM training or
awareness raising session.
For help with research software contact:
• Email: info@software.ac.uk at the Software Sustainability Institute
22. Thanks!
URLs
• EPSRC Policy Framework on Research Data:
http://www.epsrc.ac.uk/about/standards/researchdata/impact/
• DataStore: https://www.wiki.ed.ac.uk/x/Np9FD
• Edinburgh DataShare: http://datashare.is.ed.ac.uk/
• Data Catalogue in PURE: http://www.pure.ed.ac.uk
• Digital preservation and curation - the danger of overlooking software -
http://www.software.ac.uk/resources/guides/digital-preservation-and-curation-danger-overlooking-
software
• Choosing a repository for your software project - http://software.ac.uk/resources/guides/choosing-
repository-your-software-project
• How to cite and describe software - http://software.ac.uk/so-exactly-what-software-did-you-use
• Writing and using a software management plan - http://www.software.ac.uk/resources/guides/software-
management-plans