Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...ASIS&T
Betsy Gunia, David Fearon, Benjamin Brosius, Tim DiLauro
JHU Data Management Services
Johns Hopkins University Sheridan Libraries
A Workflow for Depositing to a Research Data Repository: A Case Study for Archiving Publication Data
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...ASIS&T
Research Data Access and Preservation Summit, 2015
Minneapolis, MN
April 22-23, 2015
Erica M. Johns, Jon Corson-Rikert, Huda J. Khan, Dean B. Krafft and Matthew S. Mayernik
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FutureASIS&T
Wendy A. Kozlowski, Dianne Dietrich, Gail Steinhart and Sarah Wright
Cornell University Library, Ithaca, NY
Research Data in eCommons @ Cornell: Present and Future
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Poster RDAP13: Data information literacy multiple paths to a single goalASIS&T
Jake Carlson, Jon Jeffryes, Brian Westra and Sarah Wright
Data Information Literacy: Multiple Paths to a Single Goal
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
The goal of the Very Open Data Project is to provide a software-technical foundation for this exchange of data, more specifically to provide an open database platform for data from the raw data coming from experimental measurements or models through intermediate manipulations to finally published results. The sheer expanse of the amount data involved creates some unique software-technical challenges. One of these challenges is addressed in the part of the study presented here, namely to characterize scientific data (with the initial focus being detailed chemistry data from the combustion kinetic community), so that efficient searches can be made. A formalization of this characterization comes in the form of schemas of descriptions of tags and keywords describing data and ontologies describing the relationship between data types and the relationship between the characterizations themselves. These will be translated to meta-data tags connected to the data points within a non-relational data of data for the community.
The focus of the initial work will be on data and its accessibility. As the project progresses, the emphasis will shift on not only having available data accessible for the community, but that the community itself will be able to, with emphasis on minimal effort, will be able contribute their own data. This will involve, for example, the concepts of the ‘electronic lab notebook’ and the existence and availability of extensive concept extraction tools, primarily from the chemical informatics field.
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...ASIS&T
Betsy Gunia, David Fearon, Benjamin Brosius, Tim DiLauro
JHU Data Management Services
Johns Hopkins University Sheridan Libraries
A Workflow for Depositing to a Research Data Repository: A Case Study for Archiving Publication Data
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...ASIS&T
Research Data Access and Preservation Summit, 2015
Minneapolis, MN
April 22-23, 2015
Erica M. Johns, Jon Corson-Rikert, Huda J. Khan, Dean B. Krafft and Matthew S. Mayernik
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FutureASIS&T
Wendy A. Kozlowski, Dianne Dietrich, Gail Steinhart and Sarah Wright
Cornell University Library, Ithaca, NY
Research Data in eCommons @ Cornell: Present and Future
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Poster RDAP13: Data information literacy multiple paths to a single goalASIS&T
Jake Carlson, Jon Jeffryes, Brian Westra and Sarah Wright
Data Information Literacy: Multiple Paths to a Single Goal
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
The goal of the Very Open Data Project is to provide a software-technical foundation for this exchange of data, more specifically to provide an open database platform for data from the raw data coming from experimental measurements or models through intermediate manipulations to finally published results. The sheer expanse of the amount data involved creates some unique software-technical challenges. One of these challenges is addressed in the part of the study presented here, namely to characterize scientific data (with the initial focus being detailed chemistry data from the combustion kinetic community), so that efficient searches can be made. A formalization of this characterization comes in the form of schemas of descriptions of tags and keywords describing data and ontologies describing the relationship between data types and the relationship between the characterizations themselves. These will be translated to meta-data tags connected to the data points within a non-relational data of data for the community.
The focus of the initial work will be on data and its accessibility. As the project progresses, the emphasis will shift on not only having available data accessible for the community, but that the community itself will be able to, with emphasis on minimal effort, will be able contribute their own data. This will involve, for example, the concepts of the ‘electronic lab notebook’ and the existence and availability of extensive concept extraction tools, primarily from the chemical informatics field.
PSB2014 A Vision for Biomedical ResearchPhilip Bourne
Some preliminary thoughts about my role as Associate Director for Data Science at the NIH so as to have a discussion with attendees at the Pacific Symposium on Biocomputing on Jan 4, 2014, The Big Island of Hawaii.
Information technology and resources are an integral and indispensable part of the contemporary academic enterprise. In particular, technological advances have nurtured a new paradigm of data-intensive research. However, far too much of this activity still takes place in silos, to the detriment of open scholarly inquiry, integrity, and advancement. To counteract this tendency, the University of California Curation Center (UC3) has been developing and deploying a comprehensive suite of curation services that facilitate widespread data management, preservation, publication, sharing, and reuse. Through these services UC3 is engaging with new communities of use: in addition to its traditional stakeholders in cultural heritage memory organizations, e.g., libraries, museums, and archives, the UC3 service suite is now attracting significant adoption by research projects, laboratories, and individual faculty researchers. This webinar will present an introduction to five specific services – DMPTool, DataUp, EZID, Merritt, Web Archiving Service (WAS) – applicable to data curation throughout the scholarly lifecycle, two recent initiatives in collaboration with UC campuses, UC Berkeley Research Hub and UC San Francisco DataShare, and the ways in which they encourage and promote new communities of practice and greater transparency in scholarly research.
Spring 2014 Data Management Lab: Session 1 Slides (more details at http://ulib.iupui.edu/digitalscholarship/dataservices/datamgmtlab)
What you will learn:
1. Build awareness of research data management issues associated with digital data.
2. Introduce methods to address common data management issues and facilitate data integrity.
3. Introduce institutional resources supporting effective data management methods.
4. Build proficiency in applying these methods.
5. Build strategic skills that enable attendees to solve new data management problems.
Ginny Pannabecker, Life Science & Scholarly Communications Librarian at Virginia Tech, is an ACRL Science and Technology Section (STS) liaison to the American Institute of Biological Sciences (AIBS). This presentation shares key points for librarians and researchers from an AIBS workshop on "Changing Practices in Data Publications," which took place in December 2014 and involved representatives from federal funding agencies; publishers and librarians; scientific societies and journals; and data services / providers.
Presentation given at the TNSCORE 2012 annual conference. Tennessee is a designated EPSCoR state. EPSCoR (Experimental Program to Stimulate Competitive Research) is a program administered by the National Science Foundation to assist states in boosting the level of research funding provided by NSF. This presentation gave an overview of the development of cyberinfrastructure in the state as well as planned future improvements.
Overview of Research Data Census preliminary findings for Montana State University Research Council, Sept 3, 2015, Jerry Sheehan, see also http://er.educause.edu/articles/2015/9/data-intensive-science-and-campus-it
Key lecture for the EURO-BASIN Training Workshop on Introduction to Statistical Modelling for Habitat Model Development, 26-28 Oct, AZTI-Tecnalia, Pasaia, Spain (www.euro-basin.eu)
This presentation was provided by Maria Praetzellis of California Digital Library, during the NISO hot topic virtual conference "Effective Data Management," which was held on September 29, 2021.
PSB2014 A Vision for Biomedical ResearchPhilip Bourne
Some preliminary thoughts about my role as Associate Director for Data Science at the NIH so as to have a discussion with attendees at the Pacific Symposium on Biocomputing on Jan 4, 2014, The Big Island of Hawaii.
Information technology and resources are an integral and indispensable part of the contemporary academic enterprise. In particular, technological advances have nurtured a new paradigm of data-intensive research. However, far too much of this activity still takes place in silos, to the detriment of open scholarly inquiry, integrity, and advancement. To counteract this tendency, the University of California Curation Center (UC3) has been developing and deploying a comprehensive suite of curation services that facilitate widespread data management, preservation, publication, sharing, and reuse. Through these services UC3 is engaging with new communities of use: in addition to its traditional stakeholders in cultural heritage memory organizations, e.g., libraries, museums, and archives, the UC3 service suite is now attracting significant adoption by research projects, laboratories, and individual faculty researchers. This webinar will present an introduction to five specific services – DMPTool, DataUp, EZID, Merritt, Web Archiving Service (WAS) – applicable to data curation throughout the scholarly lifecycle, two recent initiatives in collaboration with UC campuses, UC Berkeley Research Hub and UC San Francisco DataShare, and the ways in which they encourage and promote new communities of practice and greater transparency in scholarly research.
Spring 2014 Data Management Lab: Session 1 Slides (more details at http://ulib.iupui.edu/digitalscholarship/dataservices/datamgmtlab)
What you will learn:
1. Build awareness of research data management issues associated with digital data.
2. Introduce methods to address common data management issues and facilitate data integrity.
3. Introduce institutional resources supporting effective data management methods.
4. Build proficiency in applying these methods.
5. Build strategic skills that enable attendees to solve new data management problems.
Ginny Pannabecker, Life Science & Scholarly Communications Librarian at Virginia Tech, is an ACRL Science and Technology Section (STS) liaison to the American Institute of Biological Sciences (AIBS). This presentation shares key points for librarians and researchers from an AIBS workshop on "Changing Practices in Data Publications," which took place in December 2014 and involved representatives from federal funding agencies; publishers and librarians; scientific societies and journals; and data services / providers.
Presentation given at the TNSCORE 2012 annual conference. Tennessee is a designated EPSCoR state. EPSCoR (Experimental Program to Stimulate Competitive Research) is a program administered by the National Science Foundation to assist states in boosting the level of research funding provided by NSF. This presentation gave an overview of the development of cyberinfrastructure in the state as well as planned future improvements.
Overview of Research Data Census preliminary findings for Montana State University Research Council, Sept 3, 2015, Jerry Sheehan, see also http://er.educause.edu/articles/2015/9/data-intensive-science-and-campus-it
Key lecture for the EURO-BASIN Training Workshop on Introduction to Statistical Modelling for Habitat Model Development, 26-28 Oct, AZTI-Tecnalia, Pasaia, Spain (www.euro-basin.eu)
This presentation was provided by Maria Praetzellis of California Digital Library, during the NISO hot topic virtual conference "Effective Data Management," which was held on September 29, 2021.
Meeting the NSF DMP Requirement June 13, 2012IUPUI
June 13 version of the IUPUI workshop Meeting the NSF Data Management Plan Requirement: What you need to know. This workshop is co-sponsored by the Office of the Vice Chancellor for Research and the University Library.
Sallans RDAP11 NSF Data Management Plan Case StudiesASIS&T
NSF Data Management Plan Case Studies
The 2nd Research Data Access and Preservation (RDAP) Summit
An ASIS&T Summit
March 31-April 1, 2011 Denver, CO
In cooperation with the Coalition for Networked Information
http://asist.org/Conferences/RDAP11/index.html
Meeting the NSF DMP Requirement: March 7, 2012IUPUI
March 7 version of the IUPUI workshop Meeting the NSF Data Management Plan Requirement: What you need to know. This workshop is co-sponsored by the Office of the Vice Chancellor for Research and the University Library.
Have you implemented a Data Management Plan (DMP) tool at your institution or are you currently involved in discussions to implement one? Would you like to connect with others who are involved in implementing DMPs? Then this webinar is for you!
This webinar will bring together those involved in planning or implementing DMP to exchange information and explore ideas around DMP.
>>>>>>>>>>>>>>>>>>>>>>>>
Kathryn Unsworth and Natasha Simons lead the conversation by starting off with a few thoughts on:
-- a wrap up of the DMP Birds of a Feather session at eResearch Australasia (Oct 2016)
-- DMPs v2
-- discussion around DMPs as Thing 15 in the 23 (Research Data) Things program
-- and some thought provoking ideas.
This section WILL be recorded.
Then open up for discussion - NOT recorded.
We will also be looking to gauge interest in the formation of a DMP Community of Practice in Australia.
>>>>>>>>>>>>>>>>>>>>>>>>
Background:
Significant advocacy and technical enterprise have been directed towards the development and use of DMP tools. However, the agents and motivations driving DMP use differ, presenting use cases to explore and questions to be answered:
-- Why implement a DMP tool?
-- Does DMP use align with an agent’s motivations and more importantly with intended outcomes?
-- What are the expected outcomes?
-- Is there a one-size-fits-all DMP?
-- Is best practice for researchers an aim or a hoped-for by product?
>>>>>>>>>>>>>>>>>>>>>>>>
More info about DMPs: http://www.ands.org.au/working-with-data/data-management/data-management-plans
Australian DMP examples: https://projects.ands.org.au/policy.php
>>>>>>>>>>>>>>>>>>>>>>>>
Contact:
Kathryn.Unsworth@ands.org.au
Natasha.Simons@ands.org.au
How to develop a Pilot Data Management Infrastructure for Biomedical Research...Meik Poschen
This presentation introduces the 'MaDAM Pilot data management infrastructure for biomedical researchers at University of Manchester' project (funded under the Infrastructure strand of the JISC Managing Research Data Programme between October 2009 and June 2010) which aims at producing a technical & governance solution based on researchers’ requirements with flexibility to meet needs across multiple research groups/disciplines and taking into account the institutional landscape and its policies. This encompasses data capture, data storage and data curation, and is designed to add value both to the full lifecycle of research projects and also by making data readily available for reuse.
The pilot is focusing on a specific domain area (Life and Medical Sciences) as input to a wider strategic activity to address the needs of the whole of the University research community. User engagement in the MaDAM project focuses on an iterative user-driven/bottom-up development process together with collecting non-technical requirements.
Similar to DMPTool: Integration with other open science software (20)
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Andrew Sallans
The Center for Open Science (COS) was founded as a non-profit technology start-up in 2013 with the goal of improving transparency and reproducibility by connecting the scholarly workflow. COS achieves this goal through the development of a free, open source web application called the Open Science Framework (OSF), providing features like file sharing and citing, persistent urls, provenance tracking, and automated versioning. Initial workflow API connections focused on storage services and included Figshare, GitHub, Amazon S3, Dropbox, and Dataverse. The team is now working to connect other parts of the workflow with services like DMPTool, Databib/re3data, and Databrary. This session will introduce the core architecture and the problems that it solves, and illustrate how connecting services can benefit everyone involved in supporting the research ecosystem. COS is funded through the generosity of grants from the Laura and John Arnold Foundation, the John Templeton Foundation, the Alfred P. Sloan Foundation, the Association of Research Libraries, and others.
Presented at CNI Fall 2014, Washington, DC.
Open Science Framework (OSF): Presentation and TrainingAndrew Sallans
Presentation Date: December 12, 2013.
Location: UC Berkeley, CA
Presenters: Johanna Cohoon & Andrew Sallans (Center for Open Science)
Center for Open Science website: http://centerforopenscience.org
Berkeley Initiative for Transparency in the Social Sciences website: http://bitss.org/annual-meeting/2013-2/
UVa Library Scientific Data Consulting Group (SciDaC): New Partnerships and...Andrew Sallans
A. Sallans. "UVa Library Scientific Data Consulting Group (SciDaC): New Partnerships and Services to Support Scientific Data in the Library." Presented at the 2011 International Association for Social Science Information Services and Technology.
A. Sallans. "Practical Applications of e-Science." Presented at the 2011 eScience Bootcamp at the University of Virginia's Claude Moore Health Sciences Library. 4 March 2011
Understanding the Big Picture of e-ScienceAndrew Sallans
A. Sallans. "Understanding the Big Picture of e-Science." Presented at the 2011 eScience Bootcamp at the University of Virginia's Claude Moore Health Sciences Library. 4 March 2011
NSF Data Management Plan - Implications for LibrariansAndrew Sallans
A. Sallans. "NSF Data Management Plan - Implications for Librarians." Presented at the Science and Technology Section (STS) Hot Topics Discussion Group Meeting of the American Library Association's 2011 Midwinter Meeting. 8 January 2011
2. A document that describes what you will do
with your data during your research & after
you complete your project
What is a data management plan?
From Flickr by spanaut
A natural fit with
interests in open
science and
reproducibility of
research.
4. DMP supplement may include:
1. the types of data, samples, physical collections, software, curriculum
materials, and other materials to be produced in the course of the project
2. the standards to be used for data and metadata format and content
(where existing standards are absent or deemed inadequate, this should
be documented along with any proposed solutions or remedies)
3. policies for access and sharing including provisions for appropriate
protection of privacy, confidentiality, security, intellectual property, or
other rights or requirements
4. policies and provisions for re-use, re-distribution, and the production of
derivatives
5. plans for archiving data, samples, and other research products, and
for preservation of access to them
NSF DMP Requirements
From Grant Proposal Guidelines:
5. NSF’s Vision*
DMPs and their evaluation will grow & change over
time (similar to broader impacts)
Peer review will determine next steps
Community-driven guidelines
Different disciplines have different definitions of acceptable
data sharing
Flexibility at the directorate and division levels
Tailor implementation of DMP requirement
Evaluation will vary with directorate, division, &
program officer
*Unofficially
Help from Jennifer Schopf, NSF
6. Challenge
Researchers don’t know the necessary things
to meet these new requirements
Institutions are on the hook to ensure
compliance
Proper compliance involves integration of
many things (ie. Standards, policies, practices,
infrastructure, etc.)
Huge funding at stake
7. DMPTool:
Data management planning made EASIER
Data management is complex and requires a
dialogue amongst many stakeholders
There’s a range of understanding and
available support resources
DMPTool focus is on:
simplifying and scaling the common parts,
developing a community, and
providing functionality and integration with other
tools to advance services when possible
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18. Access
Added to campus single sign-on service
InCommon / Shibboleth
Researchers use campus login
19. Funder Requirements
Supported
Gordon and Betty Moore
Foundation
IMLS
NEH – Office of Digital
Humanities
NIH
NOAA
NSF – General
NSF - Astronomical Sciences
NSF - Atmospheric and
Geospace Sciences
NSF - Biological Sciences
NSF – Chemistry
NSF – Computer and
Information Science and
Engineering
NSF – Earth Sciences
NSF – Education and Human
Resources
NSF – Emerging Frontiers in
Research and Innovation
NSF – Engineering
NSF – Materials Research
NSF – Social, Behavioral and
Economic
NSF - Physics
20. New Areas of Functionality in
2013
Granular
modeling of plan
templates
Granular
modeling of
institutions
Role-based user
authorization
DMP life cycle
management
Organizational
planning
activities
Enhanced
search and
browse
Institutional
branding
Search and
reporting for
business
intelligence
Advanced
administrative
interface
Collaborative
plan creation
Open API
23. What we need from you…
New use cases
Opportunities to integrate with other relevant
tools
Partnership on development of API
requirements during 2013 project push
24. Thank you!
Please contact me at any time with ideas or
opportunities:
Andrew Sallans
als9q@virginia.edu
434-243-2180
Twitter: asallans
Editor's Notes
Federal Funding Accountability Transparency Act was passed in 2006, which ensures that the public can access information on all entities and organizations receiving Federal funds [National Institutes of Health, which began requiring data management plans for large grant proposals. Several government funders have since followed suit, including the National Science Foundation (NSF), National Oceanic and Atmospheric Administration (NOAA), U.S. Department of Agriculture (USDA), the Environmental Protection Agency (EPA)