Open data oct 2013


Published on

Published in: Technology, Education
  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Open data oct 2013

  1. 1. Open Data Margaret Henderson Director, Research Data Management VCU Libraries
  2. 2. Bobby Mckay's photos via Getty Images
  3. 3. NIH Public Access Policy SEC. 218. The Director of the National Institutes of Health shall require that all investigators funded by the NIH submit or have submitted for them to the National Library of Medicine’s PubMed Central an electronic version of their final peer-reviewed manuscripts upon acceptance for publication, to be made publicly available no later than 12 months after the official date of publication: Provided, That the NIH shall implement the public access policy in a manner consistent with copyright law.
  4. 4. NIH Data Sharing Policy The Final NIH Statement on Sharing Research Data was published in the NIH Guide on February 26, 2003. This is an extension of NIH policy on sharing research resources, and reaffirms NIH support for the concept of data sharing. The new policy becomes effective with the October 1, 2003 receipt date for applications or proposals to NIH. ● ● ● Application seeking $500K or more in direct costs in any single year of the project period, or As a special requirement of a Funding Opportunity Announcement (FOA) Applies to the sharing of Final Research Data for applications submitted on or after October 1, 2003
  5. 5. NSF Data Management Plans NSF Data Sharing Policy Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants. Grantees are expected to encourage and facilitate such sharing. See Award & Administration Guide (AAG) Chapter VI.D.4. NSF Data Management Plan Requirements Proposals submitted or due on or after January 18, 2011, must include a supplementary document of no more than two pages labeled “Data Management Plan”. This supplementary document should describe how the proposal will conform to NSF policy on the dissemination and sharing of research results. See Grant Proposal Guide (GPG) Chapter II.C.2.j for full policy implementation.
  6. 6. OSTP Memorandum:Increasing Access to the Results of Federally Funded Scientific Research February 22, 2013 “ensuring that, … the direct results of federally funded scientific research are made available to and useful for the public, industry, and the scientific community. Such results include peer-reviewed publications and digital data.” “develop plans to make the results of federally-funded research publically available free of charge within 12 months after original publication.”
  7. 7. 4. Objectives for Public Access to Scientific Data in Digital Formats “digitally formatted scientific data resulting from unclassified research supported wholly or in part by Federal funding should be stored and publicly accessible to search, retrieve, and analyze.” Data defined: “the digital recorded factual material commonly accepted in the scientific community as necessary to validate research findings including data sets used to support scholarly publications, but does not include laboratory notebooks, preliminary analyses, drafts of scientific papers, plans for future research, peer review reports, communications with colleagues, or physical objects, such as laboratory specimens.”
  8. 8. b)Ensure that all extramural researchers receiving Federal grants and contracts for scientific research and intramural researchers develop data management plans, as appropriate, describing how they will provide for long-term preservation of, and access to, scientific data in digital formats resulting from federally funded research, or explaining why long-term preservation and access cannot be justified
  9. 9. c) Allow the inclusion of appropriate costs for data management and access in proposals for Federal funding for scientific research;
  10. 10. f) Promote the deposit of data in publicly accessible databases, where appropriate and available;
  11. 11. Who Owns Your Data? “Consistent with federal policy and prevailing higher education practice, Research Data belong to the University.” “All Research Data shall be preserved in the custody of, or as arranged by, the Principal Investigator on behalf of the University.” Research Data Ownership, Office of Research Retention, and Access,%20Retention%20&%20Access.pdf
  12. 12. by caribb on flickr CC BY-NC-ND 2.0
  13. 13. Benefits to Data Sharing • Helps to avoid duplication, thereby reducing costs and wasted effort • Promotes scientific integrity and debate • Enables scrutiny of research findings and allows for validation of results • Leads to new collaborations between data users and data creators • Improves research and leads to better science • Increases citations* * A study by Piwowar, Day and Fridsma showed a 69% increase in citation,
  14. 14. NIH Reasons Data sharing achieves many important goals for the scientific community, such as ● reinforcing open scientific inquiry ● encouraging diversity of analysis and opinion, ● promoting new research, testing of new or alternative hypotheses and methods of analysis ● supporting studies on data collection methods and measurement ● facilitating education of new researchers ● enabling the exploration of topics not envisioned by the initial investigators ● permitting the creation of new datasets by combining data from multiple sources. htm
  15. 15. Data Sharing Examples Examples of shared epidemiologic data include the Framingham Heart Study, the Honolulu Heart Program, the Atherosclerosis Risk in Communities, Epidemiology of Chronic Disease in the Oldest Old, and the Iowa 65+ Rural Health Study. Examples of shared data from clinical trials include the Asymptomatic Cardiac Ischemia Pilot, the Intermittent Positive Pressure Breathing Study, and the Safety and Efficacy Trial of Zidovudine for Asymptomatic HIV Infected Individuals. Examples of shared datasets from the basic sciences include a growing number of genome sequences and maps, as well as protein and nucleotide databases (see ENTREZ http://www.ncbi.nlm.nih. gov/Database/index.html and other resources for molecular biology at the National Center for Biotechnology Information at http://www.
  16. 16. Data Sharing Policies NIH implemented a “Policy for Sharing of Data Obtained in NIH Supported or Conducted Genome-Wide Association Studies (GWAS)” (see html). “The NIH is interested in advancing genome-wide association studies (GWAS) to identify common genetic factors that influence health and disease. For the purposes of this policy, a genome-wide association study is defined as any study of genetic variation across the entire human genome that is designed to identify genetic associations with observable traits (such as blood pressure or weight), or the presence or absence of a disease or condition. 1 Whole genome information, when combined with clinical and other phenotype data, offers the potential for increased understanding of basic biological processes affecting human health, improvement in the prediction of disease and patient care, and ultimately the realization of the promise of personalized medicine. In addition, rapid advances in understanding the patterns of human genetic variation and maturing high-throughput, cost-effective methods for genotyping are providing powerful research tools for identifying genetic variants that contribute to health and disease.”
  17. 17.