Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Ag Data Commons for AgBioData


Published on

A presentation to the AgBioData monthly webinar:
On August 1, 2018

Published in: Technology
  • The Bulimia Recovery Program, We Recovered, You CAN TOO! ◆◆◆
    Are you sure you want to  Yes  No
    Your message goes here
  • Be the first to like this

Ag Data Commons for AgBioData

  1. 1. Ag Data Commons Cynthia Parr USDA ARS National Agricultural Library A platform to harness the power of Digital Agriculture
  2. 2. Agricultural Data (Gather) Agricultural Knowledge (Transform) Agricultural Decision-making and Action (Translate) Why Ag Data Commons? Federal directives: Public access to open, machine-readable data
  3. 3. Photo credit: Alpha Stock Images CC BY SA 3.0 USDA Enterprise Data Management USDA Public Access Policy ARS OSQR Procedure NIFA RFP, Terms and Conditions Cooperative agreements and contracts • Data Management Plan • Data to be made public in trusted repository within 30 months unless private, proprietary, or sensitive • Datasets to be cataloged at Ag Data Commons with appropriate identifiers
  4. 4. PLOS ONE Data Availability: 20% Currently in Repositories U41A: How Safe and Persistent Is Your Research? AGU Fall Meeting, December 14, 2017 Kerry Kroffe, Director, Editorial Services, PLOS ”Enabling FAIR Data” initiative • Journal will require all data supporting the article be in a data citation and described in the Data Availability Statement • Editors and reviewers enforce policy • Ensure NO data is in the supplement • Repository selected by author must be FAIR-compliant • Journal community adopts and enforces FAIR principles Citation: Stall, S. (2017), Enabling findable, accessible, interoperable, and reusable data, Eos, 98, Published on 15 September 2017.
  5. 5. 22% 34% 2%2% 40% Required Encouraged Over half of top agricultural journals encourage or require open data n = 50 Where USDA researchers published in 2016 (thanks Jon Sears) 17% 78% 5% Yes No Undetermined Researchers have few options for open submission in domain- specific databases n = 235 (thanks Erin Antognoli) Where ag researchers deposit data in 2016
  6. 6. The Concept • Discovery Interface • Catalog • APIs • Computational Tools • Data Analytic Tools Ag Data Commons Knowledge Base Data Producers Data Consumers •Publications •Patents •Grant Info. Federal Repository (I) University Repository (K) Industry Repository (N) Experiment Devices Farm Equipment UAVs, Sensors
  7. 7. FAIR Data Principles Catalog and repository ecosystem Self-submission & harvesting Currently all open data, linked to literature Currently USDA-funded datasets and databases 11% of records have data in our repository – issuing DOIs Ag Data Commons
  8. 8. 8 Public interactive monthly platform statistics Registered Users Catalogued Datasets Downloads Citations
  9. 9. 9 Organizing datasets Photo credit: Anjuli_ayer CC- BY-NC-SA
  10. 10. 10 Ag Data Commons Topics NAL Thesaurus Terms
  11. 11. ARS National Programs 11
  12. 12. ARS National Program 301 12
  13. 13. AgBioData program 13
  14. 14. AgBioData program 14
  15. 15. 15 Harvesting metadata Photo: CC BY Tony Walmsley
  16. 16. Harvesting metadata in DKAN 16 E.g. NCBI Bioprojects USDA NAL Geodata USFS Research Data Archive E.g. Project Open Data, CSW, OAI-PMH
  17. 17. Harvesting from distributed repositories • Avoids duplication of submission effort • More exposure = more impact • Distributes costs for storage • Keeps to specialized platforms for communities • Usually lacks funding information • Many lack DOIs • Many lack methodological detail • Challenging to match up with associated articles 17
  18. 18. Making data machine readable, linked Promoting shared standards JSON, RDF Data dictionary CSV, API, DB, code Ag Data Commons frictionlessdata.ioscience
  19. 19. NAL Resources Ag Data Commons Data Management Plans NOW REQUIRED BY MOST FUNDERS NAL provides online resources & will provide consultation on draft DMPs click on DATA 20
  20. 20. DISCUSSION How can Ag Data Commons help AgBioData • Harvesting metadata? • DOI service for subsets or entire versions of datasets? • Compliance: linking data to grant and award numbers? • Linking data to citations (re-use)? • Discoverability? • Collecting consistent documentation and API information? • Transformation services? • Other? 21