Open data and the ag data commons

Cyndy Parr
Cyndy ParrBiologist and technologist at National Agricultural Library
Open Data and
The Ag Data Commons
Presented by
Cyndy Parr & Erin Antognoli
April 25, 2019
1
Agenda
Open data
● Definition and basics
Ag Data Commons
● USDA research data catalog
● Open agricultural data
National Agricultural Library services
● Data dictionaries
● Data management plans
2
Open Data
The basics and background
3
Open data policy history
2013 - Obama administration’s open data policy memo
Directs all federal agencies to publish their information as machine-readable data, using
searchable, open formats
Required every agency to maintain a centralized Enterprise Data Inventory that lists all
data sets
Mandated a centralized inventory for the whole government – the platform currently
known as data.gov
2019 - OPEN Government Data Act becomes law
https://project-open-data.cio.gov/policy-memo/
https://www.congress.gov/bill/115th-congress/house-bill/4174/text 4
Public access policy history
2013 - “Holdren memo” issued by Office of Science and Technology Policy
2014 - USDA Implementation Plan approved
2016 - USDA Public Access Policy for Scholarly Publications approved
● CHORUS will provide access to many published articles
● Submission of accepted manuscripts to PubAg (pubag.data.nal.gov) is imminent
2019 - Anticipate approval of USDA Public Access Policy for Digital Scientific
Data
https://go.usa.gov/xmB9a https://go.usa.gov/xmB92 5
Open data is...
“...data that can be freely used, re-used and redistributed by anyone - subject
only, at most, to the requirement to attribute and sharealike.”
~ Open Data Handbook
Why is a clear definition of open data important?
Interoperability - different datasets should be able to work together
● Availability and access
● Re-use and redistribution
● Universal participation
http://opendatahandbook.org/guide/en/what-is-open-data/ 6
Availability and Access
“The data must be available as a whole and at no more than a reasonable
reproduction cost, preferably by downloading over the internet. The data
must also be available in a convenient and modifiable form.”
http://opendatahandbook.org/guide/en/what-is-open-data/ 7
Re-use and Redistribution
“The data must be provided
under terms that permit
re-use and redistribution
including the intermixing
with other datasets.”
http://opendatahandbook.org/guide/en/what-is-open-data/ 8
Universal Participation
“Everyone must be able to use, re-use and redistribute - there should be no
discrimination against fields of endeavour or against persons or groups. For
example, ‘non-commercial’ restrictions that would prevent ‘commercial’ use,
or restrictions of use for certain purposes (e.g. only in education), are not
allowed.”
9
FAIR principles reinforce open data
Findable
Accessible
Interoperable
Reusable
FINDABLE
Rich metadata
Persistent identifiers
INTEROPERABLE
Open formats
Common metadata
standards
Controlled vocabularies
REUSABLE
Usage license
Provenance
Community standards
ACCESSIBLE
Fixity
Data & metadata
available to target
audience
FAIR Principles
https://www.force11.org/group/fairgroup/fairprinciples 10
Ag Data Commons
USDA open agricultural data
11
The Ag Data Commons is...
● A catalog and data repository for open
agricultural research data
● The catalog for all USDA-funded research data
● Satisfies the federal open data requirements
● Satisfies the USDA public access requirements
https://data.nal.usda.gov/
12
Ag Data Commons collection policies
Ag-related data
● Many high-level categories - i.e. Agroecosystems &
Environment, Agricultural Economics, Bioenergy,
Agricultural Products, etc.
USDA Funding
● USDA-funded data or data from USDA
researchers working on collaborative projects
DOI
● Assigned for locally
held resources
Version policy
https://data.nal.usda.gov/ 13
Ag Data Commons features
Groups by project or affiliation
● Programs can request a tag to keep
all their data entries grouped
together
● Data hierarchies one level deep
supported (parent / child)
ORCID integration
● Authors can link to their profiles to
prevent ambiguity
Citations
● Specify a citation for your own
data
● Link to scholarly publications
or data papers / PubAg
● Link to other related data
content
https://data.nal.usda.gov/ 14
Submission limitations
Data should have ties to USDA
● Funder, collaborator, or employer
File size - 20 GB per file max
● Larger size data storage pilot underway!
No executables allowed
● Executables can be cataloged with a pointer to
the software/code, but not deposited directly
https://data.nal.usda.gov/ 15
Submit ag-related data
Create an account
● https://data.nal.usda.gov/user/register
Data submission form
● Metadata entry
● Workflow tools
● Clone metadata
● Separate descriptions for each
resource file
Metadata - Project Open Data
● Open standard
● Formatted for ingest into
data.gov
● https://project-open-data.cio.gov/
schema/https://data.nal.usda.gov/ 16
Data dictionaries
Advancing open data through
transparency and reusability
17
A data dictionary is...
… a collection of descriptions of the data objects or items in a
dataset or model for the benefit of programmers and others who
need to refer to them.
18
Ag Data Commons supports data dictionaries
Encouraged as part of catalog entry in the Ag Data Commons
● A special designation for data dictionary resources in the submission form
● CSV format preferred, other machine-readable formats accepted
19
NAL offers data dictionary resources
Ag Data Commons submission manual
● https://data.nal.usda.gov > under the About tab
● Instructions for automatic and manual generation
● Blank template
Data dictionary webinars
● National Agricultural Library YouTube channel
● Link under the Ag Data Commons “About” tab
Direct questions / advice / help
● NAL-ADC-Curator@ars.usda.gov
20
Data Management Plans
More steps toward open data
21
DMPs are required for USDA funding proposals
USDA funding proposals now require a
DMP
There is a specific format for NIFA DMP
- 2 pages with 5 sections*
● Expected data types
● Data formats (and standards)
● Data storage and preservation (of access)
● Data sharing, protection,
and public access
● Roles and responsibilities
*Note: Other agencies or institutions may require a different format
22
NAL assists with DMPs
USDA DMP guide
● https://www.nal.usda.gov/ks/guidelines-data-management-planning
NAL provides DMP draft review
● USDA researchers and collaborators can send their drafts to
NAL-ADC-Curator@ars.usda.gov for review
DMP Webinars
● National Agricultural Library YouTube channel
● Linked under the Ag Data Commons “About” tab
23
Other resources at NAL
Webinars
● Recordings available publicly on the NAL
YouTube channel
● Anyone may join future webinars - email
NAL-ADC-Curator@ars.usda.gov to be added
to the list
Ag Data Commons site
● Submission manual, policy pages, etc., all
linked under the “About” tab
PubAg
●https://pubag.nal.usda.gov/
Knowledge Services website
● https://www.nal.usda.gov/ks
24
Summary
Open data
● Required for federal research
● Available and accessible for reuse and
redistribution
● FAIR principles - Findable, Accessible,
Interoperable, Reusable
Ag Data Commons
● USDA’s catalog for ag research data
● Agricultural data submissions
Guidelines and assistance at NAL
● Data dictionaries
● Data management plans
25
Questions?
NAL-ADC-Curator@ars.usda.gov
26
1 of 26

Recommended

GBIF: An infrastructure for infrastructures by
GBIF: An infrastructure for infrastructures GBIF: An infrastructure for infrastructures
GBIF: An infrastructure for infrastructures Francisco Pando
101 views17 slides
Data Publishing Overview by
Data Publishing OverviewData Publishing Overview
Data Publishing OverviewRichard Huffine
514 views15 slides
DMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam Posner by
DMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam PosnerDMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam Posner
DMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam PosnerUniversity of California Curation Center
3K views26 slides
DataShare: Empowering Researcher Data Curation by
DataShare: Empowering Researcher Data CurationDataShare: Empowering Researcher Data Curation
DataShare: Empowering Researcher Data CurationUniversity of California Curation Center
841 views17 slides
FAIR data principles and data management plans - 31 Oct 2017 by
FAIR data principles and data management plans - 31 Oct 2017FAIR data principles and data management plans - 31 Oct 2017
FAIR data principles and data management plans - 31 Oct 2017ARDC
471 views9 slides
From Open Access to Open data, our initiatives by
From Open Access to Open data, our initiativesFrom Open Access to Open data, our initiatives
From Open Access to Open data, our initiativesJohannes Keizer
773 views26 slides

More Related Content

What's hot

20170530_Open Research Data in Horizon 2020 by
20170530_Open Research Data in Horizon 202020170530_Open Research Data in Horizon 2020
20170530_Open Research Data in Horizon 2020OpenAIRE
3.6K views59 slides
GODORT SLDTF 2009 Meeting Outline by
GODORT SLDTF 2009 Meeting OutlineGODORT SLDTF 2009 Meeting Outline
GODORT SLDTF 2009 Meeting Outlineuclagovinfolibrarian
241 views21 slides
Using Open Data - David Tarrant by
Using Open Data - David TarrantUsing Open Data - David Tarrant
Using Open Data - David TarrantgodanSec
713 views38 slides
Parr ag datacommonsnal_brownbag by
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagCyndy Parr
513 views14 slides
Let's be FAIR: ALLEA workshop at DARIAH annual event 2019 by
Let's be FAIR: ALLEA workshop at DARIAH annual event 2019Let's be FAIR: ALLEA workshop at DARIAH annual event 2019
Let's be FAIR: ALLEA workshop at DARIAH annual event 2019dri_ireland
331 views10 slides
CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013 by
CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013
CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013hratner
443 views19 slides

What's hot(19)

20170530_Open Research Data in Horizon 2020 by OpenAIRE
20170530_Open Research Data in Horizon 202020170530_Open Research Data in Horizon 2020
20170530_Open Research Data in Horizon 2020
OpenAIRE3.6K views
Using Open Data - David Tarrant by godanSec
Using Open Data - David TarrantUsing Open Data - David Tarrant
Using Open Data - David Tarrant
godanSec713 views
Parr ag datacommonsnal_brownbag by Cyndy Parr
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbag
Cyndy Parr513 views
Let's be FAIR: ALLEA workshop at DARIAH annual event 2019 by dri_ireland
Let's be FAIR: ALLEA workshop at DARIAH annual event 2019Let's be FAIR: ALLEA workshop at DARIAH annual event 2019
Let's be FAIR: ALLEA workshop at DARIAH annual event 2019
dri_ireland331 views
CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013 by hratner
CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013
CHORUS 5-minute Flash Talk at STM Innovations December 4, 2013
hratner443 views
Reflections on making EFSA an open science organisation by Nikos Manouselis
Reflections on making EFSA an open science organisationReflections on making EFSA an open science organisation
Reflections on making EFSA an open science organisation
Nikos Manouselis885 views
RDMRose 2.3 Institutional data repository policies by RDMRose
RDMRose 2.3 Institutional data repository policiesRDMRose 2.3 Institutional data repository policies
RDMRose 2.3 Institutional data repository policies
RDMRose319 views
A Blueprint for the Research Data Landscape by Sayeed Choudhury
A Blueprint for the Research Data LandscapeA Blueprint for the Research Data Landscape
A Blueprint for the Research Data Landscape
Sayeed Choudhury241 views
Europa requisitos y servicios en torno a los datos de investigacion by maredata
Europa requisitos y servicios en torno a los datos de investigacionEuropa requisitos y servicios en torno a los datos de investigacion
Europa requisitos y servicios en torno a los datos de investigacion
maredata212 views
iNACOL Research In Review Webinar: Blended and Online Learning Clearinghouse by iNACOL
iNACOL Research In Review Webinar: Blended and Online Learning ClearinghouseiNACOL Research In Review Webinar: Blended and Online Learning Clearinghouse
iNACOL Research In Review Webinar: Blended and Online Learning Clearinghouse
iNACOL922 views
Scaling up food safety information transparency by Nikos Manouselis
Scaling up food safety information transparencyScaling up food safety information transparency
Scaling up food safety information transparency
Nikos Manouselis2.4K views
Agricultural Data Interest Group & Wheat Data Working Group of RDA by Vassilis Protonotarios
Agricultural Data Interest Group & Wheat Data Working Group of RDAAgricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDA
D3.1.2 heterogeneous data repositories and related services by FOODIE_Project
D3.1.2 heterogeneous data repositories and related servicesD3.1.2 heterogeneous data repositories and related services
D3.1.2 heterogeneous data repositories and related services
FOODIE_Project110 views
Data Sharing Principles and Legal Interoperability for Essential Biodiversity... by agosti
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...
agosti967 views

Similar to Open data and the ag data commons

Overview of Emerging Requirements for Data Management of Federally Funded Res... by
Overview of Emerging Requirements for Data Management of Federally Funded Res...Overview of Emerging Requirements for Data Management of Federally Funded Res...
Overview of Emerging Requirements for Data Management of Federally Funded Res...Richard Huffine
657 views17 slides
Overview of Emerging Requirements for Data Management of Federally Funded Res... by
Overview of Emerging Requirements for Data Management of Federally Funded Res...Overview of Emerging Requirements for Data Management of Federally Funded Res...
Overview of Emerging Requirements for Data Management of Federally Funded Res...Richard Huffine
399 views14 slides
Compliance: Data Management Plans and Public Access to Data by
Compliance: Data Management Plans and Public Access to DataCompliance: Data Management Plans and Public Access to Data
Compliance: Data Management Plans and Public Access to DataMargaret Henderson
1K views60 slides
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J... by
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...OpenAIRE
1.8K views22 slides
H2020 data pilot openaire by
H2020 data pilot openaireH2020 data pilot openaire
H2020 data pilot openaireSarah Jones
738 views22 slides
Ag Data Commons: Adding Value to open agricultural research data by
Ag Data Commons: Adding Value to open agricultural research dataAg Data Commons: Adding Value to open agricultural research data
Ag Data Commons: Adding Value to open agricultural research dataCyndy Parr
974 views14 slides

Similar to Open data and the ag data commons(20)

Overview of Emerging Requirements for Data Management of Federally Funded Res... by Richard Huffine
Overview of Emerging Requirements for Data Management of Federally Funded Res...Overview of Emerging Requirements for Data Management of Federally Funded Res...
Overview of Emerging Requirements for Data Management of Federally Funded Res...
Richard Huffine657 views
Overview of Emerging Requirements for Data Management of Federally Funded Res... by Richard Huffine
Overview of Emerging Requirements for Data Management of Federally Funded Res...Overview of Emerging Requirements for Data Management of Federally Funded Res...
Overview of Emerging Requirements for Data Management of Federally Funded Res...
Richard Huffine399 views
Compliance: Data Management Plans and Public Access to Data by Margaret Henderson
Compliance: Data Management Plans and Public Access to DataCompliance: Data Management Plans and Public Access to Data
Compliance: Data Management Plans and Public Access to Data
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J... by OpenAIRE
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
OpenAIRE1.8K views
H2020 data pilot openaire by Sarah Jones
H2020 data pilot openaireH2020 data pilot openaire
H2020 data pilot openaire
Sarah Jones738 views
Ag Data Commons: Adding Value to open agricultural research data by Cyndy Parr
Ag Data Commons: Adding Value to open agricultural research dataAg Data Commons: Adding Value to open agricultural research data
Ag Data Commons: Adding Value to open agricultural research data
Cyndy Parr974 views
Research data sharing by CGIAR
Research data sharingResearch data sharing
Research data sharing
CGIAR664 views
2012 Fall Data Management Planning Workshop by Lizzy_Rolando
2012 Fall Data Management Planning Workshop2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop
Lizzy_Rolando410 views
Inroads into Data: Getting Involved in Data at Your Institution by Margaret Henderson
Inroads into Data: Getting Involved in Data at Your InstitutionInroads into Data: Getting Involved in Data at Your Institution
Inroads into Data: Getting Involved in Data at Your Institution
Margaret Henderson579 views
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016... by EUDAT
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT6.7K views
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld... by OpenAIRE
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
OpenAIRE3.1K views
Ogc Ben Schaap june 24 2019 with link to farm data train by benschp
Ogc Ben Schaap june 24 2019 with link to farm data trainOgc Ben Schaap june 24 2019 with link to farm data train
Ogc Ben Schaap june 24 2019 with link to farm data train
benschp37 views
Open Data Ireland Public Meeting by Derilinx
Open Data Ireland Public MeetingOpen Data Ireland Public Meeting
Open Data Ireland Public Meeting
Derilinx1.5K views
Llinked open data training for EU institutions by Open Data Support
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutions
Open Data Support4.2K views
Ethiopian Open Government Data Initiative by abiyotb
Ethiopian Open Government Data InitiativeEthiopian Open Government Data Initiative
Ethiopian Open Government Data Initiative
abiyotb170 views
Ag Data Commons for AgBioData by Cyndy Parr
Ag Data Commons for AgBioDataAg Data Commons for AgBioData
Ag Data Commons for AgBioData
Cyndy Parr305 views

More from Cyndy Parr

Biodiversity informatics and the agricultural data landscape by
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeCyndy Parr
320 views16 slides
Public access to research results at USDA by
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDACyndy Parr
247 views20 slides
Ag Data Commons: Agricultural research metadata and data by
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataCyndy Parr
402 views15 slides
Ag Data Commons: A new USDA catalog and repository for agricultural research ... by
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Cyndy Parr
878 views17 slides
Preparing for data-intensive science across domains. by
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Cyndy Parr
634 views24 slides
Big Data Initiatives for Agroecosystems by
Big Data Initiatives for AgroecosystemsBig Data Initiatives for Agroecosystems
Big Data Initiatives for AgroecosystemsCyndy Parr
570 views19 slides

More from Cyndy Parr(20)

Biodiversity informatics and the agricultural data landscape by Cyndy Parr
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscape
Cyndy Parr320 views
Public access to research results at USDA by Cyndy Parr
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDA
Cyndy Parr247 views
Ag Data Commons: Agricultural research metadata and data by Cyndy Parr
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
Cyndy Parr402 views
Ag Data Commons: A new USDA catalog and repository for agricultural research ... by Cyndy Parr
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Cyndy Parr878 views
Preparing for data-intensive science across domains. by Cyndy Parr
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.
Cyndy Parr634 views
Big Data Initiatives for Agroecosystems by Cyndy Parr
Big Data Initiatives for AgroecosystemsBig Data Initiatives for Agroecosystems
Big Data Initiatives for Agroecosystems
Cyndy Parr570 views
TDWG 2014 opening talk: Chair's Welcome by Cyndy Parr
TDWG 2014 opening talk: Chair's WelcomeTDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's Welcome
Cyndy Parr665 views
Behavior ontology workshop princeton by Cyndy Parr
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princeton
Cyndy Parr518 views
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK by Cyndy Parr
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
Cyndy Parr1.2K views
Frontiers of discovery with Encyclopedia of Life by Cyndy Parr
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life
Cyndy Parr1K views
Practical interoperability across semantic stores of data for ecological, tax... by Cyndy Parr
Practical interoperability across semantic stores of data for ecological, tax...Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...
Cyndy Parr626 views
Using and extending Darwin Core for structured attribute data by Cyndy Parr
Using and extending Darwin Core for structured attribute dataUsing and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute data
Cyndy Parr622 views
How the Encyclopedia of Life is wrangling organismal attribute data by Cyndy Parr
How the Encyclopedia of Life is wrangling organismal attribute dataHow the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute data
Cyndy Parr573 views
The Road to TraitBank: What's Next for the Encyclopedia of Life by Cyndy Parr
The Road to TraitBank: What's Next for the Encyclopedia of LifeThe Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of Life
Cyndy Parr1.1K views
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ... by Cyndy Parr
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Cyndy Parr5K views
Encyclopedia of Life: Use cases for phenotypes by Cyndy Parr
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
Cyndy Parr459 views
Species pages and portals by Cyndy Parr
Species pages and portals Species pages and portals
Species pages and portals
Cyndy Parr344 views
Building EOL species pages by Cyndy Parr
Building EOL species pagesBuilding EOL species pages
Building EOL species pages
Cyndy Parr457 views
Leveraging an international infrastructure: Case studies from the Encyclopeda... by Cyndy Parr
Leveraging an international infrastructure: Case studies from the Encyclopeda...Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...
Cyndy Parr1.5K views
Introduction to EOL.org for scientists by Cyndy Parr
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientists
Cyndy Parr2K views

Recently uploaded

Data Integrity for Banking and Financial Services by
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesPrecisely
78 views26 slides
Uni Systems for Power Platform.pptx by
Uni Systems for Power Platform.pptxUni Systems for Power Platform.pptx
Uni Systems for Power Platform.pptxUni Systems S.M.S.A.
61 views21 slides
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O... by
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...ShapeBlue
88 views13 slides
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...Bernd Ruecker
50 views69 slides
Cencora Executive Symposium by
Cencora Executive SymposiumCencora Executive Symposium
Cencora Executive Symposiummarketingcommunicati21
139 views14 slides
Network Source of Truth and Infrastructure as Code revisited by
Network Source of Truth and Infrastructure as Code revisitedNetwork Source of Truth and Infrastructure as Code revisited
Network Source of Truth and Infrastructure as Code revisitedNetwork Automation Forum
52 views45 slides

Recently uploaded(20)

Data Integrity for Banking and Financial Services by Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely78 views
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O... by ShapeBlue
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
Declarative Kubernetes Cluster Deployment with Cloudstack and Cluster API - O...
ShapeBlue88 views
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker50 views
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ by ShapeBlue
Confidence in CloudStack - Aron Wagner, Nathan Gleason - AmericConfidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
ShapeBlue88 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue138 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava... by ShapeBlue
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
ShapeBlue101 views
Digital Personal Data Protection (DPDP) Practical Approach For CISOs by Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash153 views
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty62 views
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue179 views
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue by ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue103 views
Business Analyst Series 2023 - Week 4 Session 7 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7
DianaGray10126 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue158 views
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue132 views
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... by ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue85 views
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by ShapeBlue
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
ShapeBlue117 views

Open data and the ag data commons

  • 1. Open Data and The Ag Data Commons Presented by Cyndy Parr & Erin Antognoli April 25, 2019 1
  • 2. Agenda Open data ● Definition and basics Ag Data Commons ● USDA research data catalog ● Open agricultural data National Agricultural Library services ● Data dictionaries ● Data management plans 2
  • 3. Open Data The basics and background 3
  • 4. Open data policy history 2013 - Obama administration’s open data policy memo Directs all federal agencies to publish their information as machine-readable data, using searchable, open formats Required every agency to maintain a centralized Enterprise Data Inventory that lists all data sets Mandated a centralized inventory for the whole government – the platform currently known as data.gov 2019 - OPEN Government Data Act becomes law https://project-open-data.cio.gov/policy-memo/ https://www.congress.gov/bill/115th-congress/house-bill/4174/text 4
  • 5. Public access policy history 2013 - “Holdren memo” issued by Office of Science and Technology Policy 2014 - USDA Implementation Plan approved 2016 - USDA Public Access Policy for Scholarly Publications approved ● CHORUS will provide access to many published articles ● Submission of accepted manuscripts to PubAg (pubag.data.nal.gov) is imminent 2019 - Anticipate approval of USDA Public Access Policy for Digital Scientific Data https://go.usa.gov/xmB9a https://go.usa.gov/xmB92 5
  • 6. Open data is... “...data that can be freely used, re-used and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike.” ~ Open Data Handbook Why is a clear definition of open data important? Interoperability - different datasets should be able to work together ● Availability and access ● Re-use and redistribution ● Universal participation http://opendatahandbook.org/guide/en/what-is-open-data/ 6
  • 7. Availability and Access “The data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet. The data must also be available in a convenient and modifiable form.” http://opendatahandbook.org/guide/en/what-is-open-data/ 7
  • 8. Re-use and Redistribution “The data must be provided under terms that permit re-use and redistribution including the intermixing with other datasets.” http://opendatahandbook.org/guide/en/what-is-open-data/ 8
  • 9. Universal Participation “Everyone must be able to use, re-use and redistribute - there should be no discrimination against fields of endeavour or against persons or groups. For example, ‘non-commercial’ restrictions that would prevent ‘commercial’ use, or restrictions of use for certain purposes (e.g. only in education), are not allowed.” 9
  • 10. FAIR principles reinforce open data Findable Accessible Interoperable Reusable FINDABLE Rich metadata Persistent identifiers INTEROPERABLE Open formats Common metadata standards Controlled vocabularies REUSABLE Usage license Provenance Community standards ACCESSIBLE Fixity Data & metadata available to target audience FAIR Principles https://www.force11.org/group/fairgroup/fairprinciples 10
  • 11. Ag Data Commons USDA open agricultural data 11
  • 12. The Ag Data Commons is... ● A catalog and data repository for open agricultural research data ● The catalog for all USDA-funded research data ● Satisfies the federal open data requirements ● Satisfies the USDA public access requirements https://data.nal.usda.gov/ 12
  • 13. Ag Data Commons collection policies Ag-related data ● Many high-level categories - i.e. Agroecosystems & Environment, Agricultural Economics, Bioenergy, Agricultural Products, etc. USDA Funding ● USDA-funded data or data from USDA researchers working on collaborative projects DOI ● Assigned for locally held resources Version policy https://data.nal.usda.gov/ 13
  • 14. Ag Data Commons features Groups by project or affiliation ● Programs can request a tag to keep all their data entries grouped together ● Data hierarchies one level deep supported (parent / child) ORCID integration ● Authors can link to their profiles to prevent ambiguity Citations ● Specify a citation for your own data ● Link to scholarly publications or data papers / PubAg ● Link to other related data content https://data.nal.usda.gov/ 14
  • 15. Submission limitations Data should have ties to USDA ● Funder, collaborator, or employer File size - 20 GB per file max ● Larger size data storage pilot underway! No executables allowed ● Executables can be cataloged with a pointer to the software/code, but not deposited directly https://data.nal.usda.gov/ 15
  • 16. Submit ag-related data Create an account ● https://data.nal.usda.gov/user/register Data submission form ● Metadata entry ● Workflow tools ● Clone metadata ● Separate descriptions for each resource file Metadata - Project Open Data ● Open standard ● Formatted for ingest into data.gov ● https://project-open-data.cio.gov/ schema/https://data.nal.usda.gov/ 16
  • 17. Data dictionaries Advancing open data through transparency and reusability 17
  • 18. A data dictionary is... … a collection of descriptions of the data objects or items in a dataset or model for the benefit of programmers and others who need to refer to them. 18
  • 19. Ag Data Commons supports data dictionaries Encouraged as part of catalog entry in the Ag Data Commons ● A special designation for data dictionary resources in the submission form ● CSV format preferred, other machine-readable formats accepted 19
  • 20. NAL offers data dictionary resources Ag Data Commons submission manual ● https://data.nal.usda.gov > under the About tab ● Instructions for automatic and manual generation ● Blank template Data dictionary webinars ● National Agricultural Library YouTube channel ● Link under the Ag Data Commons “About” tab Direct questions / advice / help ● NAL-ADC-Curator@ars.usda.gov 20
  • 21. Data Management Plans More steps toward open data 21
  • 22. DMPs are required for USDA funding proposals USDA funding proposals now require a DMP There is a specific format for NIFA DMP - 2 pages with 5 sections* ● Expected data types ● Data formats (and standards) ● Data storage and preservation (of access) ● Data sharing, protection, and public access ● Roles and responsibilities *Note: Other agencies or institutions may require a different format 22
  • 23. NAL assists with DMPs USDA DMP guide ● https://www.nal.usda.gov/ks/guidelines-data-management-planning NAL provides DMP draft review ● USDA researchers and collaborators can send their drafts to NAL-ADC-Curator@ars.usda.gov for review DMP Webinars ● National Agricultural Library YouTube channel ● Linked under the Ag Data Commons “About” tab 23
  • 24. Other resources at NAL Webinars ● Recordings available publicly on the NAL YouTube channel ● Anyone may join future webinars - email NAL-ADC-Curator@ars.usda.gov to be added to the list Ag Data Commons site ● Submission manual, policy pages, etc., all linked under the “About” tab PubAg ●https://pubag.nal.usda.gov/ Knowledge Services website ● https://www.nal.usda.gov/ks 24
  • 25. Summary Open data ● Required for federal research ● Available and accessible for reuse and redistribution ● FAIR principles - Findable, Accessible, Interoperable, Reusable Ag Data Commons ● USDA’s catalog for ag research data ● Agricultural data submissions Guidelines and assistance at NAL ● Data dictionaries ● Data management plans 25