SlideShare a Scribd company logo
Ag Data Commons
Cynthia Parr
USDA ARS National Agricultural Library
A platform to harness the power of Digital Agriculture
Agricultural
Data
(Gather)
Agricultural
Knowledge
(Transform)
Agricultural
Decision-making
and Action
(Translate)
Why Ag Data Commons?
Federal directives:
Public access to open,
machine-readable data
Photo credit: Alpha Stock Images CC BY SA 3.0
USDA Enterprise Data Management
USDA Public Access Policy
ARS OSQR Procedure
NIFA RFP, Terms and Conditions
Cooperative agreements and contracts
• Data Management Plan
• Data to be made public in trusted
repository within 30 months unless
private, proprietary, or sensitive
• Datasets to be cataloged at Ag Data
Commons with appropriate identifiers
PLOS ONE Data Availability:
20% Currently in Repositories
U41A: How Safe and Persistent Is Your Research?
AGU Fall Meeting, December 14, 2017
Kerry Kroffe, Director, Editorial Services, PLOS
”Enabling FAIR Data” initiative
• Journal will require all data
supporting the article be in a data
citation and described in the Data
Availability Statement
• Editors and reviewers enforce
policy
• Ensure NO data is in the
supplement
• Repository selected by author must
be FAIR-compliant
• Journal community adopts and
enforces FAIR principles
Citation: Stall, S. (2017), Enabling findable, accessible, interoperable, and
reusable data, Eos, 98, https://doi.org/10.1029/2018EO081907. Published on 15
September 2017.
22%
34%
2%2%
40%
Required
Encouraged
Over half of top agricultural
journals encourage or require
open data
n = 50
Where USDA researchers published in 2016
(thanks Jon Sears)
17%
78%
5%
Yes
No
Undetermined
Researchers have few
options for open
submission in domain-
specific databases
n = 235 (thanks Erin Antognoli)
Where ag researchers deposit data in 2016
The Concept
• Discovery Interface
• Catalog
• APIs
• Computational Tools
• Data Analytic Tools
Ag Data Commons
Knowledge Base
Data Producers Data Consumers
•Publications
•Patents
•Grant Info.
Federal
Repository
(I)
University
Repository
(K)
Industry
Repository
(N)
Experiment Devices
Farm Equipment
UAVs, Sensors
FAIR Data Principles
Catalog and repository
ecosystem
Self-submission &
harvesting
Currently all open data,
linked to literature
Currently USDA-funded
datasets and databases
11% of records have data
in our repository –
issuing DOIs
Ag Data Commons https://data.nal.usda.gov/
8
Public interactive monthly platform statistics
Registered Users Catalogued Datasets
Downloads Citations
9
Organizing datasets
Photo credit:
Anjuli_ayer CC-
BY-NC-SA
10
Ag Data Commons Topics
NAL Thesaurus Terms
https://agclass.nal.usda.gov
ARS National Programs
11
https://www.ars.usda.gov/research/datasets/
ARS National Program 301
12
AgBioData program
13
AgBioData program
14
15
Harvesting metadata
Photo: CC BY
Tony Walmsley https://flic.kr/p/Ws9Nec
Harvesting metadata in DKAN
16
E.g. NCBI Bioprojects
USDA NAL Geodata
USFS Research Data Archive
E.g. Project Open Data,
CSW, OAI-PMH
Harvesting from distributed repositories
• Avoids duplication of submission effort
• More exposure = more impact
• Distributes costs for storage
• Keeps to specialized platforms for communities
• Usually lacks funding information
• Many lack DOIs
• Many lack methodological detail
• Challenging to match up with associated articles
17
Making data machine readable, linked
Promoting shared standards
JSON, RDF
Data dictionary
CSV, API, DB, code
Ag Data Commons data.nal.usda.gov
frictionlessdata.ioscience
NAL Resources
Ag Data Commons
https://data.nal.usda.gov
Data Management Plans
NOW REQUIRED BY MOST FUNDERS
NAL provides online resources & will
provide consultation on draft DMPs
https://www.nal.usda.gov/
click on DATA
20
DISCUSSION
How can Ag Data Commons help AgBioData
• Harvesting metadata?
• DOI service for subsets or entire versions of datasets?
• Compliance: linking data to grant and award numbers?
• Linking data to citations (re-use)?
• Discoverability?
• Collecting consistent documentation and API
information?
• Transformation services?
• Other?
21

More Related Content

What's hot

OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE
 
CSIRO investing in the future of data - John Morrissey
CSIRO investing in the future of data - John Morrissey CSIRO investing in the future of data - John Morrissey
CSIRO investing in the future of data - John Morrissey
ARDC
 
AReS and Altmetrics: How we use them at ILRI
AReS and Altmetrics: How we use them at ILRIAReS and Altmetrics: How we use them at ILRI
AReS and Altmetrics: How we use them at ILRI
ILRI
 
Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis
Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis
Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis
Blue BRIDGE
 
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
A Lined Data Approach to Interoperability between Biomedical Resource Invento...A Lined Data Approach to Interoperability between Biomedical Resource Invento...
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
Trish Whetzel
 
#4 FAIR - Keith Russell
#4 FAIR - Keith Russell #4 FAIR - Keith Russell
#4 FAIR - Keith Russell
ARDC
 
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
Kerstin Lehnert
 
Building a national Data Repository Data Modelling
Building a national Data Repository Data ModellingBuilding a national Data Repository Data Modelling
Building a national Data Repository Data Modelling
Jisc RDM
 
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17
ARDC
 
Semantic Web Tools For Agricultural Materials
Semantic Web Tools For Agricultural MaterialsSemantic Web Tools For Agricultural Materials
Semantic Web Tools For Agricultural Materials
Gerard Sylvester
 
Building a National Data Service Open Repositories 2018
Building a National Data Service Open Repositories 2018Building a National Data Service Open Repositories 2018
Building a National Data Service Open Repositories 2018
Jisc RDM
 
Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021
Monika Solanki
 
Bioschemas overview
Bioschemas overviewBioschemas overview
Bioschemas overview
Bioschemas
 
Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFSAaike De Wever
 
Responsible Research Data Management - RMIT - Mar 19
Responsible Research Data Management - RMIT - Mar 19Responsible Research Data Management - RMIT - Mar 19
Responsible Research Data Management - RMIT - Mar 19
Richard Ferrers
 
PLOS ALM Talk on UC3 Services and Altmetrics
PLOS ALM Talk on UC3 Services and AltmetricsPLOS ALM Talk on UC3 Services and Altmetrics
PLOS ALM Talk on UC3 Services and Altmetrics
Carly Strasser
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...
Repository Fringe
 
Big Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben EvansBig Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben Evans
ARDC
 
Libraries & Research Data Management for CO Alliance of Resrch Libraries
Libraries & Research Data Management for CO Alliance of Resrch LibrariesLibraries & Research Data Management for CO Alliance of Resrch Libraries
Libraries & Research Data Management for CO Alliance of Resrch Libraries
Carly Strasser
 
FAIRsharing poster
FAIRsharing posterFAIRsharing poster
FAIRsharing poster
Susanna-Assunta Sansone
 

What's hot (20)

OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)
 
CSIRO investing in the future of data - John Morrissey
CSIRO investing in the future of data - John Morrissey CSIRO investing in the future of data - John Morrissey
CSIRO investing in the future of data - John Morrissey
 
AReS and Altmetrics: How we use them at ILRI
AReS and Altmetrics: How we use them at ILRIAReS and Altmetrics: How we use them at ILRI
AReS and Altmetrics: How we use them at ILRI
 
Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis
Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis
Datasets, Services & Tools - Pangiota Koltsida & Dimitris Gavrilis
 
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
A Lined Data Approach to Interoperability between Biomedical Resource Invento...A Lined Data Approach to Interoperability between Biomedical Resource Invento...
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
 
#4 FAIR - Keith Russell
#4 FAIR - Keith Russell #4 FAIR - Keith Russell
#4 FAIR - Keith Russell
 
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
 
Building a national Data Repository Data Modelling
Building a national Data Repository Data ModellingBuilding a national Data Repository Data Modelling
Building a national Data Repository Data Modelling
 
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17
 
Semantic Web Tools For Agricultural Materials
Semantic Web Tools For Agricultural MaterialsSemantic Web Tools For Agricultural Materials
Semantic Web Tools For Agricultural Materials
 
Building a National Data Service Open Repositories 2018
Building a National Data Service Open Repositories 2018Building a National Data Service Open Repositories 2018
Building a National Data Service Open Repositories 2018
 
Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021Monika solanki-agrisemantics2021
Monika solanki-agrisemantics2021
 
Bioschemas overview
Bioschemas overviewBioschemas overview
Bioschemas overview
 
Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFS
 
Responsible Research Data Management - RMIT - Mar 19
Responsible Research Data Management - RMIT - Mar 19Responsible Research Data Management - RMIT - Mar 19
Responsible Research Data Management - RMIT - Mar 19
 
PLOS ALM Talk on UC3 Services and Altmetrics
PLOS ALM Talk on UC3 Services and AltmetricsPLOS ALM Talk on UC3 Services and Altmetrics
PLOS ALM Talk on UC3 Services and Altmetrics
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...
 
Big Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben EvansBig Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben Evans
 
Libraries & Research Data Management for CO Alliance of Resrch Libraries
Libraries & Research Data Management for CO Alliance of Resrch LibrariesLibraries & Research Data Management for CO Alliance of Resrch Libraries
Libraries & Research Data Management for CO Alliance of Resrch Libraries
 
FAIRsharing poster
FAIRsharing posterFAIRsharing poster
FAIRsharing poster
 

Similar to Ag Data Commons for AgBioData

Research data sharing
Research data sharingResearch data sharing
Research data sharing
CGIAR
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data Management
ARDC
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
Anita de Waard
 
Open data and the ag data commons
Open data and the ag data commonsOpen data and the ag data commons
Open data and the ag data commons
Cyndy Parr
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
GigaScience, BGI Hong Kong
 
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Cyndy Parr
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collectionsabedejesus
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
Aaike De Wever
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
Anita de Waard
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
ARDC
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
Karlsruhe Institute of Technology (KIT)
 
Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"
Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"
Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"
National Information Standards Organization (NISO)
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
Heinz Pampel
 
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
Vivien Bonazzi
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health Network
Susanna-Assunta Sansone
 
Opening up data – Jisc and CNI conference 10 July 2014
Opening up data – Jisc and CNI conference 10 July 2014Opening up data – Jisc and CNI conference 10 July 2014
Opening up data – Jisc and CNI conference 10 July 2014
Jisc
 
2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...
2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...
2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...
datacite
 
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdfEOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
Allyson Lister
 
Why we care about research data? Why we share?
Why we care about research data? Why we share?Why we care about research data? Why we share?
Why we care about research data? Why we share?
Richard Ferrers
 

Similar to Ag Data Commons for AgBioData (20)

Research data sharing
Research data sharingResearch data sharing
Research data sharing
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data Management
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
Open data and the ag data commons
Open data and the ag data commonsOpen data and the ag data commons
Open data and the ag data commons
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
 
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"
Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"
Hahnel "Open Data Policies: Opportunities, compliance and technology strategies"
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health Network
 
Opening up data – Jisc and CNI conference 10 July 2014
Opening up data – Jisc and CNI conference 10 July 2014Opening up data – Jisc and CNI conference 10 July 2014
Opening up data – Jisc and CNI conference 10 July 2014
 
2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...
2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...
2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...
 
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdfEOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdf
 
Why we care about research data? Why we share?
Why we care about research data? Why we share?Why we care about research data? Why we share?
Why we care about research data? Why we share?
 

More from Cyndy Parr

Biodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscape
Cyndy Parr
 
Public access to research results at USDA
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDA
Cyndy Parr
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
Cyndy Parr
 
Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.
Cyndy Parr
 
TDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's WelcomeTDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's Welcome
Cyndy Parr
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princeton
Cyndy Parr
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
Cyndy Parr
 
Frontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life
Cyndy Parr
 
Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...
Cyndy Parr
 
Using and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute dataUsing and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute data
Cyndy Parr
 
How the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute dataHow the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute data
Cyndy Parr
 
The Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of LifeThe Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of Life
Cyndy Parr
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Cyndy Parr
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesCyndy Parr
 
Species pages and portals
Species pages and portals Species pages and portals
Species pages and portals
Cyndy Parr
 
Building EOL species pages
Building EOL species pagesBuilding EOL species pages
Building EOL species pages
Cyndy Parr
 
Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...
Cyndy Parr
 
Introduction to EOL.org for scientists
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientists
Cyndy Parr
 
EOL and Science: Yes we can!
EOL and Science: Yes we can!EOL and Science: Yes we can!
EOL and Science: Yes we can!
Cyndy Parr
 
EOL China Center status
EOL China Center statusEOL China Center status
EOL China Center status
Cyndy Parr
 

More from Cyndy Parr (20)

Biodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscape
 
Public access to research results at USDA
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDA
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
 
Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.
 
TDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's WelcomeTDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's Welcome
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princeton
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Frontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life
 
Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...
 
Using and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute dataUsing and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute data
 
How the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute dataHow the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute data
 
The Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of LifeThe Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of Life
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
 
Species pages and portals
Species pages and portals Species pages and portals
Species pages and portals
 
Building EOL species pages
Building EOL species pagesBuilding EOL species pages
Building EOL species pages
 
Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...
 
Introduction to EOL.org for scientists
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientists
 
EOL and Science: Yes we can!
EOL and Science: Yes we can!EOL and Science: Yes we can!
EOL and Science: Yes we can!
 
EOL China Center status
EOL China Center statusEOL China Center status
EOL China Center status
 

Recently uploaded

How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 

Recently uploaded (20)

How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 

Ag Data Commons for AgBioData

  • 1. Ag Data Commons Cynthia Parr USDA ARS National Agricultural Library A platform to harness the power of Digital Agriculture
  • 2. Agricultural Data (Gather) Agricultural Knowledge (Transform) Agricultural Decision-making and Action (Translate) Why Ag Data Commons? Federal directives: Public access to open, machine-readable data
  • 3. Photo credit: Alpha Stock Images CC BY SA 3.0 USDA Enterprise Data Management USDA Public Access Policy ARS OSQR Procedure NIFA RFP, Terms and Conditions Cooperative agreements and contracts • Data Management Plan • Data to be made public in trusted repository within 30 months unless private, proprietary, or sensitive • Datasets to be cataloged at Ag Data Commons with appropriate identifiers
  • 4. PLOS ONE Data Availability: 20% Currently in Repositories U41A: How Safe and Persistent Is Your Research? AGU Fall Meeting, December 14, 2017 Kerry Kroffe, Director, Editorial Services, PLOS ”Enabling FAIR Data” initiative • Journal will require all data supporting the article be in a data citation and described in the Data Availability Statement • Editors and reviewers enforce policy • Ensure NO data is in the supplement • Repository selected by author must be FAIR-compliant • Journal community adopts and enforces FAIR principles Citation: Stall, S. (2017), Enabling findable, accessible, interoperable, and reusable data, Eos, 98, https://doi.org/10.1029/2018EO081907. Published on 15 September 2017.
  • 5. 22% 34% 2%2% 40% Required Encouraged Over half of top agricultural journals encourage or require open data n = 50 Where USDA researchers published in 2016 (thanks Jon Sears) 17% 78% 5% Yes No Undetermined Researchers have few options for open submission in domain- specific databases n = 235 (thanks Erin Antognoli) Where ag researchers deposit data in 2016
  • 6. The Concept • Discovery Interface • Catalog • APIs • Computational Tools • Data Analytic Tools Ag Data Commons Knowledge Base Data Producers Data Consumers •Publications •Patents •Grant Info. Federal Repository (I) University Repository (K) Industry Repository (N) Experiment Devices Farm Equipment UAVs, Sensors
  • 7. FAIR Data Principles Catalog and repository ecosystem Self-submission & harvesting Currently all open data, linked to literature Currently USDA-funded datasets and databases 11% of records have data in our repository – issuing DOIs Ag Data Commons https://data.nal.usda.gov/
  • 8. 8 Public interactive monthly platform statistics Registered Users Catalogued Datasets Downloads Citations
  • 10. 10 Ag Data Commons Topics NAL Thesaurus Terms https://agclass.nal.usda.gov
  • 15. 15 Harvesting metadata Photo: CC BY Tony Walmsley https://flic.kr/p/Ws9Nec
  • 16. Harvesting metadata in DKAN 16 E.g. NCBI Bioprojects USDA NAL Geodata USFS Research Data Archive E.g. Project Open Data, CSW, OAI-PMH
  • 17. Harvesting from distributed repositories • Avoids duplication of submission effort • More exposure = more impact • Distributes costs for storage • Keeps to specialized platforms for communities • Usually lacks funding information • Many lack DOIs • Many lack methodological detail • Challenging to match up with associated articles 17
  • 18. Making data machine readable, linked Promoting shared standards JSON, RDF Data dictionary CSV, API, DB, code Ag Data Commons data.nal.usda.gov frictionlessdata.ioscience
  • 19.
  • 20. NAL Resources Ag Data Commons https://data.nal.usda.gov Data Management Plans NOW REQUIRED BY MOST FUNDERS NAL provides online resources & will provide consultation on draft DMPs https://www.nal.usda.gov/ click on DATA 20
  • 21. DISCUSSION How can Ag Data Commons help AgBioData • Harvesting metadata? • DOI service for subsets or entire versions of datasets? • Compliance: linking data to grant and award numbers? • Linking data to citations (re-use)? • Discoverability? • Collecting consistent documentation and API information? • Transformation services? • Other? 21

Editor's Notes

  1. USDA is in the process of implementing new requirements for public access to federally funded data, and Ag Data Commons is a big part of implementing that. But even once we get past the gathering stage for all this diverse, scattered data, we want to be able to transform it into knowledge and translate it in ways that are actionable by society for decision.
  2. More journals are requiring the data associated with their published papers to be open. Top journals with ag content profiled (anything Jon wants to add about that?) PLOS ONE, Scientific Reports, Frontiers in Plant Science, Genome Announcements are top ag journals that require open data. Note: not every journal has a policy regarding open data one way or the other
  3. Whether the repositories are managed federally, by industry, or at universities, data should be managed in a place tailored to community needs However, there should be a central catalog, and the data owners are best suited to describing their data in that central catalog To be most useful and understandable we need rich metadata, but given the diversity of kinds of data it can’t be as high as the specialized community repositories need. NAL curators can help make sure the metadata as good as possible The platform should add value, by making available APIs, providing broadly useful tools for working with the data, and extracting the knowledge from the data and connecting it to publications and grant information
  4. Finally, our curators help programmers set up harvests. Given the wide variety of kinds of data, dsitributed platforms don’t use consistent standards so can’t do a distributed search If they are using standards, there are inevitably dialects of standards Programmers don’t understand, metadata librarians help, communicate with data owners
  5. We have a human readable page with some text descriptions, attached files, structured metadata We also promote a variety of ways to make things machine readable and actionable.
  6. How do we work with big data platforms? Just a comment that we are working with the SCINet team to coordinate policies and plans for what to do with big data when it is ready for release.