The Ag Data Commons is a platform for aggregating, cataloging, and sharing agricultural data. It harvests metadata from various federal and university repositories to make data more discoverable without duplication of submission efforts. Currently it catalogs open datasets and links them to related literature. In the future, it aims to harvest more funding information and methodological details to better link datasets to associated research articles and grants. The goal is to organize agricultural data according to shared standards to make it fully machine-readable and reusable to support further research and decision-making.
Ag Data Commons: Adding Value to open agricultural research dataCyndy Parr
A talk presented on 30 September 2013 at the Biodiversity Information Standards (Taxonomic Databases Working Group TDWG) annual meeting in Nairobi, Kenya
Sharing Agricultural Events Information: When and where is that workshop?Gauri Salokhe
In the last few years a strong need has emerged for a standard way to interchange various types of information, such as on organizations, projects, experts, events and news, in the agricultural community. This paper focuses on the metadata set for events, the Agricultural Events Application Profile (Ag-Events AP), created specifically to enhance description, exchange and reuse of information on events. The Ag-Events AP provides a minimum interoperability layer through which information about upcoming events related to agriculture can be described, shared and reused. The Ag-Events AP was developed by FAO, in collaboration with its partners, the Global Forum on Agricultural Research (GFAR) and Global Forest Information Service (GFIS), to offer a “minimum” set of metadata elements necessary to share quality descriptions about events. This paper talks about the work done on creating the AP, its use in various applications and the next steps.
This year the CGIAR Platform for Big Data in Agriculture will award up to 1 million USD in grants to two categories of Inspire Challenge winners. We hosted a Q&A webinar to address potential applicants' questions.
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Sky Bristol
Presentation on one of the strategic themes being considered for a U.S. Government Big Data R&D strategy - https://www.nitrd.gov/bigdata/rfi/02102014.aspx.
Ag Data Commons: Adding Value to open agricultural research dataCyndy Parr
A talk presented on 30 September 2013 at the Biodiversity Information Standards (Taxonomic Databases Working Group TDWG) annual meeting in Nairobi, Kenya
Sharing Agricultural Events Information: When and where is that workshop?Gauri Salokhe
In the last few years a strong need has emerged for a standard way to interchange various types of information, such as on organizations, projects, experts, events and news, in the agricultural community. This paper focuses on the metadata set for events, the Agricultural Events Application Profile (Ag-Events AP), created specifically to enhance description, exchange and reuse of information on events. The Ag-Events AP provides a minimum interoperability layer through which information about upcoming events related to agriculture can be described, shared and reused. The Ag-Events AP was developed by FAO, in collaboration with its partners, the Global Forum on Agricultural Research (GFAR) and Global Forest Information Service (GFIS), to offer a “minimum” set of metadata elements necessary to share quality descriptions about events. This paper talks about the work done on creating the AP, its use in various applications and the next steps.
This year the CGIAR Platform for Big Data in Agriculture will award up to 1 million USD in grants to two categories of Inspire Challenge winners. We hosted a Q&A webinar to address potential applicants' questions.
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Sky Bristol
Presentation on one of the strategic themes being considered for a U.S. Government Big Data R&D strategy - https://www.nitrd.gov/bigdata/rfi/02102014.aspx.
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE
Openness is the success factor for EOSC. OpenAIRE has been working in delivering an open access scholarly communication in Europe for the past 10 years and we now present how our work fits into the EOSC core developments
Pangiota Koltsida & Dimitris Gavrilis' presentation at the RDA Europe - BlueBRIDGE datathon on fisheries and aquaculture, 15-16 June 2017, Heraklion, Crete
A Lined Data Approach to Interoperability between Biomedical Resource Invento...Trish Whetzel
Overview of Resource Representation Coordination efforts to coordinate the representation of resources from Biositemaps, eagle-i, and the Neuroscience Information Framework.
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17ARDC
Margie Smith
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
Overview of the Bioschemas Community Activity providing:
- Description of what Bioschemas is
- Details of adoption of Bioschemas
- Benefits of deploying Bioschemas
Slides presented as the introduction to the Bioschemas Tutorial delivered at ECCB 2018
Building data networks: exploring trust and interoperability between authoris...Repository Fringe
Building data networks: exploring trust and interoperability between authoris, repositories and journals. Varsha Khodiyar , Scientific Data; Neil Chue Hong, Journal of Open Research Software; Rachael Kotarski, DataCite, Peter McQuilton, BioSharing; Reza Salek, Metabolights. At Repository Fringe 2015
Big Data is today: key issues for big data - Dr Ben EvansARDC
Presentation in Canberra: Preparing for your data future seminar
Fri 22 July 2016
Big Data is today: key issues for big data
Dr Ben Evans
NCI - Associate Director
Research Engagements and Initiatives
Libraries & Research Data Management for CO Alliance of Resrch LibrariesCarly Strasser
Keynote presentation for the Colorado Alliance of Research Libraries 2014 Research Data Management Conference, 11 July 2014. Focuses on why data management and sharing is important, and the role of libraries.
The FAIR Cookbook poster, as presented at the ELIXIR-UK Node and the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
Research Integrity Advisor and Data ManagementARDC
Dr Paul Wong from the Australian Research Data Commons presented at the University of Technology Sydney's RIA Data Management Workshop on 21 June 2018. In partnership with the Australian Research Council, the National Health and Medical Research Council, the Australian Research Data Commons, and RMIT University, this is part of a national workshop series in data management for research integrity advisors.
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE
Openness is the success factor for EOSC. OpenAIRE has been working in delivering an open access scholarly communication in Europe for the past 10 years and we now present how our work fits into the EOSC core developments
Pangiota Koltsida & Dimitris Gavrilis' presentation at the RDA Europe - BlueBRIDGE datathon on fisheries and aquaculture, 15-16 June 2017, Heraklion, Crete
A Lined Data Approach to Interoperability between Biomedical Resource Invento...Trish Whetzel
Overview of Resource Representation Coordination efforts to coordinate the representation of resources from Biositemaps, eagle-i, and the Neuroscience Information Framework.
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17ARDC
Margie Smith
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
Overview of the Bioschemas Community Activity providing:
- Description of what Bioschemas is
- Details of adoption of Bioschemas
- Benefits of deploying Bioschemas
Slides presented as the introduction to the Bioschemas Tutorial delivered at ECCB 2018
Building data networks: exploring trust and interoperability between authoris...Repository Fringe
Building data networks: exploring trust and interoperability between authoris, repositories and journals. Varsha Khodiyar , Scientific Data; Neil Chue Hong, Journal of Open Research Software; Rachael Kotarski, DataCite, Peter McQuilton, BioSharing; Reza Salek, Metabolights. At Repository Fringe 2015
Big Data is today: key issues for big data - Dr Ben EvansARDC
Presentation in Canberra: Preparing for your data future seminar
Fri 22 July 2016
Big Data is today: key issues for big data
Dr Ben Evans
NCI - Associate Director
Research Engagements and Initiatives
Libraries & Research Data Management for CO Alliance of Resrch LibrariesCarly Strasser
Keynote presentation for the Colorado Alliance of Research Libraries 2014 Research Data Management Conference, 11 July 2014. Focuses on why data management and sharing is important, and the role of libraries.
The FAIR Cookbook poster, as presented at the ELIXIR-UK Node and the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
Research Integrity Advisor and Data ManagementARDC
Dr Paul Wong from the Australian Research Data Commons presented at the University of Technology Sydney's RIA Data Management Workshop on 21 June 2018. In partnership with the Australian Research Council, the National Health and Medical Research Council, the Australian Research Data Commons, and RMIT University, this is part of a national workshop series in data management for research integrity advisors.
Webinar presentation by Cyndy Parr and Erin Antognoli hosted by Hunger Solutions Institute (HSI) and Presidents United to Solve Hunger (PUSH) at Auburn University on April 25, 2019.
Slides from Thursday 2nd August 2018 - Data in the Scholarly Communications Life Cycle Course which is part of the FORCE11 Scholarly Communications Institute.
Presenter - Natasha Simons
Researchers require infrastructures that ensure a maximum of accessibility, stability and reliability to facilitate working with and sharing of research data. Such infrastructures are being increasingly summarised under the term Research Data Repositories (RDR). The project re3data.org – Registry of Research Data Repositories – began to index research data repositories in 2012 and offers researchers, funding organisations, libraries and publishers an overview of the heterogeneous research data repository landscape. In December 2014 re3data.org listed more than 1,030 research data repositories, which are described in detail using the re3data.org schema (http://dx.doi.org/10.2312/re3.003). Information icons help researchers to identify easily an adequate repository for the storage and reuse of their data. This talk describes the heterogeneous RDR landscape and presents a typology of institutional, disciplinary, multidisciplinary and project-specific RDR. Further, it outlines the features of re3data. org and it shows current developments for integration into data management planning tools and other services.
By the end of 2015 re3data.org and Databib (Purdue University, USA) will merge their services, which will then be managed under the auspices of DataCite. The aim of this merger is to reduce duplication of effort and to serve the research community better with a single, sustainable registry of research data repositories. The talk will present this organisational development as a best practice example for the development of international research information services.
This presentation was provided by Mark Hahnel of Figshare, during the NISO Hot Topic Virtual Conference "Building Access, Openness, and Sharing." The event was held on Wednesday, September 28, 2022.
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
Heinz Pampel | GFZ German Research Centre for Geosciences, LIS
Maxi Kindling | Humboldt-Universität zu Berlin, Berlin School of Library and Information Science Frank Scholze | Karlsruhe Institute of Technology, KIT Library
RDA-Deutschland-Treffen 2015| Potsdam, November 26, 2015
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET
Abstract
In this presentation, Susan Gregurick, Ph.D., Associate Director of Data Science and Director, Office of Data Science Strategy at the National Institutes of Health, will share the NIH’s vision for a modernized, integrated FAIR biomedical data ecosystem and the strategic roadmap that NIH is following to achieve this vision. Dr. Gregurick will highlight projects being implemented by team members across the NIH’s 27 institutes and centers and will ways that industry, academia, and other communities can help NIH enable a FAIR data ecosystem. Finally, she will weave in how this strategy is being leveraged to address the COVID-19 pandemic.
Presenter: Susan Gregurick, Ph.D., Associate Director of Data Science and Director, Office of Data Science Strategy at the National Institutes of Health
dkNET Webinar Information: https://dknet.org/about/webinar
Overview of the role of FAIRsharing and a dedicated Collection of data resources (platforms and registries that collect, harmonize, and share participant-level clinical-epidemiological, OMICs, and/or imaging data) for the COVID-19 Clinical Research Coalition and The Tropical Disease Research initiatives: https://coronavirus.tghn.org/research-resources/data-sharing-covid-19
2013 DataCite Summer Meeting - Out of Cite, Out of Mind: Report of the CODATA...datacite
2013 DataCite Summer Meeting - Making Research better
DataCite. Co-sponsored by CODATA.
Thursday, 19 September 2013 at 13:00 - Friday, 20 September 2013 at 12:30
Washington, DC. National Academy of Sciences
http://datacite.eventbrite.co.uk/
EOSC-Life AGM 2022 Publishing FAIR RI data resources in EOSC.pdfAllyson Lister
FAIRsharing uses collections to create community-specific views of the resource descriptions we store and the relationships among them. This talk describes the work by EOSC-Life Work Package 1 to update and enrich the EOSC-Life collection, which groups together all resources created by EOSC-Life partners. Part of the EOSC-Life AGM 2022 (https://www.eosc-life.eu/news/3rd-agm/).
Why we care about research data? Why we share?Richard Ferrers
An introduction to why ANDS cares about research data. ANDS, the Australian National Data Service, encourages researchers to share data. This presentation explains why.
Biodiversity informatics and the agricultural data landscapeCyndy Parr
Introductory talk of a symposium on Agrobiodiversity informatics at the 2016 annual meeting of the Biodiversity Information Standards. Begins with an overview of the symposium and its speakers, and then launches into my talk.
Public access to research results at USDACyndy Parr
An update on public access activities at the National Agricultural Library and next steps, presented 11 January 2017 at the Earth Science Information Partners (ESIP) meeting in Bethesda, Maryland.
Preparing for data-intensive science across domains.Cyndy Parr
Presented at American Institute for Biological Sciences council meeting 8 December 2015. I focus on anecdotes from multiple domains on the kinds of skills and trajectories that empower scientists at multiple levels to become engaged in data-intensive science as data wranglers or tool-builders. Even if they don't have lots of funding from NSF or NIH.
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK Cyndy Parr
Talk presented at iEvoBio 2014 conference in Raleigh, North Carolina. Though there's a similar title and overlap with the talk I posted last week, there is new material here especially geared towards an informatics crowd savvy in the tools and technology.
Frontiers of discovery with Encyclopedia of LifeCyndy Parr
Presented at the National Museum of Natural History, Smithsonian Institution 18 June 2014
Describes, among other things, development of the TraitBank repository of species attributes, and the use of EOL and TraitBank in scientific research.
Practical interoperability across semantic stores of data for ecological, tax...Cyndy Parr
Presented at the Biodiversity Information Standards (Taxonomic Databases Working Group) 2013 meeting in Florence, Italy on 31 October 2013. Essentially, an introduction to aspects of the back end of the new trait repository of Encyclopedia of Life.
Using and extending Darwin Core for structured attribute dataCyndy Parr
Presented at the Biodiversity Information Standards (Taxonomic Databases Working Group) 2013 meeting in Florence, Italy on 29 October 2013. Essentially, an introduction to the new trait repository of Encyclopedia of Life.
A talk presented January 19, 2013 in the Indo-US Joint Workshop on Biodiversity Informatics at the Ashoka Trust for Research in Ecology and the Environment in Bangalore, India.
A talk presented January 20, 2013 in the Indo-US Joint Workshop on Biodiversity Informatics at the Ashoka Trust for Research in Ecology and the Environment in Bangalore, India.
A talk given at the Semantic Reasoning workshop held at the National Museum of Natural History September 6, 2012. The audience included computer scientists and biological scientists interested in using EOL for their research.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
3. Photo credit: Alpha Stock Images CC BY SA 3.0
USDA Enterprise Data Management
USDA Public Access Policy
ARS OSQR Procedure
NIFA RFP, Terms and Conditions
Cooperative agreements and contracts
• Data Management Plan
• Data to be made public in trusted
repository within 30 months unless
private, proprietary, or sensitive
• Datasets to be cataloged at Ag Data
Commons with appropriate identifiers
4. PLOS ONE Data Availability:
20% Currently in Repositories
U41A: How Safe and Persistent Is Your Research?
AGU Fall Meeting, December 14, 2017
Kerry Kroffe, Director, Editorial Services, PLOS
”Enabling FAIR Data” initiative
• Journal will require all data
supporting the article be in a data
citation and described in the Data
Availability Statement
• Editors and reviewers enforce
policy
• Ensure NO data is in the
supplement
• Repository selected by author must
be FAIR-compliant
• Journal community adopts and
enforces FAIR principles
Citation: Stall, S. (2017), Enabling findable, accessible, interoperable, and
reusable data, Eos, 98, https://doi.org/10.1029/2018EO081907. Published on 15
September 2017.
5. 22%
34%
2%2%
40%
Required
Encouraged
Over half of top agricultural
journals encourage or require
open data
n = 50
Where USDA researchers published in 2016
(thanks Jon Sears)
17%
78%
5%
Yes
No
Undetermined
Researchers have few
options for open
submission in domain-
specific databases
n = 235 (thanks Erin Antognoli)
Where ag researchers deposit data in 2016
6. The Concept
• Discovery Interface
• Catalog
• APIs
• Computational Tools
• Data Analytic Tools
Ag Data Commons
Knowledge Base
Data Producers Data Consumers
•Publications
•Patents
•Grant Info.
Federal
Repository
(I)
University
Repository
(K)
Industry
Repository
(N)
Experiment Devices
Farm Equipment
UAVs, Sensors
7. FAIR Data Principles
Catalog and repository
ecosystem
Self-submission &
harvesting
Currently all open data,
linked to literature
Currently USDA-funded
datasets and databases
11% of records have data
in our repository –
issuing DOIs
Ag Data Commons https://data.nal.usda.gov/
16. Harvesting metadata in DKAN
16
E.g. NCBI Bioprojects
USDA NAL Geodata
USFS Research Data Archive
E.g. Project Open Data,
CSW, OAI-PMH
17. Harvesting from distributed repositories
• Avoids duplication of submission effort
• More exposure = more impact
• Distributes costs for storage
• Keeps to specialized platforms for communities
• Usually lacks funding information
• Many lack DOIs
• Many lack methodological detail
• Challenging to match up with associated articles
17
18. Making data machine readable, linked
Promoting shared standards
JSON, RDF
Data dictionary
CSV, API, DB, code
Ag Data Commons data.nal.usda.gov
frictionlessdata.ioscience
19.
20. NAL Resources
Ag Data Commons
https://data.nal.usda.gov
Data Management Plans
NOW REQUIRED BY MOST FUNDERS
NAL provides online resources & will
provide consultation on draft DMPs
https://www.nal.usda.gov/
click on DATA
20
21. DISCUSSION
How can Ag Data Commons help AgBioData
• Harvesting metadata?
• DOI service for subsets or entire versions of datasets?
• Compliance: linking data to grant and award numbers?
• Linking data to citations (re-use)?
• Discoverability?
• Collecting consistent documentation and API
information?
• Transformation services?
• Other?
21
Editor's Notes
USDA is in the process of implementing new requirements for public access to federally funded data, and Ag Data Commons is a big part of implementing that.
But even once we get past the gathering stage for all this diverse, scattered data, we want to be able to transform it into knowledge and translate it in ways that are actionable by society for decision.
More journals are requiring the data associated with their published papers to be open.
Top journals with ag content profiled (anything Jon wants to add about that?)
PLOS ONE, Scientific Reports, Frontiers in Plant Science, Genome Announcements are top ag journals that require open data.
Note: not every journal has a policy regarding open data one way or the other
Whether the repositories are managed federally, by industry, or at universities, data should be managed in a place tailored to community needs
However, there should be a central catalog, and the data owners are best suited to describing their data in that central catalog
To be most useful and understandable we need rich metadata, but given the diversity of kinds of data it can’t be as high as the specialized community repositories need.
NAL curators can help make sure the metadata as good as possible
The platform should add value, by making available APIs, providing broadly useful tools for working with the data, and extracting the knowledge from the data and connecting it to publications and grant information
Finally, our curators help programmers set up harvests.
Given the wide variety of kinds of data, dsitributed platforms don’t use consistent standards so can’t do a distributed search
If they are using standards, there are inevitably dialects of standards
Programmers don’t understand, metadata librarians help, communicate with data owners
We have a human readable page with some text descriptions, attached files, structured metadata
We also promote a variety of ways to make things machine readable and actionable.
How do we work with big data platforms?
Just a comment that we are working with the SCINet team to coordinate policies and plans for what to do with big data when it is ready for release.