Global RDF Descriptors for
Germplasm Data
Vassilis Protonotarios
Agricultural Biotechnologist, PhD
Agro-Know, Greece
RDA 3rd Plenary Meeting, Dublin, Ireland
Agricultural Data Interoperability Group Meeting
Background
Connecting the pieces
• agINFRA Germplasm Working Group
• Agricultural Data Interoperability IG
• Germplasm Data Analysis
• Agricultural linked data layer
The agINFRA project
• A project funded under the FP7 program of EC
• Consortium with expertise on
– Technology / infrastructures
– Data / data management
Combined to facilitate agricultural data sharing
More info at:
www.aginfra.eu
The agINFRA project
• Aims to enhance the interoperability between
the agricultural data sources
– Data sharing by
• Metadata aggregation & linking data
• Design and deploy the linked ag-data framework
– Methodology for linking data
– Provide the infrastructure needed
• Both cloud- and grid-based services
• Tools, APIs etc.
agINFRA major data types
• Bibliographic
• Agri Statistics & Economics
• Educational
• Germplasm
• Soil data
• Profiles
• Raw data
• Other?
Focusing on germplasm
[Figure: data flow between germplasm databases]
• Local databases: Italian university, Italian research center, Chinese research center
• National databases: Italian, Chinese
• Aggregators: GENESYS, EURISCO, GBIF
The issue?
• Heterogeneity!
– Data types
– Data formats
– Data management workflows
– Standards used
– Metadata exposure options
– …
• Lack of connectivity with other data sources
The agINFRA Germplasm Working Group
The Germplasm Working Group
• Created in the context of the agINFRA project
• Initially included agINFRA stakeholders
– now expanded to host all stakeholders
• The group is NOT a group of experts on
germplasm data!
The scope of the agINFRA Germplasm WG
• Enable/enhance interoperability between
germplasm databases
– By developing the services for
• exchanging their data and
• delivering their data to other partners
• Focusing on three actions:
1. Identify
2. Organize & Reuse
3. Propose
agINFRA Germplasm WG objectives
• IDENTIFY: collect all information related to germplasm
data
– People/groups
– Namespaces (metadata, KOS)
– Standards
– Workflows
– Events
• ORGANIZE & REUSE: engage all stakeholders & available
resources, analyze existing standards, facilitate
collaboration
• PROPOSE: linked data framework to connect data
sources
– facilitate data sharing between germplasm data sources
Germplasm related information
• Data management workflows
• Metadata schemas
• Working groups in germplasm
• Events (for connecting stakeholders)
• KOS (ontologies, thesauri, vocabularies etc.)
• Data exposure capabilities
The Germplasm WG wiki
• Central point of reference
• Freely accessible (no login required)
http://wiki.aginfra.eu/index.php/Germplasm_Working_Group
Key outcomes of the group (1)
Dossier on Germplasm Information:
– Major programs
– Major information systems and services
– agINFRA germplasm data sources (CGRIS & CRA)
– Core standards for germplasm information
– Plant nomenclature, taxonomies and ontologies
– Plant genomic resources
– Related references and links
• Freely available from the Germplasm Group wiki
Key outcomes of the group (2)
Key outcomes of the group (3)
• Speakers from key players in the biodiversity
data field
– GBIF, EURISCO, GENESYS, CGIAR, EGFAR, CRA etc.
• Aimed to provide the basis for the linked
germplasm data framework
Existing work
DwC-G KOSs
• Germplasm Term Vocabulary
• A vocabulary of terms for describing and annotating
germplasm information resources
– http://purl.org/germplasm/germplasmTerm#TERM
• Germplasm Type vocabulary
• List of controlled values for some of the germplasm terms
– http://purl.org/germplasm/germplasmType#TYPE
• Germplasm ontology
• to digitize and provide persistent identifiers for the terms
contained within the PGR Descriptors publications
– http://purl.org/germplasm/ontology
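The URI patterns listed above can be sketched as two small helpers. This is only an illustration of how a term or type URI is formed from the namespaces on the slide; the helper names and the sample local names (`exampleTerm`, `exampleType`) are hypothetical placeholders, and real names should be taken from the published vocabularies.

```python
from urllib.parse import quote

# Base namespaces as given on the slide; the part after '#' is the local name.
GERMPLASM_TERM_NS = "http://purl.org/germplasm/germplasmTerm#"
GERMPLASM_TYPE_NS = "http://purl.org/germplasm/germplasmType#"


def term_uri(local_name: str) -> str:
    """Build the URI of an entry in the Germplasm Term Vocabulary."""
    return GERMPLASM_TERM_NS + quote(local_name, safe="")


def type_uri(local_name: str) -> str:
    """Build the URI of a controlled value in the Germplasm Type Vocabulary."""
    return GERMPLASM_TYPE_NS + quote(local_name, safe="")


# 'exampleTerm' and 'exampleType' are placeholders, not real vocabulary entries.
print(term_uri("exampleTerm"))
print(type_uri("exampleType"))
```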
DwC-G linked data
DwC-SW
• An ontology using Darwin Core terms to make it possible to
describe biodiversity resources in the Semantic Web.
https://code.google.com/p/darwin-sw
Bioversity Crop Descriptors
• Crop Descriptors
– Provide an international format and a universally understood
language for plant genetic resources data.
– They are targeted at farmers, curators, breeders, scientists
and users and facilitate the exchange and use of resources.
– Information includes such details as the plant's height,
flowering patterns and ancestral history.
• FAO/Bioversity Multi-crop Passport Descriptors (MCPD)
– Originally published in 2001
– widely used as the international standard to facilitate
germplasm passport information exchange.
– Now expanded to include emerging documentation needs;
the new version resulted from consultation with more than
300 scientists from 187 institutions in 87 countries.
Wheat descriptors
• Descriptors for wheat and Aegilops (1978)
• Descriptors for wheat (Revised) (1985)
– Not available as Linked Data
Methodology: towards the RDF
germplasm descriptors
Linked Data vocabularies
• Metadata vocabularies: Metadata sets, metadata element
sets
– they provide metadata elements to describe individual pieces of
information in the data sets.
– Example: Dublin Core is a vocabulary that prescribes the
property dc:date for the publishing date of a document.
• Value vocabularies (KOS): Controlled vocabularies,
authority data
– they provide sets of values for (some of) the metadata
elements.
– Example: AGROVOC provides a set of values for agricultural
topics that can be used as values for the dc:subject property.
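The difference between the two kinds of vocabulary can be sketched in a few lines: Dublin Core supplies the properties, AGROVOC supplies a value. The document URI and the date are made up for illustration, and the AGROVOC concept identifier is an assumption that should be checked against AGROVOC before reuse.

```python
# Dublin Core element set namespace (metadata vocabulary).
DC = "http://purl.org/dc/elements/1.1/"

# Hypothetical document URI, used only for this sketch.
DOCUMENT = "http://example.org/doc/1"

# Assumed AGROVOC concept URI (value vocabulary); verify the ID before reuse.
AGROVOC_CONCEPT = "http://aims.fao.org/aos/agrovoc/c_8373"


def describe(document: str, date: str, subject_uri: str) -> list[str]:
    """Return two N-Triples lines: a literal date and a URI-valued subject."""
    return [
        f'<{document}> <{DC}date> "{date}" .',
        f"<{document}> <{DC}subject> <{subject_uri}> .",
    ]


# The date is an illustrative value, not taken from the slides.
for line in describe(DOCUMENT, "2014-01-01", AGROVOC_CONCEPT):
    print(line)
```

The date stays a plain literal, while the subject is a URI that other data sets can link to.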
LOD guidelines (Berners-Lee, 2006)
1. “Use URIs as names for things”
– concepts / values in value vocabularies and classes and properties in description vocabularies, as well
as the vocabularies themselves, have to be identified by URIs.
2. “Use HTTP URIs so that people can look up those names”
– the URIs for concepts / values, classes and properties, as well as vocabularies, have to resolve as
HTTP URLs.
3. “When someone looks up a URI, provide useful information”
– the URLs for concepts, classes and properties, as well as vocabularies, have to return an HTML page
with useful information when requested by browsers, or RDF when requested by RDF software;
in addition, vocabularies should be available for querying through a SPARQL endpoint.
4. “Include links to other URIs, so that more things can be discovered”
– the URIs of concepts, classes and properties should whenever possible be linked to URIs in other
vocabularies, for instance as close match of another concept or sub-class of another class.
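The third guideline (HTML for browsers, RDF for RDF software) is usually met through HTTP content negotiation. The function below is a simplified sketch of that decision, not a full implementation: it ignores quality values (`q=`) and assumes the client lists its preferred media type first. The function name and the set of supported types are assumptions for illustration.

```python
# Media types this hypothetical vocabulary server can produce.
SUPPORTED = ("text/html", "text/turtle", "application/rdf+xml")


def choose_representation(accept_header: str) -> str:
    """Pick the media type to return for a vocabulary URI.

    Simplification: quality values are ignored and the first supported
    type named in the Accept header wins; HTML is the fallback.
    """
    for part in accept_header.split(","):
        media_type = part.split(";")[0].strip().lower()
        if media_type in SUPPORTED:
            return media_type
    return "text/html"


print(choose_representation("text/turtle, text/html;q=0.5"))
print(choose_representation("text/html,application/xhtml+xml"))
```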
Proposed methodology
1. Analyze metadata schemas & KOSs used to
describe germplasm resources
2. Define attributes & vocabularies that can be
used to expose germplasm resources in linked
data format.
3. Provide a set of recommendations for the
exposure of germplasm resources as linked data
4. Embed the recommendations in the data
infrastructure of agINFRA
– to allow the exposure of germplasm resources as
LOD.
The next steps
Application of the linked agricultural
data framework in germplasm
1. Definition of base schema
– Darwin Core for Germplasm to be used as base
schema
• Already available in SKOS
• Vocabularies published as linked data
– Germplasm Vocabularies
• Germplasm Term Vocabulary
• Germplasm Type Vocabulary
– Germplasm ontology
2. Publication of local classifications / lists for
germplasm as LOD KOSs
– if possible use DwC Types directly
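Publishing a local list as a LOD KOS amounts to giving each entry a URI and describing it with SKOS. A minimal sketch follows, assuming a made-up scheme URI and a made-up two-entry local list; only the SKOS and RDF namespaces are real.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
SKOS = "http://www.w3.org/2004/02/skos/core#"

# Hypothetical concept scheme URI and local classification, for illustration only.
SCHEME = "http://example.org/kos/sample-status"
LOCAL_LIST = {"W": "Wild", "L": "Landrace"}


def concept_triples(code: str, label: str, lang: str = "en") -> list[str]:
    """Describe one local list entry as a skos:Concept in N-Triples."""
    concept = f"<{SCHEME}/{code}>"
    # Escape backslashes and quotes so the label is a valid N-Triples literal.
    escaped = label.replace("\\", "\\\\").replace('"', '\\"')
    return [
        f"{concept} <{RDF_TYPE}> <{SKOS}Concept> .",
        f"{concept} <{SKOS}inScheme> <{SCHEME}> .",
        f'{concept} <{SKOS}prefLabel> "{escaped}"@{lang} .',
    ]


for code, label in LOCAL_LIST.items():
    print("\n".join(concept_triples(code, label)))
```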
Application of the linked agricultural
data framework in germplasm
3. Linking of terms in new KOSs to terms in existing
KOSs
– e.g. DwC Types, AGROVOC
4. Link CAAS and CRA germplasm records using
scientific name > AGROVOC
5. Collaboration with technical partners
– technical specifications on how to write procedures that extract the
relevant data from the database and "triplify" them (i.e. both serialize
them as RDF and use URIs instead of just strings whenever possible, also
linking to AGROVOC URIs when possible).
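The "triplify" step in item 5 can be sketched as follows: read rows from a database, serialize them as RDF, and replace strings with URIs where a mapping exists. Everything specific here is an assumption for illustration: the `accessions` table and its columns, the base URI for accessions, the use of `dcterms:subject` for the AGROVOC link, and the AGROVOC concept ID in the lookup table (to be verified against AGROVOC). Only the Darwin Core `scientificName` term URI is taken as given.

```python
import sqlite3

DWC_SCIENTIFIC_NAME = "http://rs.tdwg.org/dwc/terms/scientificName"
DCT_SUBJECT = "http://purl.org/dc/terms/subject"

# Hypothetical base URI for accession records.
BASE = "http://example.org/accession/"

# Hypothetical lookup from normalized scientific name to AGROVOC URI.
# The concept ID is a placeholder to verify against AGROVOC.
AGROVOC_BY_NAME = {
    "triticum aestivum": "http://aims.fao.org/aos/agrovoc/c_7951",
}


def triplify(conn: sqlite3.Connection) -> list[str]:
    """Serialize rows of the (assumed) 'accessions' table as N-Triples."""
    triples = []
    rows = conn.execute("SELECT id, scientific_name FROM accessions ORDER BY id")
    for acc_id, name in rows:
        subject = f"<{BASE}{acc_id}>"
        escaped = name.replace("\\", "\\\\").replace('"', '\\"')
        # Always keep the original string as a literal.
        triples.append(f'{subject} <{DWC_SCIENTIFIC_NAME}> "{escaped}" .')
        # Add a URI link when the name is known in the lookup table.
        concept = AGROVOC_BY_NAME.get(" ".join(name.lower().split()))
        if concept:
            triples.append(f"{subject} <{DCT_SUBJECT}> <{concept}> .")
    return triples


# Usage with an in-memory database holding two sample rows.
demo = sqlite3.connect(":memory:")
demo.execute("CREATE TABLE accessions (id TEXT, scientific_name TEXT)")
demo.executemany(
    "INSERT INTO accessions VALUES (?, ?)",
    [("A1", "Triticum aestivum"), ("A2", "Unknown species")],
)
print("\n".join(triplify(demo)))
```

Names without a match keep only the literal, so no information is lost when a URI cannot be assigned.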
…and more next steps (optional)
• Update the existing analysis with new data
• Collect new user requirements
• (re)define the mappings between metadata
schemas and KOSs (if needed)
• Fine-tune the linked data approach
Time plan
• June 2014: Germplasm vocabularies
– Metadata model: Darwin Core SW + DwC-G as the
reference
• Publish local classifications / lists for germplasm as LOD
KOSs (if possible use DwC Types directly)
• Link terms in new KOSs to terms in existing KOSs (e.g.
DwC Types, AGROVOC)
• Germplasm phenotypic values / classifications linked to
Phenotypic Ontology terms?
Time plan
• August 2014: Germplasm RDF
– Expose some RDF output and API access for
germplasm datasets (basic DwC RDF, essentially
basic passport descriptors).
– Mandatory data for interlinking: scientific name
OR AGROVOC term
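The "mandatory data for interlinking" rule can be expressed as a simple check before a record is exposed. The field names `scientificName` and `agrovocTerm` are hypothetical; an actual export would use whatever keys the source database provides.

```python
def can_interlink(record: dict) -> bool:
    """Return True if the record carries a scientific name OR an AGROVOC term."""
    name = (record.get("scientificName") or "").strip()
    term = (record.get("agrovocTerm") or "").strip()
    return bool(name or term)


# Illustrative records; the values are made up.
records = [
    {"id": "A1", "scientificName": "Triticum aestivum"},
    {"id": "A2", "agrovocTerm": "wheat"},
    {"id": "A3", "scientificName": "  "},
]
print([r["id"] for r in records if can_interlink(r)])
```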
Time plan
• October 2014: Consuming data from agINFRA
services and components
– Link CGRIS and CRA germplasm records using
scientific name > AGROVOC
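Linking "scientific name > AGROVOC" can be read as: map each record's scientific name to an AGROVOC concept, then connect records from the two sources that share a concept. The sketch below assumes simple dictionaries with hypothetical `id` and `scientificName` keys and a caller-supplied name-to-URI lookup; it does not reflect the actual CGRIS or CRA data models.

```python
def normalize(name: str) -> str:
    """Lower-case a scientific name and collapse whitespace."""
    return " ".join(name.lower().split())


def link_records(cgris: list[dict], cra: list[dict],
                 name_to_agrovoc: dict[str, str]) -> list[tuple[str, str, str]]:
    """Return (cgris_id, agrovoc_uri, cra_id) for records sharing a concept."""
    # Index CRA records by the AGROVOC concept of their scientific name.
    by_concept: dict[str, list[str]] = {}
    for rec in cra:
        concept = name_to_agrovoc.get(normalize(rec["scientificName"]))
        if concept:
            by_concept.setdefault(concept, []).append(rec["id"])

    links = []
    for rec in cgris:
        concept = name_to_agrovoc.get(normalize(rec["scientificName"]))
        for other_id in by_concept.get(concept, []):
            links.append((rec["id"], concept, other_id))
    return links
```

Records whose names are missing from the lookup are left unlinked rather than guessed.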
Source: http://verastic.com/social/why-do-people-not-say-thank-you.html
vprot@agroknow.gr

More Related Content

What's hot

Leveraging Crossref Funding Data
Leveraging Crossref Funding DataLeveraging Crossref Funding Data
Leveraging Crossref Funding Data
Crossref
 
The DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with ConfidenceThe DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with Confidence
Merce Crosas
 
Biostatistics & Bioinformatics
Biostatistics & BioinformaticsBiostatistics & Bioinformatics
Biostatistics & Bioinformaticsgumccomm
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
Carole Goble
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research Commons
Carole Goble
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Stuart Chalk
 
Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFSAaike De Wever
 
Overview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standardsOverview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standards
Susanna-Assunta Sansone
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
Araport
 
Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.
FAIRDOM
 
Introduction of Linked Data for Science
Introduction of Linked Data for ScienceIntroduction of Linked Data for Science
Introduction of Linked Data for Science
National Institute of Informatics (NII)
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014Susanna-Assunta Sansone
 
Biositemaps: A Framework for Biomedical Resource Discovery
Biositemaps: A Framework for Biomedical Resource DiscoveryBiositemaps: A Framework for Biomedical Resource Discovery
Biositemaps: A Framework for Biomedical Resource Discovery
Trish Whetzel
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark GreavesMediabistro
 
DMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary ToolsDMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary Tools
University of California Curation Center
 
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
dkNET
 
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
Susanna-Assunta Sansone
 
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
Susanna-Assunta Sansone
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
Stuart Chalk
 

What's hot (20)

Leveraging Crossref Funding Data
Leveraging Crossref Funding DataLeveraging Crossref Funding Data
Leveraging Crossref Funding Data
 
The DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with ConfidenceThe DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with Confidence
 
Biostatistics & Bioinformatics
Biostatistics & BioinformaticsBiostatistics & Bioinformatics
Biostatistics & Bioinformatics
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research Commons
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
 
Implementation of semantic network dictionary system
Implementation of semantic network dictionary system Implementation of semantic network dictionary system
Implementation of semantic network dictionary system
 
Data editors meeting at SEFS
Data editors meeting at SEFSData editors meeting at SEFS
Data editors meeting at SEFS
 
Overview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standardsOverview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standards
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
 
Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.
 
Introduction of Linked Data for Science
Introduction of Linked Data for ScienceIntroduction of Linked Data for Science
Introduction of Linked Data for Science
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
 
Biositemaps: A Framework for Biomedical Resource Discovery
Biositemaps: A Framework for Biomedical Resource DiscoveryBiositemaps: A Framework for Biomedical Resource Discovery
Biositemaps: A Framework for Biomedical Resource Discovery
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
 
DMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary ToolsDMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary Tools
 
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
 
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
 
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
High quality data publications: drives and needs - Sansone, BDebate, 12 Nov 2014
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 

Similar to Global RDF Descriptors for Germplasm Data

Agro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemAgro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystem
Nikos Manouselis
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
Carole Goble
 
IGAD_CODATA
IGAD_CODATAIGAD_CODATA
IGAD_CODATA
Hugo Besemer
 
Dataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabulariesDataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabularies
Valeria Pesce
 
eROSA Stakeholder WS1: Data discovery through federated dataset catalogues
eROSA Stakeholder WS1: Data discovery through federated dataset catalogueseROSA Stakeholder WS1: Data discovery through federated dataset catalogues
eROSA Stakeholder WS1: Data discovery through federated dataset catalogues
e-ROSA
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
Carole Goble
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
Carole Goble
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basics
OpenAIRE
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
Peter McQuilton
 
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)Gregor Hagedorn
 
Parr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagCyndy Parr
 
2009 11 icudl
2009 11 icudl2009 11 icudl
2009 11 icudl
Johannes Keizer
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
Karlsruhe Institute of Technology (KIT)
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...ICZN
 
The CIARD RINGValeri
The CIARD RINGValeriThe CIARD RINGValeri
The CIARD RINGValeri
CIARD Movement
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant Science
David Johnson
 
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
Peter McQuilton
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
Catherine Canevet
 

Similar to Global RDF Descriptors for Germplasm Data (20)

Agro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemAgro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystem
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
IGAD_CODATA
IGAD_CODATAIGAD_CODATA
IGAD_CODATA
 
Dataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabulariesDataset description: DCAT and other vocabularies
Dataset description: DCAT and other vocabularies
 
eROSA Stakeholder WS1: Data discovery through federated dataset catalogues
eROSA Stakeholder WS1: Data discovery through federated dataset catalogueseROSA Stakeholder WS1: Data discovery through federated dataset catalogues
eROSA Stakeholder WS1: Data discovery through federated dataset catalogues
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
 
2005 09 Dc Keynote
2005 09 Dc Keynote2005 09 Dc Keynote
2005 09 Dc Keynote
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basics
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
 
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)
 
Parr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbag
 
2009 11 icudl
2009 11 icudl2009 11 icudl
2009 11 icudl
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
 
The CIARD RINGValeri
The CIARD RINGValeriThe CIARD RINGValeri
The CIARD RINGValeri
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant Science
 
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 

More from Vassilis Protonotarios

Doing business with Open Data in agriculture
Doing business with Open Data in agricultureDoing business with Open Data in agriculture
Doing business with Open Data in agriculture
Vassilis Protonotarios
 
Legal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystemLegal interoperability in the fishery and marine data ecosystem
Legal interoperability in the fishery and marine data ecosystem
Vassilis Protonotarios
 
Agricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDAAgricultural Data Interest Group & Wheat Data Working Group of RDA
Agricultural Data Interest Group & Wheat Data Working Group of RDA
Vassilis Protonotarios
 
Agro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blogAgro-Know internal training: Using the Agro-Know blog
Agro-Know internal training: Using the Agro-Know blog
Vassilis Protonotarios
 
Introduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety DataIntroduction to Agriculture & Food Safety Data
Introduction to Agriculture & Food Safety Data
Vassilis Protonotarios
 
Seeding organic agriculture courses on Moodle: the agriMoodle Case
Seeding organic agriculture courses on Moodle:  the agriMoodle CaseSeeding organic agriculture courses on Moodle:  the agriMoodle Case
Seeding organic agriculture courses on Moodle: the agriMoodle Case
Vassilis Protonotarios
 
KOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyKOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet Ontology
Vassilis Protonotarios
 
Major germplasm data sources and referatories
Major germplasm data sources and referatoriesMajor germplasm data sources and referatories
Major germplasm data sources and referatories
Vassilis Protonotarios
 
agINFRA Germplasm metadata analysis
agINFRA Germplasm metadata analysisagINFRA Germplasm metadata analysis
agINFRA Germplasm metadata analysis
Vassilis Protonotarios
 
Designing Data Products
Designing Data ProductsDesigning Data Products
Designing Data Products
Vassilis Protonotarios
 
Using language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptionsUsing language services to enrich the LOs' descriptions
Using language services to enrich the LOs' descriptions
Vassilis Protonotarios
 
Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Using Agricultural Learning Portals in Developing Countries: The case of Orga...Using Agricultural Learning Portals in Developing Countries: The case of Orga...
Global RDF Descriptors for Germplasm Data

  • 1. Global RDF Descriptors for Germplasm Data Vassilis Protonotarios Agricultural Biotechnologist, PhD Agro-Know, Greece RDA 3rd Plenary Meeting, Dublin, Ireland Agricultural Data Interoperability Group Meeting
  • 3. Connecting the pieces agINFRA Germplasm Working Group Agricultural Data Interoperability IG Germplasm Data Analysis Agricultural linked data layer
  • 4. The agINFRA project • A project funded under the FP7 program of EC • Consortium with expertise on – Technology / infrastructures – Data / data management Combined to facilitate agricultural data sharing More info at: www.aginfra.eu
  • 5. The agINFRA project • Aims to enhance the interoperability between the agricultural data sources – Data sharing by • Metadata aggregation & linking data • Design and deploy the linked ag-data framework – Methodology for linking data – Provide the infrastructure needed • Both cloud- and grid-based services • Tools, APIs etc.
  • 6. agINFRA major data types agINFRA Bibliographic Agri Statistics & Economics Educational Germplasm Soil data Profiles Raw data Other?
  • 9. The issue? • Heterogeneity! – Data types – Data formats – Data management workflows – Standards used – Metadata exposure options – … • Lack of connectivity with other data sources
  • 10. The agINFRA Germplasm Working Group
  • 11. The Germplasm Working Group • Created in the context of the agINFRA project • Initially included agINFRA stakeholders – now expanded to host all stakeholders • The group is NOT a group of experts on germplasm data!
  • 12. The scope of the agINFRA Germplasm WG • Enable/enhance interoperability between germplasm databases – By developing the services for • exchanging their data and • delivering their data to other partners • Focusing on three actions: 1. Identify 2. Organize & Reuse 3. Propose
  • 13. agINFRA Germplasm WG objectives • IDENTIFY: collect all information related to germplasm data – People/groups – Namespaces (metadata, KOS) – Standards – Workflows – Events • ORGANIZE & REUSE: engage all stakeholders & available resources, analyze existing standards, facilitate collaboration • PROPOSE: a linked data framework to connect data sources and facilitate data sharing between germplasm data sources
  • 14. Germplasm related information data management workflows metadata schemas Working groups in germplasm Events (for connecting stakeholders) KOS (ontologies, thesauri, vocabularies etc.) Data exposure capabilities
  • 16. The Germplasm WG wiki • Central point of reference • Freely accessible (no login required) http://wiki.aginfra.eu/index.php/Germplasm_Working_Group
  • 17. Key outcomes of the group (1) Dossier on Germplasm Information: – Major programs – Major information systems and services – agINFRA germplasm data sources (CGRIS & CRA) – Core standards for germplasm information – Plant nomenclature, taxonomies and ontologies – Plant genomic resources – Related references and links • Freely available from the Germplasm Group wiki
  • 18. Key outcomes of the group (2)
  • 19. Key outcomes of the group (3) • Speakers from key players in the biodiversity data field – GBIF, EURISCO, GENESYS, CGIAR, EGFAR, CRA etc. • Aimed to provide the basis for the linked germplasm data framework
  • 21. DwC-G KOSs • Germplasm Term Vocabulary • A vocabulary of terms for describing and annotating germplasm information resources – http://purl.org/germplasm/germplasmTerm#TERM • Germplasm Type vocabulary • List of controlled values for some of the germplasm terms – http://purl.org/germplasm/germplasmType#TYPE • Germplasm ontology • to digitize and provide persistent identifiers for the terms contained within the PGR Descriptors publications – http://purl.org/germplasm/ontology
  • 23. DwC-SW • An ontology using Darwin Core terms to make it possible to describe biodiversity resources in the Semantic Web. https://code.google.com/p/darwin-sw
  • 24. Bioversity Crop Descriptors • Crop Descriptors – Provide an international format and a universally understood language for plant genetic resources data. – They are targeted at farmers, curators, breeders, scientists and users and facilitate the exchange and use of resources. – Information includes such details as the plant's height, flowering patterns and ancestral history. • FAO/Bioversity Multi-crop Passport Descriptors (MCPD) – Originally published in 2001 – widely used as the international standard to facilitate germplasm passport information exchange. – Now expanded to include emerging documentation needs, this new version resulted from consultation with more than 300 scientists from 187 institutions in 87 countries.
  • 25. Wheat descriptors • Descriptors for wheat and Aegilops (1978) • Descriptors for wheat (Revised) (1985) – Not available as Linked Data
  • 26. Methodology: towards the RDF germplasm descriptors
  • 27. Linked Data vocabularies • Metadata vocabularies: Metadata sets, metadata element sets – they provide metadata elements to describe individual pieces of information in the data sets. – Example: Dublin Core is a vocabulary that prescribes the property dc:date for the publishing date of a document. • Value vocabularies (KOS): Controlled vocabularies, authority data – they provide sets of values for (some of) the metadata elements. – Example: AGROVOC provides a set of values for agricultural topics that can be used as values for the dc:subject property.
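
The distinction on slide 27 can be illustrated with a short Python sketch that writes two N-Triples statements: one uses a metadata vocabulary property with a plain value (dc:date), the other uses a metadata property whose value comes from a value vocabulary (dc:subject pointing at an AGROVOC concept URI). This is only an illustration under stated assumptions: the document URI, the date, and the AGROVOC concept ID `c_0000` are placeholders, not real identifiers. Only the Dublin Core elements namespace is the published one.

```python
# Dublin Core elements namespace: the metadata vocabulary supplying the properties.
DC = "http://purl.org/dc/elements/1.1/"
# AGROVOC concept namespace: the value vocabulary supplying the values.
AGROVOC = "http://aims.fao.org/aos/agrovoc/"


def describe(doc_uri, date, agrovoc_concept_id):
    """Return N-Triples lines describing one document.

    dc:date carries a literal value; dc:subject carries a URI taken
    from the value vocabulary instead of a free-text string.
    """
    return [
        f'<{doc_uri}> <{DC}date> "{date}" .',
        f"<{doc_uri}> <{DC}subject> <{AGROVOC}{agrovoc_concept_id}> .",
    ]


# Placeholder document URI, date and concept ID (assumptions for illustration).
triples = describe("http://example.org/doc/1", "2014-01-01", "c_0000")
for line in triples:
    print(line)
```

Using a URI for the subject rather than a string is what later allows records from different sources to be matched on the same concept.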
  • 28. LOD guidelines (Berners-Lee, 2006) 1. “Use URIs as names for things” – concepts / values in value vocabularies and classes and properties in description vocabularies, as well as the vocabularies themselves, have to be identified by URIs. 2. “Use HTTP URIs so that people can look up those names” – the URIs for concepts / values, classes and properties, as well as vocabularies, have to resolve as HTTP URLs. 3. “When someone looks up a URI, provide useful information” – the URLs for concepts, classes and properties, as well as vocabularies, have to return an HTML page with useful information when requested by browsers, or RDF when requested by RDF software; in addition, vocabularies should be available for querying behind a SPARQL endpoint. 4. “Include links to other URIs, so that more things can be discovered” – the URIs of concepts, classes and properties should whenever possible be linked to URIs in other vocabularies, for instance as a close match of another concept or a sub-class of another class.
  • 29. Proposed methodology 1. Analyze metadata schemas & KOSs used to describe germplasm resources 2. Define attributes & vocabularies that can be used to expose germplasm resources in linked data format. 3. Provide a set of recommendations for the exposure of germplasm resources as linked data 4. Embed the recommendations in the data infrastructure of agINFRA – to allow the exposure of germplasm resources as LOD.
  • 31. Application of the linked agricultural data framework in germplasm 1. Definition of base schema – Darwin Core for Germplasm to be used as base schema • Already available in SKOS • Vocabularies published as linked data – Germplasm Vocabularies • Germplasm Term Vocabulary • Germplasm Type Vocabulary – Germplasm ontology 2. Publication of local classifications / lists for germplasm as LOD KOSs – if possible use DwC Types directly
  • 32. Application of the linked agricultural data framework in germplasm 3. Linking of terms in new KOSs to terms in existing KOSs – e.g. DwC Types, AGROVOC 4. Link CAAS and CRA germplasm records using scientific name > AGROVOC 5. Collaboration with technical partners – technical specifications on how to write procedures that extract the relevant data from the database and "triplify" them (i.e. both serialize them as RDF and use URIs instead of just strings whenever possible, also linking to AGROVOC URIs when possible).
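
The "triplify" step on slide 32 (serialize as RDF and use URIs instead of strings where possible) could look roughly like the Python sketch below. It is a minimal illustration, not the agINFRA procedure: the record fields (`id`, `scientific_name`), the base URI, the lookup table and the AGROVOC concept ID `c_0000` are all hypothetical. The Darwin Core term `scientificName` and the Dublin Core terms property `subject` are existing terms, but choosing them here is an assumption.

```python
from urllib.parse import quote

DWC = "http://rs.tdwg.org/dwc/terms/"          # Darwin Core terms namespace
DCT_SUBJECT = "http://purl.org/dc/terms/subject"

# Hypothetical lookup: scientific name -> AGROVOC concept URI (placeholder ID).
AGROVOC_BY_NAME = {
    "Triticum aestivum": "http://aims.fao.org/aos/agrovoc/c_0000",
}


def literal(value):
    """Serialize a string as an N-Triples literal, escaping special characters."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def triplify(record, base="http://example.org/accession/"):
    """Turn one database row (a dict) into a list of N-Triples lines."""
    # Mint a URI for the accession; percent-encode the local identifier.
    subject = f"<{base}{quote(record['id'], safe='')}>"
    name = record["scientific_name"]
    # The name itself is kept as a literal.
    triples = [f"{subject} <{DWC}scientificName> {literal(name)} ."]
    # When the name is known to the lookup, also link to the concept URI.
    concept = AGROVOC_BY_NAME.get(name)
    if concept is not None:
        triples.append(f"{subject} <{DCT_SUBJECT}> <{concept}> .")
    return triples


for line in triplify({"id": "ACC 1", "scientific_name": "Triticum aestivum"}):
    print(line)
```

Records whose scientific name is missing from the lookup still yield a valid triple with the literal name, so nothing is lost when no AGROVOC match exists.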
  • 33. …and more next steps (optional) • Update the existing analysis with new data • Collect new user requirements • (re)define the mappings between metadata schemas and KOSs (if needed) • Fine-tune the linked data approach
  • 35. Time plan • June 2014: Germplasm vocabularies – Metadata model: Darwin Core SW + DwC-G as the reference • Publish local classifications / lists for germplasm as LOD KOSs (if possible use DwC Types directly) • Link terms in new KOSs to terms in existing KOSs (e.g. DwC Types, AGROVOC) • Germplasm phenotypic values / classifications linked to Phenotypic Ontology terms?
  • 36. Time plan • August 2014: Germplasm RDF – Expose some RDF output and API access for germplasm datasets (basic DwC RDF, essentially basic passport descriptors). – Mandatory data for interlinking: scientific name OR AGROVOC term
  • 37. Time plan • October 2014: Consuming data from agINFRA services and components – Link CGRIS and CRA germplasm records using scientific name > AGROVOC
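
The interlinking planned in the last slide (scientific name > AGROVOC) can be sketched as follows. This is a hedged illustration only: the record layout, the identifiers, the name-to-concept table and the concept ID `c_0000` are invented for the example, and the normalization is deliberately naive (case and whitespace only).

```python
from collections import defaultdict


def normalize(name):
    """Lower-case and collapse whitespace so trivially different spellings match."""
    return " ".join(name.lower().split())


def link_by_concept(cgris, cra, name_to_concept):
    """Pair records from two sources whose names resolve to the same concept URI."""
    by_concept = defaultdict(list)
    for rec in cgris:
        concept = name_to_concept.get(normalize(rec["scientific_name"]))
        if concept:
            by_concept[concept].append(rec["id"])

    links = []
    for rec in cra:
        concept = name_to_concept.get(normalize(rec["scientific_name"]))
        # .get() avoids creating empty entries for unmatched concepts.
        for other_id in by_concept.get(concept, []):
            links.append((other_id, rec["id"], concept))
    return links


# Hypothetical sample data; keys of the table are already normalized names.
table = {"triticum aestivum": "http://aims.fao.org/aos/agrovoc/c_0000"}
cgris = [
    {"id": "CGRIS-1", "scientific_name": "Triticum  aestivum"},
    {"id": "CGRIS-2", "scientific_name": "Oryza sativa"},
]
cra = [
    {"id": "CRA-9", "scientific_name": "triticum aestivum"},
    {"id": "CRA-10", "scientific_name": "Zea mays"},
]

links = link_by_concept(cgris, cra, table)
for left, right, concept in links:
    print(left, right, concept)
```

Matching on the concept URI rather than on the raw name string means that synonyms can be linked later simply by adding them to the lookup table.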

Editor's Notes

  1. Heterogeneous data types and formats
  2. OAI-PMH harvesting is not an option in the case of germplasm data
  3. https://code.google.com/p/darwincore-germplasm/wiki/ToC: http://purl.org/germplasm/vocabulary