RDA Wheat Data Interoperability
Cookbook and last developments
9th March 2015, San Diego
The WDI working group in brief
● Endorsement: March 2014
● Members: about 30 members, 15 of them active (wheat scientists, data and metadata technologists)
● The goal: contribute to the improvement of wheat-related data interoperability by
  ● Building a common interoperability framework (metadata, data formats and vocabularies)
  ● Providing guidelines for describing, representing and linking wheat-related data
Initial plans
● Deliverables
  ● A report on the survey of existing standards
  ● A cookbook for the wheat data managers community, with guidelines on which data formats, metadata, vocabularies and ontologies to use to describe, represent and link different types of wheat data
  ● A library of linked vocabularies and ontologies in machine-readable formats that comply with Linked Data standards
  ● A prototype that showcases the interoperability gains
Where we are
Data type | Data formats currently used (standardized, tool specific or non standardized) | Recommendations
SNPs | VCF, BAM/SAM, BED, VarScan, VEP | VCF files generated from the IWGSC survey sequences, plus metadata about the VCF files to enrich the information about the SNPs
Genome annotations | GenBank Flat File, General Feature Format (GFF), EMBL | GFF3, plus specifications regarding the description of specific columns
Germplasm | MCPD, ABCD, Darwin Core, Darwin Core Germplasm, GRIN-Global, tabulated | MCPD
Gene expression | Many format standards laid out by repositories such as NCBI (GEO) and EBI ArrayExpress | Existing format standards laid out by repositories such as NCBI (GEO) and EBI ArrayExpress, plus ENA
Physical maps | GFF, CMap, FPC | GFF3
Genetic maps | CMap, gnpmap | GFF3 (to be confirmed)
Phenotypes | Drops, ped, ISA-Tab, Ephesis, tabulated | ISA-Tab
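The recommendation for genome annotations hinges on how the GFF3 columns are described, so a small illustration of those columns may help. The sketch below splits one GFF3 feature line into the nine tab-separated columns defined by the GFF3 specification and decodes the attributes column. The function name `parse_gff3_line` and the example line are assumptions made for illustration; the line is not taken from any real wheat annotation.

```python
from urllib.parse import unquote

# The nine columns of a GFF3 feature line, in order.
GFF3_COLUMNS = ("seqid", "source", "type", "start", "end",
                "score", "strand", "phase", "attributes")


def parse_gff3_line(line):
    """Split one GFF3 feature line into a dict keyed by column name."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(GFF3_COLUMNS):
        raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
    record = dict(zip(GFF3_COLUMNS, fields))
    # Coordinates are 1-based integers.
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    # Column 9 holds semicolon-separated tag=value pairs with URL-escaped values.
    attributes = {}
    if record["attributes"] != ".":
        for pair in record["attributes"].split(";"):
            if pair:
                tag, _, value = pair.partition("=")
                attributes[unquote(tag)] = unquote(value)
    record["attributes"] = attributes
    return record


# Hypothetical feature line, made up for this example.
example = "chr1A\texample_source\tgene\t1000\t2500\t.\t+\t.\tID=gene0001;Name=ExampleGene"
feature = parse_gff3_line(example)
print(feature["type"], feature["start"], feature["end"], feature["attributes"]["ID"])
```

Agreeing on which tags appear in the attributes column, and what they mean, is the kind of column-level specification the recommendation refers to.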
Examples of use cases
Title: Searching for germplasm with specific traits
Description: Example of searching for germplasm with specific traits (tagged with ontology terms?)
Data types: Germplasm, Phenotype
Challenges:
● Metadata is very important (standardized format)
● Association of genes to traits, linked to germplasm and marker information
● Need for quality controls: how confident are you of the data source?
● Provenance of the germplasm: pedigree, ownership
● Standard system for tracking germplasm and names

Title: Identification of wheat genes that control root growth
Description: Requires annotated genes (Gene Ontology, Pfam and other functional annotation)
Data types: Genomic annotations? Gene location? (IWGS-SS ID or MIPS HCS link)
Challenges:
● Mapping between wheat genes and orthologs from other species (deducing function by sequence similarity)
● Access to RNA-Seq data (genes that are not expressed in roots may be irrelevant)
● Mapping of wheat genes to information on their function based on literature

Title: Query on trial data associated with varieties
Description: Search wheat varieties with distribution maps, production figures, performance in wheat mega-environments and associated projects worldwide, plus layers of climatic data on specific wheat production areas and disease prevention information.
Data types: Phenotypic data, GIS data (wheat economy/production data)
Challenges: Phenotypic data should be linked to GIS data. Using keywords or ontology terms, a system or tool should be able to pull such information from the different websites and systems developed by the wheat community.
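To make the first use case concrete, here is a minimal sketch of searching germplasm records that have been tagged with ontology terms. Everything in it is hypothetical: the `Germplasm` class, the accession numbers, the `TRAIT:` identifiers and the source names are invented for illustration and do not come from any real ontology or genebank.

```python
from dataclasses import dataclass, field


@dataclass
class Germplasm:
    accession: str                                   # identifier used for tracking
    name: str
    trait_terms: set = field(default_factory=set)    # ontology term IDs
    source: str = ""                                 # provenance of the record


def find_by_traits(records, wanted_terms):
    """Return the records tagged with every term in wanted_terms."""
    wanted = set(wanted_terms)
    return [r for r in records if wanted <= r.trait_terms]


# Hypothetical records and term identifiers.
records = [
    Germplasm("ACC-001", "Line A", {"TRAIT:0001", "TRAIT:0002"}, "genebank-x"),
    Germplasm("ACC-002", "Line B", {"TRAIT:0002"}, "genebank-y"),
    Germplasm("ACC-003", "Line C", {"TRAIT:0001", "TRAIT:0002", "TRAIT:0003"}, "genebank-x"),
]

matches = find_by_traits(records, ["TRAIT:0001", "TRAIT:0002"])
print([m.accession for m in matches])
```

The search only works if every source tags traits with the same term identifiers, which is why the challenges above stress standardized metadata and a common tracking system for germplasm.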
Wheat related ontologies & vocabularies survey
● Assess the level of visibility and interoperability of wheat-related vocabularies and ontologies
  ● Is the vocabulary/ontology updated regularly?
  ● What license and/or copyright is used?
  ● Is the vocabulary/ontology part of any ontology communities or listing services?
  ● Is the vocabulary/ontology used or implemented in any database/repository?
  ● Does the vocabulary/ontology interlink and/or map to other vocabularies and ontologies?
  ● Does the vocabulary/ontology
● Identify the domain covered by the ontologies and vocabularies
● Refine the cookbook
  ● Collect more interoperability use cases
  ● Collect some technical details
Wheat related ontologies & vocabularies survey
The wheat-related BioPortal lets users search for terms across multiple ontologies, browse mappings between terms in different ontologies, receive recommendations on which ontologies are most relevant for a corpus, and annotate text with ontology terms.
Next steps
● Metadata (harmonization, minimal metadata sets)
● Mappings
● Next workshop (summer 2015)
  ● Review and complete the recommendations
  ● Refine and complete the guidelines and the best practices
  ● Finalize the repository of wheat-related vocabularies
● Prototyping: a semantic knowledge base
  ● Integrate data from different data sources
  ● Provide smart search capabilities that leverage the vocabularies used against the metadata.
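One way to read "smart search that leverages the vocabularies against the metadata" is query expansion: a search term is expanded to every label the vocabulary treats as equivalent before it is matched against dataset keywords. The sketch below illustrates that idea only; it is not the working group's prototype. The `VOCABULARY` mapping, the dataset identifiers and the keywords are all made up for the example.

```python
# Hypothetical vocabulary: preferred label -> equivalent labels and synonyms.
VOCABULARY = {
    "plant height": {"plant height", "stem length", "PH"},
    "grain yield": {"grain yield", "yield", "GY"},
}


def expand_query(term, vocabulary):
    """Return all lower-cased labels equivalent to term, or just the term if unknown."""
    needle = term.lower()
    for preferred, labels in vocabulary.items():
        lowered = {label.lower() for label in labels} | {preferred.lower()}
        if needle in lowered:
            return lowered
    return {needle}


def search(datasets, term, vocabulary):
    """Return the IDs of datasets whose keywords match any equivalent label."""
    labels = expand_query(term, vocabulary)
    return [d["id"] for d in datasets
            if labels & {k.lower() for k in d["keywords"]}]


# Hypothetical dataset metadata coming from different sources.
datasets = [
    {"id": "ds1", "keywords": ["Stem length", "drought"]},
    {"id": "ds2", "keywords": ["GY"]},
    {"id": "ds3", "keywords": ["plant height"]},
]

print(search(datasets, "PH", VOCABULARY))
```

Here a query for "PH" also finds the dataset described with "Stem length", which a plain keyword match would miss. In a real knowledge base the equivalences would come from the vocabulary mappings listed above rather than from a hand-written dictionary.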
Thank you!

Call Girls Noida🔥9873777170🔥Gorgeous Escorts in Noida Available 24/7
yashika sharman06
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Lattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptxLattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptx
DrRajeshDas
 
Anti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark UniverseAnti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark Universe
Sérgio Sacani
 

Recently uploaded (20)

JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDSJAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
JAMES WEBB STUDY THE MASSIVE BLACK HOLE SEEDS
 
Noida Call Girls Number 9999965857 Vip Call Girls Lady Of Your Dream Ready To...
Noida Call Girls Number 9999965857 Vip Call Girls Lady Of Your Dream Ready To...Noida Call Girls Number 9999965857 Vip Call Girls Lady Of Your Dream Ready To...
Noida Call Girls Number 9999965857 Vip Call Girls Lady Of Your Dream Ready To...
 
一比一原版(macewan学位证书)加拿大麦科文大学毕业证如何办理
一比一原版(macewan学位证书)加拿大麦科文大学毕业证如何办理一比一原版(macewan学位证书)加拿大麦科文大学毕业证如何办理
一比一原版(macewan学位证书)加拿大麦科文大学毕业证如何办理
 
Reaching the age of Adolescence- Class 8
Reaching the age of Adolescence- Class 8Reaching the age of Adolescence- Class 8
Reaching the age of Adolescence- Class 8
 
Delhi Call Girls ✓WhatsApp 9999965857 🔝Top Class Call Girl Service Available
Delhi Call Girls ✓WhatsApp 9999965857 🔝Top Class Call Girl Service AvailableDelhi Call Girls ✓WhatsApp 9999965857 🔝Top Class Call Girl Service Available
Delhi Call Girls ✓WhatsApp 9999965857 🔝Top Class Call Girl Service Available
 
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
Synopsis presentation VDR gene polymorphism and anemia (2).pptx
Synopsis presentation VDR gene polymorphism and anemia (2).pptxSynopsis presentation VDR gene polymorphism and anemia (2).pptx
Synopsis presentation VDR gene polymorphism and anemia (2).pptx
 
23PH301 - Optics - Unit 2 - Interference
23PH301 - Optics - Unit 2 - Interference23PH301 - Optics - Unit 2 - Interference
23PH301 - Optics - Unit 2 - Interference
 
Explainable Deepfake Image/Video Detection
Explainable Deepfake Image/Video DetectionExplainable Deepfake Image/Video Detection
Explainable Deepfake Image/Video Detection
 
Nutaceuticsls herbal drug technology CVS, cancer.pptx
Nutaceuticsls herbal drug technology CVS, cancer.pptxNutaceuticsls herbal drug technology CVS, cancer.pptx
Nutaceuticsls herbal drug technology CVS, cancer.pptx
 
the fundamental unit of life CBSE class 9.pptx
the fundamental unit of life CBSE class 9.pptxthe fundamental unit of life CBSE class 9.pptx
the fundamental unit of life CBSE class 9.pptx
 
seed production, Nursery & Gardening.pdf
seed production, Nursery & Gardening.pdfseed production, Nursery & Gardening.pdf
seed production, Nursery & Gardening.pdf
 
Mapping the Growth of Supermassive Black Holes as a Function of Galaxy Stella...
Mapping the Growth of Supermassive Black Holes as a Function of Galaxy Stella...Mapping the Growth of Supermassive Black Holes as a Function of Galaxy Stella...
Mapping the Growth of Supermassive Black Holes as a Function of Galaxy Stella...
 
Hariyalikart Case Study of helping farmers in Bihar
Hariyalikart Case Study of helping farmers in BiharHariyalikart Case Study of helping farmers in Bihar
Hariyalikart Case Study of helping farmers in Bihar
 
Nereis Type Study for BSc 1st semester.ppt
Nereis Type Study for BSc 1st semester.pptNereis Type Study for BSc 1st semester.ppt
Nereis Type Study for BSc 1st semester.ppt
 
Call Girls Noida🔥9873777170🔥Gorgeous Escorts in Noida Available 24/7
Call Girls Noida🔥9873777170🔥Gorgeous Escorts in Noida Available 24/7Call Girls Noida🔥9873777170🔥Gorgeous Escorts in Noida Available 24/7
Call Girls Noida🔥9873777170🔥Gorgeous Escorts in Noida Available 24/7
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Lattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptxLattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptx
 
Anti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark UniverseAnti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark Universe
 

RDA Wheat Data Interoperability Cookbook and last developments

  • 1. RDA Wheat Data Interoperability Cookbook and last developments 9th March 2015, San Diego
  • 2. The WDI working group in brief
      - Endorsement: March 2014
      - Members: ~30 members and 15 active members; Wheat scientists, data and metadata technologists
      - The goal: contribute to the improvement of Wheat related data interoperability by
          - Building a common interoperability framework (metadata, data formats and vocabularies)
          - Providing guidelines for describing, representing and linking Wheat related data
  • 3. Initial plans
      - Deliverables
          - A report of the survey of existing standards
          - A cookbook intended for the Wheat data managers community, which provides them with guidelines on what data formats, metadata, vocabularies and ontologies they should use to describe, represent and link different types of Wheat data
          - A library of linked vocabularies and ontologies in machine readable formats with respect to the Linked Data standards
          - A prototype which showcases the gain of interoperability
  • 5. Data formats currently used (standardized, tool specific, non standardized) and recommendations, by data type
      - SNPs
          - Formats currently used: VCF; BAM/SAM, BED, VARSCAN, VEP
          - Recommendation: VCF files generated by using the survey sequences of IWGSC + metadata about VCF files to enrich the information about the SNPs
      - Genome annotations
          - Formats currently used: Genbank Flat File, General Feature Format (GFF), EMBL
          - Recommendation: GFF3 + specifications with regard to the description of specific columns
      - Germplasms
          - Formats currently used: MPCD, ABCD, Darwin Core, Darwin Core Germplasm; Grin Global; tabulated
          - Recommendation: MPCD
      - Gene expression
          - Formats currently used: many format standards laid out by repositories such as NCBI (GEO) and EBI Array Express
          - Recommendation: existing format standards laid out by the repositories such as NCBI (GEO) and EBI Array Express + ENA
      - Physical maps
          - Formats currently used: GFF; Cmap, fpc
          - Recommendation: GFF3
      - Genetic maps
          - Formats currently used: Cmap, gnpmap
          - Recommendation: GFF3 (to be confirmed)
      - Phenotypes
          - Formats currently used: Drops, ped, isa-tab, ephesis; tabulated
          - Recommendation: Isa-tab
  • 6. Examples of use cases
      - Title: Searching for germplasm with specific traits
          - Description: Example of searching for germplasm with specific traits - tagged with ontology terms?
          - Data types: Germplasm, Phenotype
          - Challenges:
              - Metadata very important ~ standardized format
              - Association of genes to traits, linked to germplasm, marker information
              - Need for quality controls - how confident are you of the data source?
              - Provenance of the germplasm - pedigree, ownership,
              - Standard system for tracking germplasm, names
      - Title: Identification of wheat genes that control root growth
          - Description: Requires: Annotated genes (Gene Ontology, PFam, and other functional annotation)
          - Data types: Genomic annotations? - Gene location ? (IWGS-SS ID or MIPS HCS link)
          - Challenges: Mapping between wheat genes and orthologs from other species (deduce function by seq. similarity); access to RNASeq data (genes that are not expressed in roots may be irrelevant); mapping of wheat genes and information on their function based on literature
      - Title: Query on trial data associated with varieties
          - Data types: Phenotypic data, GIS data, (wheat economy/production data)
          - Description: To search wheat varieties with distribution maps, production figures, performances in wheat mega environments, associated projects worldwide plus layers of climatic data on specific wheat production areas and disease prevention information.
          - Challenges: Phenotypic data should be linked to GIS data. Using keywords or ontology terms, a system or a tool should be able to pull out such information from different websites/systems developed by the wheat community.
  • 8. Wheat related ontologies & vocabularies survey
      - Assess the level of visibility and interoperability of Wheat related vocabularies and ontologies
          - Is the vocabulary/ontology updated regularly?
          - What license and/or copyright is used?
          - Is the vocabulary/ontology part of any ontology communities or listing services?
          - Is the vocabulary/ontology used or implemented in any database/repository?
          - Does the vocabulary/ontology interlink and/or map to other vocabularies and ontologies?
          - Does the vocabulary/ontology
      - Identify the domain covered by the ontologies and vocabularies
      - Refine the cookbook
      - Collect more interoperability use cases
      - Collect some technical details
  • 9. Wheat related ontologies & vocabularies survey
  • 10. The Wheat related BioPortal allows one to search for terms across multiple ontologies, browse mappings between terms in different ontologies, receive recommendations on which ontologies are most relevant for a corpus, and annotate text with terms from ontologies.
  • 11. Next steps
      - Metadata (harmonization, minimal metadata sets)
      - Mappings
      - Next workshop (summer 2015)
          - Review and complete the recommendations
          - Refine and complete the guidelines and the best practices
          - Finalize the repository of Wheat related vocabularies
      - Prototyping: a semantic knowledge base
          - Integrate data from different data sources
          - Provide smart search capabilities that leverage the vocabularies used against the metadata
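
The "smart search" idea in the prototyping step can be illustrated with a minimal sketch. It is not the working group's prototype: the vocabulary, the trait terms, the record identifiers and the function names below are all hypothetical, chosen only to show how a search term could be expanded through a vocabulary's narrower terms before being matched against metadata records that come from different sources.

```python
# Hypothetical toy vocabulary: each term maps to its direct narrower terms.
NARROWER = {
    "disease resistance": ["rust resistance"],
    "rust resistance": ["stem rust resistance", "leaf rust resistance"],
}

# Hypothetical metadata records, as if harvested from different data sources.
RECORDS = [
    {"id": "acc-001", "source": "genebank A", "traits": ["stem rust resistance"]},
    {"id": "acc-002", "source": "genebank B", "traits": ["drought tolerance"]},
    {"id": "trial-17", "source": "trial database", "traits": ["rust resistance"]},
]


def expand(term, narrower):
    """Return the term together with all of its narrower terms (transitively)."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(narrower.get(current, []))
    return seen


def search(term, records, narrower):
    """Return ids of records tagged with the term or any of its narrower terms."""
    wanted = expand(term, narrower)
    return [r["id"] for r in records if wanted.intersection(r["traits"])]


print(search("disease resistance", RECORDS, NARROWER))
```

Searching for the broad term also returns records tagged only with a more specific term, which a plain keyword match on the metadata would miss. A real knowledge base would presumably rely on the published vocabularies and their mappings rather than a hand-written dictionary.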