SlideShare a Scribd company logo
1 of 26
FAIR Agronomy, where are we?
The KnetMiner Use Case
SWAT4LS 2022
Marco Brandizi marco.brandizi@rothamsted.ac.uk
Keywan Hassani-Pak keywan.hassani-pak@rothamsted.ac.uk
Find this presentation on SlideShare
background source: https://www.flickr.com/photos/60091022@N08/22004495993/
Hello!
https://knetminer.com
• Rothamsted Research is a non-profit research centre, focused on agricultural
science, including farming, plant biology, statistics and bioinformatics, livestock
management, entomology
• 300 people, founded in 1843, international collaborations, national
capabilities
• Our team develops the KnetMiner gene discovery platform, offering
knowledge graph based services for plants, pests and their interactions
• We are part of the Computational and Analytical Sciences department, a
community of experts in a wide range of analytical technologies and data
analysis tools
What is the future for SW4LS? What can Agronomy learn from Biomedical SW?
Network knowledge is thriving
There is more than Semantic Web and Linked Data now
Lightweight modelling is replac… adding to OWL
Agrifood is leveraging biomedics experience
background source: https://wallpapersafari.com/w/yAIwLx
Network knowledge is thriving
Network Knowledge is Thriving
Network Knowledge is Thriving
Good Reads:
• Recent trends in knowledge graphs: theory and practice,
https://link.springer.com/article/10.1007/s00500-021-05756-8
• Wikidata: A large-scale collaborative ontological medical database,
https://www.sciencedirect.com/science/article/pii/S1532046419302114
• KG-COVID-19: A Framework to Produce Customized Knowledge Graphs
for COVID-19 Response, https://doi.org/10.1016/j.patter.2020.100155
• Constructing and Mining Web-Scale Knowledge Graphs,
https://fdocuments.net/reader/full/kdd14-t2-bordes-gabrilovich-3
• Graph Embeddings: The Secret Ingredient for Relationship-Driven AI,
https://www.youtube.com/watch?v=-CscGHDXrZY
The KnetMiner Knowledge Graphs
The KnetMiner Knowledge Graphs
The KnetMiner Knowledge Graphs
The KnetMiner Knowledge Graphs
OXL format
There is more than Semantic Web and Linked Data now
More than Semantic Web and Linked Data
The Hybrid KnetMiner
KnetMiner
ontology (BioKNO)
knetminer.org
rdf2pg tool
+RDF exporter1
1) Based on https://github.com/EBIBioSamples/java2rdf
Exploiting both RDF and Cypher
Lightweight modelling is adding to OWL
Lightweight Data Modelling
source: https://tinyurl.com/y52c263s
source: https://www.schemaapp.com
Lightweight Data Modelling in KnetMiner and Beyond
AgriSchemas
ontology (BioKNO)
knetminer.com
AgriSchemas
Use case Data Types Data Sources Status
Molecular Biology Gene, Protein, Pathway
encodes, participates
Via Knetminer: ENSEMBL, UniProt,
TILLING, wheat-expression.com,
KEGG
Done.
Ontology Annotations Ontology Term
(schema:DefinedTerm)
dc:type, schema:additionalType
Via Knetminer: GO, PO, CROP-
Onto
Done.
Experiments Study, agri:StudyFactor,
PropertyValue
EBI/GXA, GLTen, MIAPPE/BrAPI
sources, ?
GXA Done
MIAPPE, much work done during
ELIXIR BioHackathon, going on
with monthly calls
GLTen use case drafted
Literature agri:ScholarlyPublication
mentions
Via Knetminer: PubMed Done
Gene Expression bioschema:expressedIn, reified
statements, agri:evidence,
agri:pvalue, agri:baseCondition
EBI/GXA, Via Knetminer: wheat-
expression.com
GXA
Host-pathogen interaction Gene, Phenotype,
agri:ScholarlyPublication
agri:HostPathogenInteraction
agri:evidence
PHI-Base Use case drafted
Weather ? ? TO DO
Dataset metadata Dataset, DataCatalog
license, distribution
knetminer.org/data ongoing
AgriSchemas Progress
Use case Data Types Data Sources Status
Molecular Biology Gene, Protein, Pathway
encodes, participates
Via Knetminer: ENSEMBL, UniProt,
TILLING, wheat-expression.com,
KEGG
Done.
Ontology Annotations Ontology Term
(schema:DefinedTerm)
dc:type, schema:additionalType
Via Knetminer: GO, PO, CROP-Onto Done.
Experiments Study, agri:StudyFactor,
PropertyValue
EBI/GXA, GLTen, MIAPPE/BrAPI
sources, ?
GXA Done
MIAPPE, much work done during
ELIXIR BioHackathon, going on with
monthly calls
GLTen use case drafted
Literature agri:ScholarlyPublication
mentions
Via Knetminer: PubMed Done
Gene Expression bioschema:expressedIn, reified
statements, agri:evidence,
agri:pvalue, agri:baseCondition
EBI/GXA, Via Knetminer: wheat-
expression.com
GXA
Host-pathogen interaction Gene, Phenotype,
agri:ScholarlyPublication
agri:HostPathogenInteraction
agri:evidence
PHI-Base Use case drafted
Weather ? ? TO DO
Dataset metadata Dataset, DataCatalog
license, distribution
knetminer.org/data ongoing
KnetMiner & AgriSchemas
SELECT ?gene ?geneAcc ?condLabel ?studyTitle ?study ?pub ?pubTitle ?pubYear
?condTerm {
?gene a bioschema:Gene; dcterms:identifier ?geneAcc.
# ?gene bioschema:expressedIn ?condition also available
?expStatement a rdfs:Statement;
rdf:subject ?gene;
rdf:predicate bioschema:expressedIn;
rdf:object ?condition;
agri:evidence ?study.
?pub schema:mentions ?gene.
?pub a agri:ScholarlyPublication; dcterms:title ?pubTitle.
OPTIONAL { ?pub schema:datePublished ?pubYear }
?condition schema:name ?condLabel. OPTIONAL { ?condition dc:type ?condTerm. }
?study dc:title ?studyTitle;
}
KnetMiner & AgriSchemas
Agrifood is leveraging biomedics experience
Agrifood is leveraging biomedics experience
Biomedics Agrifood
Resources NCBI, EBI, BioPortal, Bio2RDF. Many used in plant biology too (eg, ENSEMBL,
PubMED). Smaller similar projects, eg, AgroLD,
AgroPortal, CGIAR/GARDIAN, WheatIS, more data
integration needed.
Ontologies Integration efforts by OBO. MolBio ontologies (eg, GO,
CL), medical ontologies (eg, UMLS, PATO), experimental
reporting ontologies (eg, OBI, SIO, EFO), specie
classification (NCBITax).
Significant overlapping (eg, GO, NCBITax), good
coverage of plant biology (eg, Crop Ontology, Plant Trait
Onto). Food-related ontologies (eg, FoodOn) and
taxonomies (eg, AGROVOC). Coverage of different
aspects (eg, animals, forestry). Lack of coverage? Eg,
weather.
Schemas and formats Established formats (eg, FASTA, ISA-Tab, CDISC),
established standardisations (HL7, FHIR) ongoing
standardization efforts (BioSchemas).
Significant overlapping (eg, FASTA, ISA-Tab). More
heterogeneous landscape, eg, MIAPPE and COPO not
widely adopted, simpler solutions preferred (eg,
Frictionless in DFW)
Data Access Heterogeneous APIs, much use of SPARQL, but still
limited.
A few established APIs (eg, BrAPI, WheatIS), more
desirable.
Data and AI Much traditional machine learning, a number of graph
embedding projects (eg, https://tinyurl.com/yxbau649,
https://tinyurl.com/yyq9rdh9)
ML for precision farming (eg,
https://tinyurl.com/y5zopcd5). Some graph embedding
projects (eg, https://tinyurl.com/yxt9lgk7,
https://tinyurl.com/y3mnr2n2)
References
• AgriSchemas
• https://github.com/Rothamsted/agri-schemas
• Use cases: https://github.com/Rothamsted/agri-schemas/tree/master/drafts/201904-dfw-
hackathon
• Real data & ETL tools: https://github.com/Rothamsted/agri-schemas/tree/master/dfw-dataset
• Knetminer
• Web site: http://knetminer.org
• Publication: https://doi.org/10.1111/pbi.13583
• Case study about FAIR data:
• https://knetminer.com/cases/the-power-of-standardised-and-fair-knowledge-graphs.html
• FAIR data infrastructure: https://doi.org/10.1515/jib-2018-0023
• Data endpoint: http://knetminer.org/data
• DFW
• AgriSchemas and DFW:
• https://designingfuturewheat.org.uk/dfw-and-fair-agriculture-data-the-knetminer-
experience/
• Me
• https://www.slideshare.net/mbrandizi, https://marcobrandizi.info/about-me/
Acknowledgements
Ajit Singh
Software Engineer
• Samiul Haque, Ed Eyles, IT admins
• Joseph Hearnshaw, software engineer
• Louis Timberlake, visiting student
• Alice Minotto, Earlham Institute, hosting providers
• Robert Davey, Earlham Institute, DFW WP4 coordinator
• William Brown, Ricardo Gregorio, IT admins
• Monika Mistry, master Student, data Curator
• Sandeep Amberkar, bioinformatician, data curator
• Richard Holland, ext contractors, developers
Keywan Hassani-Pak
KnetMiner Team Leader
Chris Rawlings
Head of Computational & Analytical Sciences
Jeremy Parsons
Bioinformatics Scientist

More Related Content

What's hot

Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsChristoph Steinbeck
 
Application of bioinformatics
Application of bioinformaticsApplication of bioinformatics
Application of bioinformaticsKamlesh Patade
 
Intro bioinformatics
Intro bioinformaticsIntro bioinformatics
Intro bioinformaticsChris Dwan
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked DataMichel Dumontier
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to BioinformaticsLeighton Pritchard
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureRobert Cormia
 
Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3Philippe Rocca-Serra
 
Bioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and AdvantagesBioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and Advantagesbioinformatt
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesAmos Watentena
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...Alejandra Gonzalez-Beltran
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsRai University
 
Database technologies in bioinformatics
Database technologies in bioinformaticsDatabase technologies in bioinformatics
Database technologies in bioinformaticsGleb Sklyr
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Alejandra Gonzalez-Beltran
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final PresentationShruthi Choudary
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsShikha Thakur
 
Data sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryData sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryResearch Information Network
 
Careers in bioinformatics
Careers in bioinformaticsCareers in bioinformatics
Careers in bioinformaticsentranzz123
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsphilmaweb
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 

What's hot (20)

Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application Domains
 
Application of bioinformatics
Application of bioinformaticsApplication of bioinformatics
Application of bioinformatics
 
Intro bioinformatics
Intro bioinformaticsIntro bioinformatics
Intro bioinformatics
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of Nature
 
Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3
 
Bioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and AdvantagesBioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and Advantages
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
 
Database technologies in bioinformatics
Database technologies in bioinformaticsDatabase technologies in bioinformatics
Database technologies in bioinformatics
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Data sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryData sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK Story
 
Careers in bioinformatics
Careers in bioinformaticsCareers in bioinformatics
Careers in bioinformatics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 

Similar to FAIR Agronomy, where are we? The KnetMiner Use Case

Data management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryCarole Goble
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services IndustryBarry Smith
 
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Amit Sheth
 
Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Dag Endresen
 
Meta genomics talks
Meta genomics talksMeta genomics talks
Meta genomics talksAteeq Khaliq
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesMonica Munoz-Torres
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsProf. Wim Van Criekinge
 
Falck zepeda ashs washington dc 2018
Falck zepeda ashs washington dc 2018Falck zepeda ashs washington dc 2018
Falck zepeda ashs washington dc 2018Jose Falck Zepeda
 
Talk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingTalk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingJonathan Eisen
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersLarry Smarr
 
Sharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceSharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceDag Endresen
 
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...CGIAR Generation Challenge Programme
 
ICAR2016 TAIR talk
ICAR2016 TAIR talkICAR2016 TAIR talk
ICAR2016 TAIR talkDonghui Li
 
Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)Dag Endresen
 

Similar to FAIR Agronomy, where are we? The KnetMiner Use Case (20)

The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...
 
Data management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK Story
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services Industry
 
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
 
Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)Prototype germplasm data portal (2006)
Prototype germplasm data portal (2006)
 
Meta genomics talks
Meta genomics talksMeta genomics talks
Meta genomics talks
 
Bioinformatics-2009-Moura-1096-8
Bioinformatics-2009-Moura-1096-8Bioinformatics-2009-Moura-1096-8
Bioinformatics-2009-Moura-1096-8
 
Introduction to Proteogenomics
Introduction to Proteogenomics Introduction to Proteogenomics
Introduction to Proteogenomics
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
 
Falck zepeda ashs washington dc 2018
Falck zepeda ashs washington dc 2018Falck zepeda ashs washington dc 2018
Falck zepeda ashs washington dc 2018
 
Talk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingTalk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meeting
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
Sharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conferenceSharing of germplasm data sets, at the TDWG 2006 conference
Sharing of germplasm data sets, at the TDWG 2006 conference
 
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
 
ICAR2016 TAIR talk
ICAR2016 TAIR talkICAR2016 TAIR talk
ICAR2016 TAIR talk
 
Amman Workshop #2 - M MacKay
Amman Workshop #2 - M MacKayAmman Workshop #2 - M MacKay
Amman Workshop #2 - M MacKay
 
Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)Trait data mining using FIGS (2006)
Trait data mining using FIGS (2006)
 

More from Rothamsted Research, UK

Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Publishing and Consuming FAIR DataA Case in the Agri-Food DomainPublishing and Consuming FAIR DataA Case in the Agri-Food Domain
Publishing and Consuming FAIR Data A Case in the Agri-Food DomainRothamsted Research, UK
 
AgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesAgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesRothamsted Research, UK
 
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Rothamsted Research, UK
 
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerA Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerRothamsted Research, UK
 
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...Rothamsted Research, UK
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Rothamsted Research, UK
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...Rothamsted Research, UK
 
myEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference servicemyEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference serviceRothamsted Research, UK
 
BioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialBioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialRothamsted Research, UK
 

More from Rothamsted Research, UK (20)

Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Publishing and Consuming FAIR DataA Case in the Agri-Food DomainPublishing and Consuming FAIR DataA Case in the Agri-Food Domain
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
 
Continuos Integration @Knetminer
Continuos Integration @KnetminerContinuos Integration @Knetminer
Continuos Integration @Knetminer
 
AgriSchemas Progress Report
AgriSchemas Progress ReportAgriSchemas Progress Report
AgriSchemas Progress Report
 
AgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use CasesAgriFood Data, Models, Standards, Tools, Use Cases
AgriFood Data, Models, Standards, Tools, Use Cases
 
Notes about SWAT4LS 2018
Notes about SWAT4LS 2018Notes about SWAT4LS 2018
Notes about SWAT4LS 2018
 
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
 
Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018Knetminer Backend Training, Nov 2018
Knetminer Backend Training, Nov 2018
 
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMinerA Preliminary survey of RDF/Neo4j as backends for KnetMiner
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
 
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
 
graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...graph2tab, a library to convert experimental workflow graphs into tabular for...
graph2tab, a library to convert experimental workflow graphs into tabular for...
 
Interoperable Open Data: Which Recipes?
Interoperable Open Data: Which Recipes?Interoperable Open Data: Which Recipes?
Interoperable Open Data: Which Recipes?
 
Linked Data with the EBI RDF Platform
Linked Data with the EBI RDF PlatformLinked Data with the EBI RDF Platform
Linked Data with the EBI RDF Platform
 
BioSD Linked Data: Lessons Learned
BioSD Linked Data: Lessons LearnedBioSD Linked Data: Lessons Learned
BioSD Linked Data: Lessons Learned
 
BioSD Tutorial 2014 Editition
BioSD Tutorial 2014 EdititionBioSD Tutorial 2014 Editition
BioSD Tutorial 2014 Editition
 
myEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference servicemyEquivalents, aka a new cross-reference service
myEquivalents, aka a new cross-reference service
 
Dev 2014 LOD tutorial
Dev 2014 LOD tutorialDev 2014 LOD tutorial
Dev 2014 LOD tutorial
 
BioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialBioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS Tutorial
 
Semic 2013
Semic 2013Semic 2013
Semic 2013
 
Uk onto net_2013_notes_brandizi
Uk onto net_2013_notes_brandiziUk onto net_2013_notes_brandizi
Uk onto net_2013_notes_brandizi
 

Recently uploaded

Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzohaibmir069
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfWadeK3
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 

Recently uploaded (20)

Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistan
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 

FAIR Agronomy, where are we? The KnetMiner Use Case

  • 1. FAIR Agronomy, where are we? The KnetMiner Use Case SWAT4LS 2022 Marco Brandizi marco.brandizi@rothamsted.ac.uk Keywan Hassani-Pak keywan.hassani-pak@rothamsted.ac.uk Find this presentation on SlideShare background source: https://www.flickr.com/photos/60091022@N08/22004495993/
  • 2. Hello! https://knetminer.com • Rothamsted Research is a non-profit research centre, focused on agricultural science, including farming, plant biology, statistics and bioinformatics, livestock management, entomology • 300 people, founded in 1843, international collaborations, national capabilities • Our team develops the KnetMiner gene discovery platform, offering knowledge graph based services for plants, pests and their interactions • We are part of the Computational and Analytical Sciences department, a community of experts in a wide range of analytical technologies and data analysis tools
  • 3. What is the future for SW4LS? What can Agronomy learn from Biomedical SW? Network knowledge is thriving There is more than Semantic Web and Linked Data now Lightweight modelling is replac… adding to OWL Agrifood is leveraging biomedics experience background source: https://wallpapersafari.com/w/yAIwLx
  • 6. Network Knowledge is Thriving Good Reads: • Recent trends in knowledge graphs: theory and practice, https://link.springer.com/article/10.1007/s00500-021-05756-8 • Wikidata: A large-scale collaborative ontological medical database, https://www.sciencedirect.com/science/article/pii/S1532046419302114 • KG-COVID-19: A Framework to Produce Customized Knowledge Graphs for COVID-19 Response, https://doi.org/10.1016/j.patter.2020.100155 • Constructing and Mining Web-Scale Knowledge Graphs, https://fdocuments.net/reader/full/kdd14-t2-bordes-gabrilovich-3 • Graph Embeddings: The Secret Ingredient for Relationship-Driven AI, https://www.youtube.com/watch?v=-CscGHDXrZY
  • 10. The KnetMiner Knowledge Graphs OXL format
  • 11. There is more than Semantic Web and Linked Data now
  • 12. More than Semantic Web and Linked Data
  • 13. The Hybrid KnetMiner KnetMiner ontology (BioKNO) knetminer.org rdf2pg tool +RDF exporter1 1) Based on https://github.com/EBIBioSamples/java2rdf
  • 14. Exploiting both RDF and Cypher
  • 15. Lightweight modelling is adding to OWL
  • 16. Lightweight Data Modelling source: https://tinyurl.com/y52c263s source: https://www.schemaapp.com
  • 17. Lightweight Data Modelling in KnetMiner and Beyond AgriSchemas ontology (BioKNO) knetminer.com
  • 19. Use case Data Types Data Sources Status Molecular Biology Gene, Protein, Pathway encodes, participates Via Knetminer: ENSEMBL, UniProt, TILLING, wheat-expression.com, KEGG Done. Ontology Annotations Ontology Term (schema:DefinedTerm) dc:type, schema:additionalType Via Knetminer: GO, PO, CROP- Onto Done. Experiments Study, agri:StudyFactor, PropertyValue EBI/GXA, GLTen, MIAPPE/BrAPI sources, ? GXA Done MIAPPE, much work done during ELIXIR BioHackathon, going on with monthly calls GLTen use case drafted Literature agri:ScholarlyPublication mentions Via Knetminer: PubMed Done Gene Expression bioschema:expressedIn, reified statements, agri:evidence, agri:pvalue, agri:baseCondition EBI/GXA, Via Knetminer: wheat- expression.com GXA Host-pathogen interaction Gene, Phenotype, agri:ScholarlyPublication agri:HostPathogenInteraction agri:evidence PHI-Base Use case drafted Weather ? ? TO DO Dataset metadata Dataset, DataCatalog license, distribution knetminer.org/data ongoing
  • 20. AgriSchemas Progress Use case Data Types Data Sources Status Molecular Biology Gene, Protein, Pathway encodes, participates Via Knetminer: ENSEMBL, UniProt, TILLING, wheat-expression.com, KEGG Done. Ontology Annotations Ontology Term (schema:DefinedTerm) dc:type, schema:additionalType Via Knetminer: GO, PO, CROP-Onto Done. Experiments Study, agri:StudyFactor, PropertyValue EBI/GXA, GLTen, MIAPPE/BrAPI sources, ? GXA Done MIAPPE, much work done during ELIXIR BioHackathon, going on with monthly calls GLTen use case drafted Literature agri:ScholarlyPublication mentions Via Knetminer: PubMed Done Gene Expression bioschema:expressedIn, reified statements, agri:evidence, agri:pvalue, agri:baseCondition EBI/GXA, Via Knetminer: wheat- expression.com GXA Host-pathogen interaction Gene, Phenotype, agri:ScholarlyPublication agri:HostPathogenInteraction agri:evidence PHI-Base Use case drafted Weather ? ? TO DO Dataset metadata Dataset, DataCatalog license, distribution knetminer.org/data ongoing
  • 21. KnetMiner & AgriSchemas SELECT ?gene ?geneAcc ?condLabel ?studyTitle ?study ?pub ?pubTitle ?pubYear ?condTerm { ?gene a bioschema:Gene; dcterms:identifier ?geneAcc. # ?gene bioschema:expressedIn ?condition also available ?expStatement a rdfs:Statement; rdf:subject ?gene; rdf:predicate bioschema:expressedIn; rdf:object ?condition; agri:evidence ?study. ?pub schema:mentions ?gene. ?pub a agri:ScholarlyPublication; dcterms:title ?pubTitle. OPTIONAL { ?pub schema:datePublished ?pubYear } ?condition schema:name ?condLabel. OPTIONAL { ?condition dc:type ?condTerm. } ?study dc:title ?studyTitle; }
  • 23. Agrifood is leveraging biomedics experience
  • 24. Agrifood is leveraging biomedics experience Biomedics Agrifood Resources NCBI, EBI, BioPortal, Bio2RDF. Many used in plant biology too (eg, ENSEMBL, PubMED). Smaller similar projects, eg, AgroLD, AgroPortal, CGIAR/GARDIAN, WheatIS, more data integration needed. Ontologies Integration efforts by OBO. MolBio ontologies (eg, GO, CL), medical ontologies (eg, UMLS, PATO), experimental reporting ontologies (eg, OBI, SIO, EFO), specie classification (NCBITax). Significant overlapping (eg, GO, NCBITax), good coverage of plant biology (eg, Crop Ontology, Plant Trait Onto). Food-related ontologies (eg, FoodOn) and taxonomies (eg, AGROVOC). Coverage of different aspects (eg, animals, forestry). Lack of coverage? Eg, weather. Schemas and formats Established formats (eg, FASTA, ISA-Tab, CDISC), established standardisations (HL7, FHIR) ongoing standardization efforts (BioSchemas). Significant overlapping (eg, FASTA, ISA-Tab). More heterogeneous landscape, eg, MIAPPE and COPO not widely adopted, simpler solutions preferred (eg, Frictionless in DFW) Data Access Heterogeneous APIs, much use of SPARQL, but still limited. A few established APIs (eg, BrAPI, WheatIS), more desirable. Data and AI Much traditional machine learning, a number of graph embedding projects (eg, https://tinyurl.com/yxbau649, https://tinyurl.com/yyq9rdh9) ML for precision farming (eg, https://tinyurl.com/y5zopcd5). Some graph embedding projects (eg, https://tinyurl.com/yxt9lgk7, https://tinyurl.com/y3mnr2n2)
  • 25. References • AgriSchemas • https://github.com/Rothamsted/agri-schemas • Use cases: https://github.com/Rothamsted/agri-schemas/tree/master/drafts/201904-dfw- hackathon • Real data & ETL tools: https://github.com/Rothamsted/agri-schemas/tree/master/dfw-dataset • Knetminer • Web site: http://knetminer.org • Publication: https://doi.org/10.1111/pbi.13583 • Case study about FAIR data: • https://knetminer.com/cases/the-power-of-standardised-and-fair-knowledge-graphs.html • FAIR data infrastructure: https://doi.org/10.1515/jib-2018-0023 • Data endpoint: http://knetminer.org/data • DFW • AgriSchemas and DFW: • https://designingfuturewheat.org.uk/dfw-and-fair-agriculture-data-the-knetminer- experience/ • Me • https://www.slideshare.net/mbrandizi, https://marcobrandizi.info/about-me/
  • 26. Acknowledgements Ajit Singh Software Engineer • Samiul Haque, Ed Eyles, IT admins • Joseph Hearnshaw, software engineer • Louis Timberlake, visiting student • Alice Minotto, Earlham Institute, hosting providers • Robert Davey, Earlham Institute, DFW WP4 coordinator • William Brown, Ricardo Gregorio, IT admins • Monika Mistry, master Student, data Curator • Sandeep Amberkar, bioinformatician, data curator • Richard Holland, ext contractors, developers Keywan Hassani-Pak KnetMiner Team Leader Chris Rawlings Head of Computational & Analytical Sciences Jeremy Parsons Bioinformatics Scientist

Editor's Notes

  1. Our application is an example of gene exploration app over KGs. Knowledge is matched to search input and ranked based on search input plus biological significance (semantic motifs, later)
  2. Initially, and still quite so, based on ad-hoc workflow system (KnetBuilder) and ad-hoc KG format (OXL). So, KGs come in many ways.
  3. Many different knowledge graph databases exist, and also several ETL + data exchange formats. SPARQL alternatives exist for graph-like data access.
  4. We support both SPARQL and Cypher, they’ve different sets of pros/cons. RDF is at the base of modelling and ETL. We have developed the rdf2pg tool for combining the two.
  5. An example of Cypher benefits: keyword-found entities are matched to genes via well-known paths (semantic motifs). Initially, SM were based on a limited transition machine syntax, now we have a Cypher traverser, more expressive syntax.
  6. Everyone knows schema.org. Here, two examples of how rather commercial apps are leveraging it. schema.org and bioschemas are “lightweight”, easier to use and practical for application development. Moreover, they’re complementary to “true” ontologies like GO.
  7. What we do with schema.org and bioschemas: our ontology mapped to the standards, more datasets integrated, a new project, AgriSchemas, to extend bioschemas to the agrifood domain.
  8. A prototype with real datasets is on line. Here, a query that combine specific KnetMiner data (with several mappings and URI reuse) with EBI/GXA data mapped to AgriSchemas.
  9. And here the results. As you see, Knetminer info (eg, gene/pub associations computed via text mining) can be linked to GXA info (ie, gene expression), including ontology annotations. If you want to collaborate, we have ideas on how to exploit this combination for novel insights in wheat.