SlideShare a Scribd company logo
1 of 17
Download to read offline
Enhancing Data Integration with Text Analysis to 
Find Proteins Implicated in Plant Stress Response
Keywan Hassani‐Pak
keywan.hassani‐pak@bbsrc.ac.uk
Integrative Bioinformatics 2010
Outline
• Motivation
– Prior information for candidate genes
– Structured data and unstructured text
• Methods
– Text mining plugin for Ondex
– Application case
• Results
– Visualisation
– Association networks
– Filtering noise
– Validation
• Summary
Motivation
• High throughput ‘omics research can identify many candidate genes
• Interpretation of experimental results needs prior information
• Most important sources for prior information are
– Structured bioinformatics databases 
– Unstructured scientific literature
• GOAL: Automated methods for the integration of prior information
Identify genes that
alter expression
over time
DBs
Literature
Public Data SourcesTime Course Microarray Data
Gene1
Gene2
Gene3
…
...
GeneN
Candidate Genes
Experiment1
Experiment2
…
Get prior information 
for genes regarding 
the experiment
Structured Data vs. Unstructured Text
• Data integration methods
– Syntactic and semantic heterogeneity
– Literature references
• Text mining methods
– Identify facts hidden in unstructured text
– Integrate facts with database entries
http://www.nactem.ac.uk/software/kleio
http://www.uniprot.org
Integrative Text Mining
• Old: Data integration and text mining systems have been largely 
developed independently
• Idea: Combining structured knowledge stored in public data bases with 
unstructured information in literature
• New: Text mining plugin for the data integration framework Ondex
Data
Transformation
Clients/ToolsHeterogeneous
Data Sources
UniProt
OBO
Parser
Parser
Ondex
CoreGeneralizedObjectDataModel
DatabaseLayer
Mapping
Methods
Accession
Name based
BLAST
Data Exchange
Taverna
Cytoscape
Ondex
Frontend
Lucene
KEGG Parser
OXL/RDF
WebService
Text Mining
MEDLINE Parser
Ondex Integrator
www.ondex.org
Advanced Knowledge Base
1. Structured information 
– Bioinformatics databases, ontologies
– Curated citations in structured data sources (e.g. from UniProt)
2. Unstructured information
– MEDLINE titles and abstracts are indexed and normalised (by Lucene)
– Information Retrieval strategies: exact, fuzzy, proximity
– Named Entity Recognition: concept‐based (names and synonyms) 
– Score: tf‐idf weight (term frequency * inverse document frequency)
text‐mining
x
y
BA
is_related
Publication
Concepts
published_in
weighted association network
IP=1.7; M=1.2; N=2
yx
BA
Association Scores
weighted association network
N=29; M=3.1; IP=22.4
BA
= N
tf‐idf = 3.1 = M
tf‐idf = 1.7
tf‐idf = 0.9
IP = 22.4
...
Phenotypes
Worldwide Data Resources
Time Course Microarray Data
Network Inference
‐ Literature
‐ Public databases
‐ Public experiments
Identification of key 
regulatory genes
Knock out experiments
Overexpresser  experiments
Identify genes that alter 
expression over time
Prior information
Ondex
The PRESTA project
http://www2.warwick.ac.uk/fac/sci/whri/research/presta
Application Case: Knowledge Base for 
Stress Response in Arabidopsis
• Publications (the corpus)
– MEDLINE: search ‘Arabidopsis thaliana’ 
28653 publications
• Proteins
– UniProtKB: search ‘taxid:3702 + reviewed’ 
8582 proteins
– 13502 curated citations
• Plant Stress Ontology
– 33 stresses/treatments related to PRESTA 
experiments
– Biotic: Bacteria, Fungus, etc. 
– Abiotic: Drought, Salt, Light, Hormone, etc.
Stress
Protein
Publication
Enzyme
13502 352445194
published_in
is_related
X. campestris
Network Visualisation
Protein‐Stress Association Network
• 3145 proteins linked to 32 stresses by 10777 relations
• On average
• each protein associated with 3.4 stresses
• each stress associated with 337 proteins
• Filtering associations based on three confidence scores IP, M and N
X. campestris
Ethylene
Metric Min Max
IP 0.01 347.26
M 0.01 26.86
N 1 600
How to find cut‐offs for filtering?
• Problem: Text mining results often error‐prone 
• Aim: Improving signal‐to‐noise ratio by setting optimal cut‐offs
• Co‐citation number (N) is simplest way to potentially reduce noise in 
such association networks
• Filtering by IP and M should be more selective as both consider 
frequency of terms in the corpus
• However, none of the metrics is superior overall
• Considering several metrics at the same time seems to be method of 
choice to reduce noise and highlight key associations
a. b.
TM
AHD
AHD
TM
Validation of Protein‐Ethylene Pairs
• Ethylene association network contained 533 proteins
• Ideally read all abstracts and evaluate association
• Comparison with Arabidopsis Hormone Database (AHD)
a. 31 curated associations:  71.0% recall
b. 166 total associations (inc. GO): 44.8% recall
Top 10 protein predictions
ACCESSION NAME PUBMED YEAR M N IP PVAL TRUE
AT3G05420 ACBP4 18836139 2008 13.51 1 13.51 1.00* yes
AT1G31812 ACBP6 18836139 2008 11.57 2 17.14 0.50 yes
AT3G03190 ATGSTF6 14617075 2003 7.36 7 15.75 0.25 yes
AT4G26080 ABI1 19705149 2009 6.66 10 12.22 0.39 yes
AT3G21510 AHP1 18384742 2008 6.60 3 6.70 0.17 yes
AT1G75040 PR‐5 15988566 2005 5.18 12 5.47 0.07 yes
AT2G45820 Remorin 9159183 1997 5.04 4 6.77 0.86 no
AT3G11410 PP2CA 19705149 2009 5.00 1 5.00 1.00 yes
AT1G09570 Phytochrome A 8703080 1996 4.79 11 8.47 0.19 no
AT1G04240 IAA3 19213814 2009 4.54 3 5.14 0.67 yes
• Evaluated top 10 proteins (sorted by M score) from our analyses that are 
linked  to ethylene but were not found in AHD. 
• P‐value relates to the significance of the IP score.  
However if N=1  P=1 (*)
• Evidence text
• PMID:18836139: the interaction of ACBP4 and AtEBP may be related to AtEBP‐mediated 
defence possibly via ethylene and/or jasmonate signalling.
• PMID:19705149: protein phosphatase 2C ABI1 modulates biosynthesis ratio of ABA and 
ethylene.
Future Work
• Integrate more advanced text mining methods
• Extensive analysis and evaluation of our association metrics
• Investigate alternative association metrics
• Finding best cut‐off for optimal signal‐to‐noise ratio
• Apply method to more application cases
Summary
• Prior information needs to be extracted from structured data and 
unstructured text
• Developed a flexible text mining plugin for the data integration framework 
Ondex (open source)
• Can be linked into various bioinformatics workflow to enhance high‐
throughput ‘omics research
• First report of systematically combining data integration with basic text 
mining
• Generated prior information for Arabidopsis proteins regarding the 
PRESTA experiments
Acknowledgements
ONDEX BBSRC SABR Project BB/F006039
PRESTA BBSRC SABR project BB/F005806
Catherine Canevet
Chris Rawlings
Roxane Legaie
Hugo van den Berg
Jay Moore
THANK YOU!
Contact:
keywan.hassani‐pak@bbsrc.ac.uk

More Related Content

What's hot

A Topic map-based ontology IR system versus Clustering-based IR System: A Com...
A Topic map-based ontology IR system versus Clustering-based IR System: A Com...A Topic map-based ontology IR system versus Clustering-based IR System: A Com...
A Topic map-based ontology IR system versus Clustering-based IR System: A Com...tmra
 
A survey of heterogeneous information network analysis
A survey of heterogeneous information network analysisA survey of heterogeneous information network analysis
A survey of heterogeneous information network analysisSOYEON KIM
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemIJTET Journal
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataMichel Dumontier
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceJian Qin
 
Managing and Sharing Research Data - Workshop at UiO - December 04, 2017
Managing and Sharing Research Data - Workshop at UiO - December 04, 2017Managing and Sharing Research Data - Workshop at UiO - December 04, 2017
Managing and Sharing Research Data - Workshop at UiO - December 04, 2017Michel Heeremans
 
Bringing bioinformatics into the library
Bringing bioinformatics into the libraryBringing bioinformatics into the library
Bringing bioinformatics into the libraryC. Tobin Magle
 
Application and Implementation of different deep learning
Application and Implementation of different deep learningApplication and Implementation of different deep learning
Application and Implementation of different deep learningJIEJackyZOUChou
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?Maryann Martone
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksAlexander Pico
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGPhilip Bourne
 
Agile Curation: 2015 AGU Presentation
Agile Curation: 2015 AGU PresentationAgile Curation: 2015 AGU Presentation
Agile Curation: 2015 AGU PresentationJosh Young
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Alexander Pico
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesMicah Altman
 
EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EITESANGO
 
Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...
Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...
Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...Pistoia Alliance
 

What's hot (20)

A Topic map-based ontology IR system versus Clustering-based IR System: A Com...
A Topic map-based ontology IR system versus Clustering-based IR System: A Com...A Topic map-based ontology IR system versus Clustering-based IR System: A Com...
A Topic map-based ontology IR system versus Clustering-based IR System: A Com...
 
A survey of heterogeneous information network analysis
A survey of heterogeneous information network analysisA survey of heterogeneous information network analysis
A survey of heterogeneous information network analysis
 
A0210110
A0210110A0210110
A0210110
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval System
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 
Data Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information ScienceData Science and What It Means to Library and Information Science
Data Science and What It Means to Library and Information Science
 
Managing and Sharing Research Data - Workshop at UiO - December 04, 2017
Managing and Sharing Research Data - Workshop at UiO - December 04, 2017Managing and Sharing Research Data - Workshop at UiO - December 04, 2017
Managing and Sharing Research Data - Workshop at UiO - December 04, 2017
 
Bringing bioinformatics into the library
Bringing bioinformatics into the libraryBringing bioinformatics into the library
Bringing bioinformatics into the library
 
Application and Implementation of different deep learning
Application and Implementation of different deep learningApplication and Implementation of different deep learning
Application and Implementation of different deep learning
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Technology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive NetworksTechnology R&D Theme 2: From Descriptive to Predictive Networks
Technology R&D Theme 2: From Descriptive to Predictive Networks
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAG
 
Agile Curation: 2015 AGU Presentation
Agile Curation: 2015 AGU PresentationAgile Curation: 2015 AGU Presentation
Agile Curation: 2015 AGU Presentation
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020
 
Privacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use CasesPrivacy in Research Data Managemnt - Use Cases
Privacy in Research Data Managemnt - Use Cases
 
EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017
 
Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...
Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...
Pistoia Alliance Debates: Text Mining for Pharma R&D in a Social World (17th ...
 

Viewers also liked

From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...Catherine Canevet
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnected Data World
 
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data Age
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data AgeSpark, Deep Learning and Life Sciences, Systems Biology in the Big Data Age
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data Agebatchinsights
 
Knowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic TechniquesKnowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic TechniquesConnected Data World
 
BioPAX for semantic web based data integration
BioPAX for semantic web based data integrationBioPAX for semantic web based data integration
BioPAX for semantic web based data integrationCatherine Canevet
 
Using the Ondex system for exploring Arabidopsis regulatory networks
Using the Ondex system for exploring Arabidopsis regulatory networksUsing the Ondex system for exploring Arabidopsis regulatory networks
Using the Ondex system for exploring Arabidopsis regulatory networksCatherine Canevet
 
Automatic extraction of microorganisms and their habitats from free text usin...
Automatic extraction of microorganisms and their habitats from free text usin...Automatic extraction of microorganisms and their habitats from free text usin...
Automatic extraction of microorganisms and their habitats from free text usin...Catherine Canevet
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisCatherine Canevet
 
The Ondex Data Integration Framework
The Ondex Data Integration FrameworkThe Ondex Data Integration Framework
The Ondex Data Integration Frameworkbosc
 
Creating an integrated Ondex knowledge base for comparative gene function ana...
Creating an integrated Ondex knowledge base for comparative gene function ana...Creating an integrated Ondex knowledge base for comparative gene function ana...
Creating an integrated Ondex knowledge base for comparative gene function ana...Catherine Canevet
 
Maven and google pharma r&d (1)
Maven and google pharma r&d  (1)Maven and google pharma r&d  (1)
Maven and google pharma r&d (1)Matt Barnes
 

Viewers also liked (12)

From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data Age
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data AgeSpark, Deep Learning and Life Sciences, Systems Biology in the Big Data Age
Spark, Deep Learning and Life Sciences, Systems Biology in the Big Data Age
 
Knowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic TechniquesKnowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic Techniques
 
BioPAX for semantic web based data integration
BioPAX for semantic web based data integrationBioPAX for semantic web based data integration
BioPAX for semantic web based data integration
 
Using the Ondex system for exploring Arabidopsis regulatory networks
Using the Ondex system for exploring Arabidopsis regulatory networksUsing the Ondex system for exploring Arabidopsis regulatory networks
Using the Ondex system for exploring Arabidopsis regulatory networks
 
Automatic extraction of microorganisms and their habitats from free text usin...
Automatic extraction of microorganisms and their habitats from free text usin...Automatic extraction of microorganisms and their habitats from free text usin...
Automatic extraction of microorganisms and their habitats from free text usin...
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
The Ondex Data Integration Framework
The Ondex Data Integration FrameworkThe Ondex Data Integration Framework
The Ondex Data Integration Framework
 
Creating an integrated Ondex knowledge base for comparative gene function ana...
Creating an integrated Ondex knowledge base for comparative gene function ana...Creating an integrated Ondex knowledge base for comparative gene function ana...
Creating an integrated Ondex knowledge base for comparative gene function ana...
 
Daniel Lee STAN
Daniel Lee STANDaniel Lee STAN
Daniel Lee STAN
 
Maven and google pharma r&d (1)
Maven and google pharma r&d  (1)Maven and google pharma r&d  (1)
Maven and google pharma r&d (1)
 

Similar to Enhancing Data Integration with Text Analysis to Find Genes Implicated in Plant Stress Response

Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMichel Dumontier
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003robertstevens65
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...Neuroscience Information Framework
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesBastian Greshake
 
Using the Micropublications ontology and the Open Annotation Data Model to re...
Using the Micropublications ontology and the Open Annotation Data Model to re...Using the Micropublications ontology and the Open Annotation Data Model to re...
Using the Micropublications ontology and the Open Annotation Data Model to re...jodischneider
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access DataSciSIG
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest GroupYaffa Rubinstien
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortData Science NIH
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics AakifahAmreen
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planC. Tobin Magle
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Ann-Marie Roche
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...Ian Foster
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Michel Dumontier
 
ELSS use cases and strategy
ELSS use cases and strategyELSS use cases and strategy
ELSS use cases and strategyAnton Yuryev
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...TERN Australia
 

Similar to Enhancing Data Integration with Text Analysis to Find Genes Implicated in Plant Stress Response (20)

KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
SciBite
SciBiteSciBite
SciBite
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association Studies
 
Using the Micropublications ontology and the Open Annotation Data Model to re...
Using the Micropublications ontology and the Open Annotation Data Model to re...Using the Micropublications ontology and the Open Annotation Data Model to re...
Using the Micropublications ontology and the Open Annotation Data Model to re...
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest Group
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevort
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management plan
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
ELSS use cases and strategy
ELSS use cases and strategyELSS use cases and strategy
ELSS use cases and strategy
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
 
KnetMiner - EBI Workshop 2017
KnetMiner - EBI Workshop 2017KnetMiner - EBI Workshop 2017
KnetMiner - EBI Workshop 2017
 

Enhancing Data Integration with Text Analysis to Find Genes Implicated in Plant Stress Response