BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update

D

The Evidence and Conclusion Ontology (ECO) describes types of evidence relevant to biological investigations. First developed in the early 2000s, ECO now consists of over 1700 defined classes and is used by a large, and growing, list of resources. ECO imports close to 1000 classes from the Ontology for Biomedical Investigations and the Gene Ontology for use in logical definitions. Historically, ECO terms have generally been categorized by either the biological context of the evidence (e.g. gene expression) or the technique used to generate the evidence (e.g. PCR-based evidence). The result is that sometimes terms that have related biological context are found under different unrelated nodes. To address this, we have been performing a rigorous review of the structure and logic of the branches of ECO. Working with additional input from collaborators through the issue tracker on GitHub, term labels, definitions, and relationships are being evaluated and updated. The goal of these changes is to increase the logical consistency of ECO, make it easier for users to find and understand terms, and allow for ECO to continue to grow and support its users. In addition to the structural review, we have been working with CollecTF to utilize ECO for automated text mining. To generate a curated corpus for this effort, we have been annotating ECO terms to sentences which contain evidence-based assertions about gene products, taxonomic entities, and sequence features. From this effort we have developed clearly-defined annotation guidelines that have been passed on to a team of undergraduates who are continuing the curation effort. Annotations are limited to single sentences, or to two consecutive sentences, containing the evidence instance and assertion clause. The quality of the mapping to ECO and the strength of the author’s assertion are also captured. ECO is freely available at http://evidenceontology.org/ and https://github.com/evidenceontology.

The Evidence and Conclusion Ontology systematically
describes scientific evidence types that support biological
assertions. ECO is structured around two root classes:
'evidence' and 'assertion method’. Terms describing types of
evidence are grouped under 'evidence’, while the 'assertion
method', provides a mechanism for recording if a particular
assertion was made by a human or in an automated fashion.
ECO supports >20 user groups with their annotation
efforts, e.g. UniProt-Gene Ontology Annotation1
(UniProt-
GOA) has >628 million evidence-linked GO annotations2.
ECO is released into the public domain under CC0 1.0
Universal (CC0 1.0) license.
James B. Munro1, Elizabeth T. Hobbs2, Suvarna Nadendla1*, Rebecca C. Tauber1*, Stephen
Goralski2, Ivan Erill2, Marcus C. Chibucos1, & Michelle Giglio1
1Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD
2Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD
*Contact: email - rctauber@gmail.com; snadendla@som.umaryland.edu
Abstract: The Evidence and Conclusion Ontology (ECO) describes types of evidence relevant to biological investigations. First developed in the early 2000s, ECO now
consists of over 1700 defined classes and is used by a large, and growing, list of resources. ECO imports close to 1000 classes from the Ontology for Biomedical
Investigations and the Gene Ontology for use in logical definitions. Historically, ECO terms have generally been categorized by either the biological context of the
evidence (e.g. gene expression) or the technique used to generate the evidence (e.g. PCR-based evidence). The result is that sometimes terms that have related
biological context are found under different unrelated nodes. To address this, we have been performing a rigorous review of the structure and logic of the branches of
ECO. Working with additional input from collaborators through the issue tracker on GitHub, term labels, definitions, and relationships are being evaluated and updated.
The goal of these changes is to increase the logical consistency of ECO, make it easier for users to find and understand terms, and allow for ECO to continue to grow
and support its users. In addition to the structural review, we have been working with CollecTF to utilize ECO for automated text mining. To generate a curated corpus for
this effort, we have been annotating ECO terms to sentences which contain evidence-based assertions about gene products, taxonomic entities, and sequence features.
From this effort we have developed clearly-defined annotation guidelines that have been passed on to a team of undergraduates who are continuing the curation effort.
Annotations are limited to single sentences, or to two consecutive sentences, containing the evidence instance and assertion clause. The quality of the mapping to ECO
and the strength of the author’s assertion are also captured. ECO is freely available at http://evidenceontology.org/ and https://github.com/evidenceontology.
/evidenceontology
Thank you to our collaborators and various user groups for supporting
the growth of ECO.
Collaborations:
ECO is supported by the National Science
Foundation (NSF) Division of Biological
Infrastructure (DBI) under Award Number 1458400.
Find us at http://evidenceontology.org/
1. E.C. Dimmer, R.P. Huntley, Y. Alam-Faruque, T. Sawford, C. O'Donovan, M.J.
Martinet, … R. Apweiler. (2012). The UniProt-GO Annotation database in 2011.
Nucleic Acids Res., 40, D565–D570.
2. M.C. Chibucos, D.A. Siegele, J.C. Hu, M. Giglio (2017). The Evidence and Conclusion
Ontology (ECO): Supporting GO Annotations. Methods in Mol. Biol., 1446, 245-
259.
3. S. Kilic, E.R. White, D.M. Sagitova, J.P. Cornish, & I. Erill. (2014). CollecTF: A
database of experimentally validated transcription factor-binding sites in bacteria.
Nucleic Acids Res., 42, D156-D160.
4. The Gene Ontology Consortium. (2015). Gene Ontology Consortium: going forward.
Nucleic Acids Research, 43, D1049-D1056.
5. A. Bandrowski, R. Brinkman, M. Brochhausen, M.H. Brush, B. Bug, M.C. Chibucos. et
al. (2016). The Ontology for Biomedical Investigations, PLoS One, 11(4):e0154556.
6. L.M. Schriml, E. Mitraka, J. Munro, B. Tauber, M. Schor, L. Nickle, V. Felix, Li. Jeng,
C. Bearer. et al. Human Disease Ontology 2018 update: classification, content and
workflow expansion, Nucleic Acids Research, Volume 47, Issue D1, 08 January
2019, Pages D955–D962.
7. M.C. Chibuocos, A.E. Zweifel, J.C. Herrera, W. Meza, S. Eslamfam, P. Uetz, … M.G.
Giglio. (2014). An ontology for microbial phenotypes. BMC Microbiology, 14, 294.
8. Wikidata. https://www.wikidata.org/wiki/Wikidata:Main_Page
• Currently, there are 1760 terms in ECO. All the terms have
textual definitions.
• 1339 ECO terms have logical definitions. Of these, 186 have
logical definitions that link out to other vocabularies such as the
GO4 and the OBI5, 1147 terms have logical definitions linking the
class to an ECO assertion method, and 6 terms have logical
definitions linking to other internal class.
Future direction
• Continue to work with our collaborators.
• Collaboration with Confidence Information Ontology for
expanding the model of capturing confidence information.
The Human Disease Ontology6
, to
incorporate classes representing
definition sources.
The Ontology for Microbial Phenotypes7
,
to expand classes for phenotype
annotations.
The Ontology for Biomedical
Investigations5, to complete the
harmonization project.
The Gene Ontology4
, to continue
support representing evidence in
gene products annotations.
Wikidata8, to support annotations of
genes, proteins and diseases in it’s
structured data storage repository.
Increased Logical Consistency
Node Expansion
We have been working with CollecTF3
to utilize ECO for an automated text mining effort. As a part of this project, a
curated corpus of high quality experimental evidence annotations consisting of gene products, sequence feature,
phenotype, and taxonomy/phylogeny, etc. is generated from sentences in scientific articles. This corpus is used as
an annotated training set for building an automated text mining model.
Guidelines for annotation
Annotation process
Interactive Text Mining (Future plan)
Before
After
Inter-Annotator Agreement
Kappa Equation :
Ao = observed agreement; Ae = expected agreement
deprecated

Recommended

ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog... by
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...dolleyj
2.3K views1 slide
The Gene Ontology & Gene Ontology Annotation resources by
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesMelanie Courtot
5.8K views36 slides
UniProt-GOA by
UniProt-GOAUniProt-GOA
UniProt-GOAEBI
1.8K views40 slides
Light Intro to the Gene Ontology by
Light Intro to the Gene OntologyLight Intro to the Gene Ontology
Light Intro to the Gene Ontologynniiicc
985 views29 slides
Ontologies for life sciences: examples from the gene ontology by
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
2.2K views154 slides
Model repositories and standard formats for model reusability by
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityUniversity Medicine Greifswald
153 views23 slides

More Related Content

What's hot

Gene Ontology WormBase Workshop International Worm Meeting 2015 by
Gene Ontology WormBase Workshop International Worm Meeting 2015Gene Ontology WormBase Workshop International Worm Meeting 2015
Gene Ontology WormBase Workshop International Worm Meeting 2015raymond91105
754 views18 slides
Developing Frameworks and Tools for Animal Trait Ontology (ATO) by
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Jie Bao
518 views1 slide
GI 2013 - ENCODE Project Data Access via RESTful API and JSON by
GI 2013 - ENCODE Project Data Access via RESTful API and JSONGI 2013 - ENCODE Project Data Access via RESTful API and JSON
GI 2013 - ENCODE Project Data Access via RESTful API and JSONENCODE-DCC
617 views1 slide
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli... by
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...AIMS (Agricultural Information Management Standards)
867 views17 slides
US2TS presentation on Gene Ontology by
US2TS presentation on Gene OntologyUS2TS presentation on Gene Ontology
US2TS presentation on Gene OntologyChris Mungall
357 views15 slides
Experiences in the biosciences with the open biological ontologies foundry an... by
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Chris Mungall
441 views25 slides

What's hot(20)

Gene Ontology WormBase Workshop International Worm Meeting 2015 by raymond91105
Gene Ontology WormBase Workshop International Worm Meeting 2015Gene Ontology WormBase Workshop International Worm Meeting 2015
Gene Ontology WormBase Workshop International Worm Meeting 2015
raymond91105754 views
Developing Frameworks and Tools for Animal Trait Ontology (ATO) by Jie Bao
Developing Frameworks and Tools for Animal Trait Ontology (ATO) Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Developing Frameworks and Tools for Animal Trait Ontology (ATO)
Jie Bao518 views
GI 2013 - ENCODE Project Data Access via RESTful API and JSON by ENCODE-DCC
GI 2013 - ENCODE Project Data Access via RESTful API and JSONGI 2013 - ENCODE Project Data Access via RESTful API and JSON
GI 2013 - ENCODE Project Data Access via RESTful API and JSON
ENCODE-DCC617 views
US2TS presentation on Gene Ontology by Chris Mungall
US2TS presentation on Gene OntologyUS2TS presentation on Gene Ontology
US2TS presentation on Gene Ontology
Chris Mungall357 views
Experiences in the biosciences with the open biological ontologies foundry an... by Chris Mungall
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...
Chris Mungall441 views
Introduction to Ontologies for Environmental Biology by Barry Smith
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
Barry Smith5K views
Bio-ontologies in bioinformatics: Growing up challenges by Janna Hastings
Bio-ontologies in bioinformatics: Growing up challengesBio-ontologies in bioinformatics: Growing up challenges
Bio-ontologies in bioinformatics: Growing up challenges
Janna Hastings904 views
Nowomics at Cambridge Open Research by Nowomics
Nowomics at Cambridge Open ResearchNowomics at Cambridge Open Research
Nowomics at Cambridge Open Research
Nowomics298 views
Ontology Development Kit: Bio-Ontologies 2019 by Chris Mungall
Ontology Development Kit: Bio-Ontologies 2019Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019
Chris Mungall816 views
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ... by Cyndy Parr
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Cyndy Parr5K views
GloBI Status Update 23 May 2013 by jhpoelen245
GloBI Status Update 23 May 2013GloBI Status Update 23 May 2013
GloBI Status Update 23 May 2013
jhpoelen2455.3K views
Can machines understand the scientific literature by petermurrayrust
Can machines understand the scientific literatureCan machines understand the scientific literature
Can machines understand the scientific literature
petermurrayrust1K views
EOL and Science: Yes we can! by Cyndy Parr
EOL and Science: Yes we can!EOL and Science: Yes we can!
EOL and Science: Yes we can!
Cyndy Parr347 views
Biot6838 2016 fall presentation by ciakov
Biot6838 2016 fall presentationBiot6838 2016 fall presentation
Biot6838 2016 fall presentation
ciakov218 views
Content Mining of Science in Cambridge by TheContentMine
Content Mining of Science in CambridgeContent Mining of Science in Cambridge
Content Mining of Science in Cambridge
TheContentMine181 views

Similar to BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update

Prosdocimi ucb cdao by
Prosdocimi ucb cdaoProsdocimi ucb cdao
Prosdocimi ucb cdaoFrancisco Prosdocimi
471 views31 slides
Web Apollo: Lessons learned from community-based biocuration efforts. by
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Monica Munoz-Torres
1K views20 slides
How Bio Ontologies Enable Open Science by
How Bio Ontologies Enable Open ScienceHow Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open Sciencedrnigam
1.4K views46 slides
www.ijerd.com by
www.ijerd.comwww.ijerd.com
www.ijerd.comIJERD Editor
594 views10 slides
Collaboratively Creating the Knowledge Graph of Life by
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeChris Mungall
238 views49 slides
bioinformatics enabling knowledge generation from agricultural omics data by
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics dataInternational Institute of Tropical Agriculture
912 views50 slides

Similar to BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update(20)

Web Apollo: Lessons learned from community-based biocuration efforts. by Monica Munoz-Torres
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.
How Bio Ontologies Enable Open Science by drnigam
How Bio Ontologies Enable Open ScienceHow Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open Science
drnigam1.4K views
Collaboratively Creating the Knowledge Graph of Life by Chris Mungall
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of Life
Chris Mungall238 views
Building a Network of Interoperable and Independently Produced Linked and Ope... by Michel Dumontier
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
Michel Dumontier581 views
Computing on the shoulders of giants by Benjamin Good
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
Benjamin Good723 views
Essential Requirements for Community Annotation Tools by Monica Munoz-Torres
Essential Requirements for Community Annotation ToolsEssential Requirements for Community Annotation Tools
Essential Requirements for Community Annotation Tools
Semantics for Bioinformatics: What, Why and How of Search, Integration and An... by Amit Sheth
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Amit Sheth1.9K views
Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res... by Trish Whetzel
Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...
Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...
Trish Whetzel252 views
The W3C PROV standard: data model for the provenance of information, and enab... by Paolo Missier
The W3C PROV standard:data model for the provenance of information, and enab...The W3C PROV standard:data model for the provenance of information, and enab...
The W3C PROV standard: data model for the provenance of information, and enab...
Paolo Missier3.3K views
AB3ACBS 2016: EMBL Australia Bioinformatics Resource by Philippa Griffin
AB3ACBS 2016: EMBL Australia Bioinformatics ResourceAB3ACBS 2016: EMBL Australia Bioinformatics Resource
AB3ACBS 2016: EMBL Australia Bioinformatics Resource
Philippa Griffin236 views
Eko Artificial Life, Determinacy of Ecological Resilience and Classification ... by ijtsrd
Eko Artificial Life, Determinacy of Ecological Resilience and Classification ...Eko Artificial Life, Determinacy of Ecological Resilience and Classification ...
Eko Artificial Life, Determinacy of Ecological Resilience and Classification ...
ijtsrd46 views

Recently uploaded

Structure and Functions of Cell.pdf by
Structure and Functions of Cell.pdfStructure and Functions of Cell.pdf
Structure and Functions of Cell.pdfNithya Murugan
719 views10 slides
Monthly Information Session for MV Asterix (November) by
Monthly Information Session for MV Asterix (November)Monthly Information Session for MV Asterix (November)
Monthly Information Session for MV Asterix (November)Esquimalt MFRC
72 views26 slides
Recap of our Class by
Recap of our ClassRecap of our Class
Recap of our ClassCorinne Weisgerber
88 views15 slides
ICS3211_lecture 08_2023.pdf by
ICS3211_lecture 08_2023.pdfICS3211_lecture 08_2023.pdf
ICS3211_lecture 08_2023.pdfVanessa Camilleri
231 views30 slides
MIXING OF PHARMACEUTICALS.pptx by
MIXING OF PHARMACEUTICALS.pptxMIXING OF PHARMACEUTICALS.pptx
MIXING OF PHARMACEUTICALS.pptxAnupkumar Sharma
95 views35 slides
Use of Probiotics in Aquaculture.pptx by
Use of Probiotics in Aquaculture.pptxUse of Probiotics in Aquaculture.pptx
Use of Probiotics in Aquaculture.pptxAKSHAY MANDAL
119 views15 slides

Recently uploaded(20)

Structure and Functions of Cell.pdf by Nithya Murugan
Structure and Functions of Cell.pdfStructure and Functions of Cell.pdf
Structure and Functions of Cell.pdf
Nithya Murugan719 views
Monthly Information Session for MV Asterix (November) by Esquimalt MFRC
Monthly Information Session for MV Asterix (November)Monthly Information Session for MV Asterix (November)
Monthly Information Session for MV Asterix (November)
Esquimalt MFRC72 views
Use of Probiotics in Aquaculture.pptx by AKSHAY MANDAL
Use of Probiotics in Aquaculture.pptxUse of Probiotics in Aquaculture.pptx
Use of Probiotics in Aquaculture.pptx
AKSHAY MANDAL119 views
Psychology KS4 by WestHatch
Psychology KS4Psychology KS4
Psychology KS4
WestHatch98 views
Narration lesson plan by TARIQ KHAN
Narration lesson planNarration lesson plan
Narration lesson plan
TARIQ KHAN61 views
Ch. 8 Political Party and Party System.pptx by Rommel Regala
Ch. 8 Political Party and Party System.pptxCh. 8 Political Party and Party System.pptx
Ch. 8 Political Party and Party System.pptx
Rommel Regala54 views
ISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks Effectively by PECB
ISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks EffectivelyISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks Effectively
ISO/IEC 27001 and ISO/IEC 27005: Managing AI Risks Effectively
PECB 623 views
Sociology KS5 by WestHatch
Sociology KS5Sociology KS5
Sociology KS5
WestHatch85 views
The basics - information, data, technology and systems.pdf by JonathanCovena1
The basics - information, data, technology and systems.pdfThe basics - information, data, technology and systems.pdf
The basics - information, data, technology and systems.pdf
JonathanCovena1146 views
How to empty an One2many field in Odoo by Celine George
How to empty an One2many field in OdooHow to empty an One2many field in Odoo
How to empty an One2many field in Odoo
Celine George87 views
Education and Diversity.pptx by DrHafizKosar
Education and Diversity.pptxEducation and Diversity.pptx
Education and Diversity.pptx
DrHafizKosar193 views
Ch. 7 Political Participation and Elections.pptx by Rommel Regala
Ch. 7 Political Participation and Elections.pptxCh. 7 Political Participation and Elections.pptx
Ch. 7 Political Participation and Elections.pptx
Rommel Regala111 views

BioCuration 2019 - Evidence and Conclusion Ontology 2019 Update

  • 1. The Evidence and Conclusion Ontology systematically describes scientific evidence types that support biological assertions. ECO is structured around two root classes: 'evidence' and 'assertion method’. Terms describing types of evidence are grouped under 'evidence’, while the 'assertion method', provides a mechanism for recording if a particular assertion was made by a human or in an automated fashion. ECO supports >20 user groups with their annotation efforts, e.g. UniProt-Gene Ontology Annotation1 (UniProt- GOA) has >628 million evidence-linked GO annotations2. ECO is released into the public domain under CC0 1.0 Universal (CC0 1.0) license. James B. Munro1, Elizabeth T. Hobbs2, Suvarna Nadendla1*, Rebecca C. Tauber1*, Stephen Goralski2, Ivan Erill2, Marcus C. Chibucos1, & Michelle Giglio1 1Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 2Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD *Contact: email - rctauber@gmail.com; snadendla@som.umaryland.edu Abstract: The Evidence and Conclusion Ontology (ECO) describes types of evidence relevant to biological investigations. First developed in the early 2000s, ECO now consists of over 1700 defined classes and is used by a large, and growing, list of resources. ECO imports close to 1000 classes from the Ontology for Biomedical Investigations and the Gene Ontology for use in logical definitions. Historically, ECO terms have generally been categorized by either the biological context of the evidence (e.g. gene expression) or the technique used to generate the evidence (e.g. PCR-based evidence). The result is that sometimes terms that have related biological context are found under different unrelated nodes. To address this, we have been performing a rigorous review of the structure and logic of the branches of ECO. Working with additional input from collaborators through the issue tracker on GitHub, term labels, definitions, and relationships are being evaluated and updated. The goal of these changes is to increase the logical consistency of ECO, make it easier for users to find and understand terms, and allow for ECO to continue to grow and support its users. In addition to the structural review, we have been working with CollecTF to utilize ECO for automated text mining. To generate a curated corpus for this effort, we have been annotating ECO terms to sentences which contain evidence-based assertions about gene products, taxonomic entities, and sequence features. From this effort we have developed clearly-defined annotation guidelines that have been passed on to a team of undergraduates who are continuing the curation effort. Annotations are limited to single sentences, or to two consecutive sentences, containing the evidence instance and assertion clause. The quality of the mapping to ECO and the strength of the author’s assertion are also captured. ECO is freely available at http://evidenceontology.org/ and https://github.com/evidenceontology. /evidenceontology Thank you to our collaborators and various user groups for supporting the growth of ECO. Collaborations: ECO is supported by the National Science Foundation (NSF) Division of Biological Infrastructure (DBI) under Award Number 1458400. Find us at http://evidenceontology.org/ 1. E.C. Dimmer, R.P. Huntley, Y. Alam-Faruque, T. Sawford, C. O'Donovan, M.J. Martinet, … R. Apweiler. (2012). The UniProt-GO Annotation database in 2011. Nucleic Acids Res., 40, D565–D570. 2. M.C. Chibucos, D.A. Siegele, J.C. Hu, M. Giglio (2017). The Evidence and Conclusion Ontology (ECO): Supporting GO Annotations. Methods in Mol. Biol., 1446, 245- 259. 3. S. Kilic, E.R. White, D.M. Sagitova, J.P. Cornish, & I. Erill. (2014). CollecTF: A database of experimentally validated transcription factor-binding sites in bacteria. Nucleic Acids Res., 42, D156-D160. 4. The Gene Ontology Consortium. (2015). Gene Ontology Consortium: going forward. Nucleic Acids Research, 43, D1049-D1056. 5. A. Bandrowski, R. Brinkman, M. Brochhausen, M.H. Brush, B. Bug, M.C. Chibucos. et al. (2016). The Ontology for Biomedical Investigations, PLoS One, 11(4):e0154556. 6. L.M. Schriml, E. Mitraka, J. Munro, B. Tauber, M. Schor, L. Nickle, V. Felix, Li. Jeng, C. Bearer. et al. Human Disease Ontology 2018 update: classification, content and workflow expansion, Nucleic Acids Research, Volume 47, Issue D1, 08 January 2019, Pages D955–D962. 7. M.C. Chibuocos, A.E. Zweifel, J.C. Herrera, W. Meza, S. Eslamfam, P. Uetz, … M.G. Giglio. (2014). An ontology for microbial phenotypes. BMC Microbiology, 14, 294. 8. Wikidata. https://www.wikidata.org/wiki/Wikidata:Main_Page • Currently, there are 1760 terms in ECO. All the terms have textual definitions. • 1339 ECO terms have logical definitions. Of these, 186 have logical definitions that link out to other vocabularies such as the GO4 and the OBI5, 1147 terms have logical definitions linking the class to an ECO assertion method, and 6 terms have logical definitions linking to other internal class. Future direction • Continue to work with our collaborators. • Collaboration with Confidence Information Ontology for expanding the model of capturing confidence information. The Human Disease Ontology6 , to incorporate classes representing definition sources. The Ontology for Microbial Phenotypes7 , to expand classes for phenotype annotations. The Ontology for Biomedical Investigations5, to complete the harmonization project. The Gene Ontology4 , to continue support representing evidence in gene products annotations. Wikidata8, to support annotations of genes, proteins and diseases in it’s structured data storage repository. Increased Logical Consistency Node Expansion We have been working with CollecTF3 to utilize ECO for an automated text mining effort. As a part of this project, a curated corpus of high quality experimental evidence annotations consisting of gene products, sequence feature, phenotype, and taxonomy/phylogeny, etc. is generated from sentences in scientific articles. This corpus is used as an annotated training set for building an automated text mining model. Guidelines for annotation Annotation process Interactive Text Mining (Future plan) Before After Inter-Annotator Agreement Kappa Equation : Ao = observed agreement; Ae = expected agreement deprecated