This document discusses using ontologies and automated reasoning to classify proteins and cell types. It describes how protein phosphatases and cell types were classified by defining classes in terms of their functional domains/features. Reasoning over the ontologies inferred new subclass relationships and allowed automated classification of instances based on their described features. Normalizing tangled ontologies into separate modules for different classification axes was shown to produce cleaner models that enabled more flexible querying.
An Ontology-Based Intelligent Speed Adaptation System for Autonomous CarsLihua Zhao
Intelligent Speed Adaptation (ISA) is one of the key tech- nologies for Advanced Driver Assistance Systems (ADAS), which aims to reduce car accidents by supporting drivers to comply with the speed limit. Context awareness is indispensable for autonomous cars to perceive driving environment, where the information should be represented in a machine-understandable format. Ontologies can represent knowledge in a format that machines can understand and perform human-like reason- ing. In this paper, we present an ontology-based ISA system that can detect overspeed situations by accessing to the ontology-based Knowl- edge Base (KB). We conducted experiments on a car simulator as well as on real-world data collected with an intelligent car. Sensor data are converted into RDF stream data and we construct SPARQL queries and a C-SPARQL query to access to the Knowledge Base. Experimental re- sults show that the ISA system can promptly detect overspeed situations by accessing to the ontology-based Knowledge Base.
An Ontology-Based Intelligent Speed Adaptation System for Autonomous CarsLihua Zhao
Intelligent Speed Adaptation (ISA) is one of the key tech- nologies for Advanced Driver Assistance Systems (ADAS), which aims to reduce car accidents by supporting drivers to comply with the speed limit. Context awareness is indispensable for autonomous cars to perceive driving environment, where the information should be represented in a machine-understandable format. Ontologies can represent knowledge in a format that machines can understand and perform human-like reason- ing. In this paper, we present an ontology-based ISA system that can detect overspeed situations by accessing to the ontology-based Knowl- edge Base (KB). We conducted experiments on a car simulator as well as on real-world data collected with an intelligent car. Sensor data are converted into RDF stream data and we construct SPARQL queries and a C-SPARQL query to access to the Knowledge Base. Experimental re- sults show that the ISA system can promptly detect overspeed situations by accessing to the ontology-based Knowledge Base.
Epidermal Growth Factor use in Diabetic Foot UlcersG H PRABHU
EGF is used by many diabetologists to treat Diabetic Foot Ulcers. If used early EGF can prevent progression of DFU resulting in amputations and related morbidity and mortality.
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZαA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration.
The transformants were screened for phenotypic characters.Mut+
phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will
surely provide cost effective alternative to mammalian system forrt-PA production.
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a way to get native tPA protein sequence in subsequent purification steps
Cloning and extracellular expression of recombinant tissue plasminogen activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZαA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration. The transformants were screened for phenotypic characters.Mut+
phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will surely provide cost effective alternative to mammalian system forrt-PA production
Cloning and extracellular expression of recombinant tissue plasminogen activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration.
The transformants were screened for phenotypic characters.Mut+phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will
surely provide cost effective alternative to mammalian system forrt-PA production.
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZαA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration.
The transformants were screened for phenotypic characters.Mut+
phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will
surely provide cost effective alternative to mammalian system forrt-PA production.
1. Mode-of-Action: What is it and what is it good for
2. MOA Examples: Thyroid hormone disruption, thyroid cancer and developmental neurotoxicity
3. Thyroid 101: Hormones, homeostasis and signaling
4. Sites-of-action for TDCs
5. Relevancy of animal data to humans
6. How can MOA help identify research needs
7. Developmental neurotoxicity of TDCs
8. Acute neurotoxicity of pyrethroids
9. How can thinking of MOA help with screening for toxicity
Epidermal Growth Factor use in Diabetic Foot UlcersG H PRABHU
EGF is used by many diabetologists to treat Diabetic Foot Ulcers. If used early EGF can prevent progression of DFU resulting in amputations and related morbidity and mortality.
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZαA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration.
The transformants were screened for phenotypic characters.Mut+
phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will
surely provide cost effective alternative to mammalian system forrt-PA production.
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a way to get native tPA protein sequence in subsequent purification steps
Cloning and extracellular expression of recombinant tissue plasminogen activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZαA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration. The transformants were screened for phenotypic characters.Mut+
phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will surely provide cost effective alternative to mammalian system forrt-PA production
Cloning and extracellular expression of recombinant tissue plasminogen activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration.
The transformants were screened for phenotypic characters.Mut+phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will
surely provide cost effective alternative to mammalian system forrt-PA production.
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...bioejjournal
Tissue plasminogen activator (tPA) has noteworthy application in treatment of acute myocardial
infarctions. This study focuses on expression of rt-PA using microbial systems in order to reduce cost
without compromising on quality as an alternative to commercial (rt-PA)produced by using mammalian
host systems. In the present study, Pichia pastoris X-33strain was used as a host with pICZαA expression
vector to obtain extracellular expression of full length tPA gene. Specific primers were designed in such a
way to get native tPA protein sequence in subsequent purification steps. Further, construct pICZαA-tPA
was developed and electroporated into host to achieve stablert-PA gene by achieving genome integration.
The transformants were screened for phenotypic characters.Mut+
phenotypic colony named Pichia tPA-3
showed expression of rt-PA at 66 kDa on SDS PAGE. Size Exclusion Chromatography (SEC) was
performed, resulting in product peak at same RT as reference standard. (alteplase).Cloning and expression
of rt-PA was successfully achieved in microbial system. Further process optimization at larger scales will
surely provide cost effective alternative to mammalian system forrt-PA production.
1. Mode-of-Action: What is it and what is it good for
2. MOA Examples: Thyroid hormone disruption, thyroid cancer and developmental neurotoxicity
3. Thyroid 101: Hormones, homeostasis and signaling
4. Sites-of-action for TDCs
5. Relevancy of animal data to humans
6. How can MOA help identify research needs
7. Developmental neurotoxicity of TDCs
8. Acute neurotoxicity of pyrethroids
9. How can thinking of MOA help with screening for toxicity
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...robertstevens65
A talk on OBOPedia (HTTP://www.obopedia.org.uk) given at Semantic Web Applicaitons and Tools for Life Sciences (SWAT4Ls) 2015 in cambridge, UK December 2015
Issues and activities in authoring ontologiesrobertstevens65
Departmental seminar at Department of Computer Science, university of Birmingham, 6 November, 2014
abstract: Ontologies are complex knowledge representation artefacts used across biomedical sciences, the media and other domains for defining terminologies and providing metadata. Their use is increasing rapidly, but so far, ontology authoring tools have not benefited from empirical research into the ontology authoring process. Understanding how people build ontologies is key to developing tools that can properly support common authoring activities. In this talk I will first present the outcomes of qualative interviews with ontology authors and the issues it reveals. Second, I will present the results of a study that identifies common activity patterns through analysis of the event logs, screen capture and eye-tracking data collected from the popular authoring tool, Protege. Results from this bottom-up investigation suggest that the class hierarchy is the central focus of activity, playing a role beyond simple class representation. We also find that checking how updates to the ontology is hard and performance is hindered by inadequate support in the user interface. From this investigation we propose design guidelines for bulk editing, efficient reasoning and increased situational awareness in ontology authoring.
The state of the nation for ontology developmentrobertstevens65
Invited talk at European Ontology Network (EUON) 2014
Ontologies are now quite big, both literally and metaphorically. They have become central resources in disciplines such as biology, medicine, healthcare and others. Such developments rely on people, tools and methods to deliver ontologies that do the desired job, on-time and on-budget. In this talk I wil ask the question of whether the tools and methods we have are capable of doing what is necessary to deliver robust and maintainable ontologies. To explore this question I will borro from the Capability Maturity Model used to assess the capabilities of institutions to deliver software projects. Instead of institutional assessment, I will bend the CCM to the discipline of ontology engineering. The levels of the CMM range from the ad hoc to one where metrics are used to monitor and adjust ontology development. In this talk I will use some audience participation to gather views on ontology engineering maturity level and then deliver my own view of that maturity.
Properties and Individuals in OWL: Reasoning About Family Historyrobertstevens65
Slides used in an advanced OWL tutorial in 2012. The tutorial is based on family history and uses OWL individuals as a first class citizen in the learning.
Toxic effects of heavy metals : Lead and Arsenicsanjana502982
Heavy metals are naturally occuring metallic chemical elements that have relatively high density, and are toxic at even low concentrations. All toxic metals are termed as heavy metals irrespective of their atomic mass and density, eg. arsenic, lead, mercury, cadmium, thallium, chromium, etc.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Sérgio Sacani
Since volcanic activity was first discovered on Io from Voyager images in 1979, changes
on Io’s surface have been monitored from both spacecraft and ground-based telescopes.
Here, we present the highest spatial resolution images of Io ever obtained from a groundbased telescope. These images, acquired by the SHARK-VIS instrument on the Large
Binocular Telescope, show evidence of a major resurfacing event on Io’s trailing hemisphere. When compared to the most recent spacecraft images, the SHARK-VIS images
show that a plume deposit from a powerful eruption at Pillan Patera has covered part
of the long-lived Pele plume deposit. Although this type of resurfacing event may be common on Io, few have been detected due to the rarity of spacecraft visits and the previously low spatial resolution available from Earth-based telescopes. The SHARK-VIS instrument ushers in a new era of high resolution imaging of Io’s surface using adaptive
optics at visible wavelengths.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
The ability to recreate computational results with minimal effort and actionable metrics provides a solid foundation for scientific research and software development. When people can replicate an analysis at the touch of a button using open-source software, open data, and methods to assess and compare proposals, it significantly eases verification of results, engagement with a diverse range of contributors, and progress. However, we have yet to fully achieve this; there are still many sociotechnical frictions.
Inspired by David Donoho's vision, this talk aims to revisit the three crucial pillars of frictionless reproducibility (data sharing, code sharing, and competitive challenges) with the perspective of deep software variability.
Our observation is that multiple layers — hardware, operating systems, third-party libraries, software versions, input data, compile-time options, and parameters — are subject to variability that exacerbates frictions but is also essential for achieving robust, generalizable results and fostering innovation. I will first review the literature, providing evidence of how the complex variability interactions across these layers affect qualitative and quantitative software properties, thereby complicating the reproduction and replication of scientific studies in various fields.
I will then present some software engineering and AI techniques that can support the strategic exploration of variability spaces. These include the use of abstractions and models (e.g., feature models), sampling strategies (e.g., uniform, random), cost-effective measurements (e.g., incremental build of software configurations), and dimensionality reduction methods (e.g., transfer learning, feature selection, software debloating).
I will finally argue that deep variability is both the problem and solution of frictionless reproducibility, calling the software science community to develop new methods and tools to manage variability and foster reproducibility in software systems.
Exposé invité Journées Nationales du GDR GPL 2024
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...Scintica Instrumentation
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed tissue imaging, IVM allows for the ultra-fast high-resolution imaging of cellular processes over time and space and were studied in its natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provide insights into the progression of disease, response to treatments or developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system’s unique features and user-friendly software enables researchers to probe fast dynamic biological processes such as immune cell tracking, cell-cell interaction as well as vascularization and tumor metastasis with exceptional detail. This webinar will also give an overview of IVM being utilized in drug development, offering a view into the intricate interaction between drugs/nanoparticles and tissues in vivo and allows for the evaluation of therapeutic intervention in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancements of novel therapeutic strategies.
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Studia Poinsotiana
I Introduction
II Subalternation and Theology
III Theology and Dogmatic Declarations
IV The Mixed Principles of Theology
V Virtual Revelation: The Unity of Theology
VI Theology as a Natural Science
VII Theology’s Certitude
VIII Conclusion
Notes
Bibliography
All the contents are fully attributable to the author, Doctor Victor Salas. Should you wish to get this text republished, get in touch with the author or the editorial committee of the Studia Poinsotiana. Insofar as possible, we will be happy to broker your contact.
1. Making Semantics do Some
Work
Robert Stevens
BioHealth Informatics Group
School of Computer Science
University of Manchester
Robert.Stevens@manchester.ac.uk
2. Introduction
• What’s the use of highly axiomatised
ontological descriptions?
• Two use cases:
• Classifying instances based on features:
New discoveries;
• Building a complex terminology.
• Cost and benefit.
• Conclusions
3. Protein Classification
• Proteins divided into broad functional classes
“Protein Families”
• Families sub-divided to give family
classifications
• Class membership can be determined by
“protein features”, such as domains, etc.
• Resources exist for feature detection via
primary sequence– but not class membership
• Current Limitation of Automated Tools
• Needs human knowledge to recognise class
membership
4. Finding Domains on a
Sequence
A search of the linear sequence of protein tyrosine
phosphatase type K – identified 9 functional
domains
>uniprot|Q15262|PTPK_HUMAN Receptor-type protein-tyrosine phosphatase kappa precursor (EC
3.1.3.48) (R-PTP-kappa).
MDTTAAAALPAFVALLLLSPWPLLGSAQGQFSAGGCTFDDGPGACDYHQDLYDDFEWVHV
SAQEPHYLPPEMPQGSYMIVDSSDHDPGEKARLQLPTMKENDTHCIDFSYLLYSQKGLNP
GTLNILVRVNKGPLANPIWNVTGFTGRDWLRAELAVSSFWPNEYQVIFEAEVSGGRSGYI
AIDDIQVLSYPCDKSPHFLRLGDVEVNAGQNATFQCIATGRDAVHNKLWLQRRNGEDIPV………..
5. Why Classify?
• Classification and curation of a genome is the
first step in understanding the processes and
functions happening in an organism
• Classification enables comparative genomic
studies - what is already known in other
organisms
• The similarities and differences between
processes and functions in related organisms
often provide the greatest insight into the
biology
• In silico characterisation is the current
bottleneck
6. Phosphatase Classification
• Diagnostic phosphatase domains/motifs – sufficient for
membership of the protein phosphatase superfamily
• Any protein having a phosphatase domain is a
member of the phosphatase super-family
• Other motifs determine a protein’s place within the
family
• Usually needs human to recognise that features
detected imply class membership
• Can these be captured in an ontology?
8. Necessity and Sufficiency
• An R2A phosphatase must have a fibronectin domain
• Having a fibronectin domain does not a phosphatase
make
• Necessity -- what must a class instance have?
• Any protein that has a phosphatase catalytic domain is a
phosphatase enzyme
• All phosphatase enzymes have a catalytic domain
• Sufficiency – how is an instance recognised to be a
member of a class?
10. …there are known knowns; there are things
we know we know. We also know there are
known unknowns; that is to say we know there
are some things we do not know. But there
are also unknown unknowns -- the ones we
don't know we don't know.
11. Definition for R2A Phosphatase
Class: R2A
EquivalentTO: Protein That
- contains 2 ProteinTyrosinePhosphataseDomain and
- contains 1 TransmembraneDomain and
- contains 4 FibronectinDomains and
- contains 1 ImmunoglobulinDomain and
- contains 1 MAMDomain and
- contains 1 Cadherin-LikeDomain and
- contains only TyrosinePhosphataseDomain or
TransmembraneDomain or FibronectinDomain or
ImnunoglobulinDomain or Clathrin-LikeDomain or
ManDomain
12. Automated Reasoning
• An OWL-DL ontology mapped to its DL form
as a collection of axioms
• An automated reasoner checks for
satisfiability – throws out the inconsistent
and infers subsumption
• Defined classes (where there are necessary
and sufficient restrictions) enable a reasoner
to infer subclass axioms
• Also infer to which class an individual
belongs
13. Incremental Addition of Protein
Functional Domains
Phosphatase catalytic
Cadherin-like
Immunoglobulin
MAM domain Cellular retinaldehyde
Adhesion recognition Transmembrane
Fibronectin III Glycosylation
15. What is the Ontology Telling Us?
• Each class of phosphatase defined in terms of
domain composition
• We know the characteristics by which an
individual protein can be recognised to be a
member of a particular class of phosphatase
• We have this knowledge in a computational form
• If we had protein instances described in terms of
the ontology, we could classify those individual
proteins
• A catalogue of phosphatases
16. Description of an Instance of a
Protein
Individual: P21592
Types: Protein,
hasDomain 2
ProteinTyrosinePhosphataseDomain
hasdomain 1 TransmembraneDomain,,
hasdomain 4 FibronectinDomain,
hasDomain 1 ImmunoglobulinDomain,
hasdomain 1 MAMDomain,
hasdomain 1 Cadherin-LikeDomain
17. Instance: P21592
TypeOf: Protein That
Fact: hasDomain 2 ProteinTyrosinePhosphataseDomain
and
Fact: hasdomain 1 TransmembraneDomain and
Fact: hasdomain 4 FibronectinDomains and
Fact: hasDomain 1 ImmunoglobulinDomain and
Fact: hasdomain 1 MAMDomain and
Fact: hasdomain 1 Cadherin-LikeDomain
Tyrosine Phosphatase
(containsDomain some TransmembraneDomain) and
(containsDomain at least 1 ProteinTyrosinePhosphataseDomain)
tase
n some MAMDomain) and
n some ProteinTyrosineCatalyticDomain or ImmunoglobulinDomain) and
n some FibronectinDomain or FibronectinTypeIIIFoldDomain) and
n exactly 2 ProteinTyrosinePhosphataseDomain)
19. So Far…..
• Human phosphatases have been classified using the
system
• The ontology classification performed equally well as
expert classification
• The ontology system refined classification
- DUSC contains zinc finger domain
Characterised and conserved – but not in classification
- DUSA contains a disintegrin domain
previously uncharacterised – evolutionarily conserved
• A new kind of phosphatase?
20. Aspergillus fumigatus
• Phosphatase compliment very different from human
>100 human <50 A.fumigatus
• Whole subfamilies ‘missing’
Different fungi-specific phosphorylation pathways?
No requirement for tissue-specific variations?
• Novel serine/threonine phosphatase with homeobox
Conserved in aspergillus and closely related species,
but not in any other
Again, a new phosphatase?
21. Generic Technique
• Feature detection
• Categories defined in terms of those
features
• Produce catalogue of what you currently
know
• Highlight cases that don’t match current
knowledge
22. The Cell type Ontology
• Some 880 terms
• Describing cell function, lineage,
developmental stage, ploidy, secretion,
species,…
• Not explicitly classified according to anatomy
• Uses is-a and developsFrom
• Used to describe cell types used in
experiments
24. Issues with Current CTO
• History: A need was seen and a few days
was spent “lashing” together an ontology
by hand
• Contains lots of knowledge
• Asserted multiple inheritance: Humans will
make slips and it is difficult
• Some biological mistakes
• All the knowledge is within the “is-a”
relationships and implicit in the cell names
25. CTO Axes of Classification
• Histology: What cells look like
• Lineage: Whence a given cell develops
• Ploidy: How sets of chromosomes in a cell
• Nucleation: How many nuclei
• Secretion & accumulation: What chemicals a
cells secretes or accumulates
• Function: What does the cell do
• Location: In anatomy
• Species: In what taxa does the cell exist
• And some others
27. Problems
• Tangles
• Hard to maintain
• Difficult to add a new cell
• Inflexible queries: What about hormone
secreting mesodermal cells?
• Information hidden inside term names
29. Describing a Big Blue Ford Car
Class: BigBlueFordCar
SubClassOf: Car
that hasColour some Blue
and hasSize some Big
and hasManufacturer some Ford
30. Modules
• Choose a primary axis: In this case
Vehicle
• Other axes are represented in separate
modules (Colour, Size (qualities) and
manufacturer)
• Represent other aspects of classes
through restrictions
• (Spot the ontological howler in this toy
example)
31. Definition of a Red Car
Class: RedCar
EquivalentTo: Car
that hasColour some Red
• Any car that has the colour red is
recognised to be a member of the
class RedCar
• The reasoner works it all out and
builds the hierarchy for you
32. Normalisation
• This technique of “pulling” apart tangled
ontologies is “normalisation”
• Makes for cleaner modelling
• Makes for re-usable components
• The reasoner builds the taxonomy
“completely”
• A new car (e.g., yellow Saab” is described
and it just appears in the right place
33. What We Did
• Examined CTO
• Chose primary axis of classification
• All other axes added as restrictions on
class membership
• Describe cells
• Build ontology
• Use reasoner
35. Mammalian Red Blood Cell
Class: RedBloodCell
SubclassOf: Cell
That hasNucleation some Anucleate
and participatesIn some
OxygenTransport
and existsIn some mammalia
and part_of some BloodTissue
and developsFrom some Reticulocyte
36. Mesodermal Lineage Cells
Class: MesodermalLineageCell
EquivalentTo: Cell
That developsFrom some
MesodermalCell
(developsFrom is transitive)
39. Secreting Cells
Class: EpinephrinSecretingCell
SubclassOf: Cell
That belongs_to_line some Somatic
and has_nucleation some mononucleate
and has_ploidy some diploid
and potentiality some TerminallyDifferentiated
and participates_in some EpinephrineBiosyntheticProcess
and participates_in EpinephrineSecretion
Class: ProlactinSecretingCell
SubclassOf: Cell
That belongs_to_line some Somatic
and has_nucleation some mononucleate
and has_ploidy some diploid
and potentiality some TerminallyDifferentiated
and participates_in some PeptideHormoneSecretion
and participates_in some ProlactinSecretion
40. Defined Cells
Class: SecretoryCell
EquivalentTo: Cell
that participates_in some (secretion or
(part_of some secrection)
Class: EndocrineCell
EquivalentTo: Cell
that participates_in some (EndocrineProcess or
(part_of some EndocrineProcess)
43. What We Found
• More subsumption relationships
• The “is-a” hierarchy is complete
• Explicitness made us ask questions
• Found bad structure
• Can just slip in a new cell
• Can make arbitrary queries based on any
of the types of axis
44. Conclusions
• Can use strict semantics and automated
reasoning to build structurally sound ontologies
• Can catalogue instances and make discoveries
• If an object can be recognised by its features
and features can be computationally generated
classification can be automated
• High cost and high benefit
45. Acknowledgements
• Katy Wolstencroft did the protein
phosphtase work as part of her Ph.D.
• The work on the cell type ontology was
udnertaken by members of the EPSRC
OntoGenesis Network
• All the ontoogy work at Manchester relies
on the support and input of the wider
BioHealth and Information Management
Groups
Editor's Notes
All of which helps build better ontologies. But can we actually apply this computational amenability more
Directly to biological knowledge. In this example, which is work by Katy Wolstencroft, we have codified
Community knowledge about protein domains in phosphatases in OWL. We then take unknown protein sequences,
Pass then through interpro and stick them into the instance store, which is basically a database and reasoner tied together
Qualified Cardiniality!!!
Showing cell by function, cell by histology etc….
Tangled on left, arrow moving to picture of untangled ontology
Shows the asserted hierarchy in OWLViz for epiniephrin and prolactin cell (primitives) as well as Secretory and Endocrine Cell (defined)
Shows the inferred hierarchy (all parents) for epiniephrin and prolactin cell as well as defined secretory and endocrine Cell. Shows lots of multiple inheritance inferred by the reasoner.