CDD BioAssay Express: Expanding the target dimension: How to visualize a lot of SAR

Alex Clark
Alex ClarkFounder of Molecular Materials Informatics, Scientist at Collaborative Drug Discovery
Expanding the target dimension:
How to visualize a lot of SAR
Alex M. Clark
http://www.bioassayexpress.com
⚡
COLLABORATIVE DRUG DISCOVERY
Protocols & Datasets
◈ Structure-activity data
usually exists in isolation:
▷ one target, one protocol
▷ many compounds &
measurements
◈ Datasets often related:
▷ same target?
▷ same protocol?
◈ How would we know?
2
Prerequisite
◈ Protocols normally described using scientific English
◈ Alternatively, many ontologies are available:
▷ BAO (BioAssay Ontology)
▷ DTO (Drug Target Ontology)
▷ CLO (Cell Line Ontology)
▷ GO (Gene Ontology)
▷ ... and many others
◈ Each term has a URI: global meaning, hierarchical
3
Common Assay Template
◈ Pulls together
ontology
categories
◈ ~22 assignments
◈ Captures key
summary
information
4
Common Assay Template
◈ Pulls together
ontology
categories
◈ ~22 assignments
◈ Captures key
summary
information
4
Protocol Similarity
5
504854 449764
A Cell Based Secondary Assay To Explore Vero
Cell Cytotoxicity of Purified and Synthesized
Compounds that Inhibit Mycobacterium
Tuberculosis (4)
This functional assay was developed for
detection of compounds inhibiting Vero E6 cells
viability as a secondary screen to the beta-
lactam sensitizing M. tuberculosis bacteriocidal
assay.
In this assay, we treated Vero E6 cells with
compounds selected as hits in the M.
tuberculosis assay for 72 hours over a 10 point
2-fold dilution series, ranging from 0.195 uM to
100 uM. Following 72 hours of treatment,
relative viable cell number was determined
using Cell Titer Glo from Promega. Each plate
contained 64 replicates of vehicle treated cells
which served as negative controls.
Outcome: Compounds that showed <70% cell
viability for at least one concentration were
defined as "Active". If the % viability at all doses
was >70%, the compound was defined as
"Inactive".
...
A High Throughput Confirmatory Assay used to
Identify Novel Compounds that Inhibit
Mycobacterium Tuberculosis in the absence of
Glycerol
Outcome: Compounds that showed >30% inhibition
for at least one concentration in the Mtb dose
response were defined as "Active". If the inhibition
at all doses was <30% in the Mtb assay, the
compound was defined as "Inactive". In the primary
screen a compound was deemed "Inactive" if it had
a Percent Inhibition <70.31%. Compounds with a
Percent Inhibition >70.31% but were not selected
for follow up dose response were labeled
"Inconclusive."
The following tiered system has been implemented
at Southern Research Institute for use with the
PubChem Score. Compounds in the primary screen
are scored on a scale of 0-40 based on inhibitory
activity where a score of 40 corresponds to 100%
inhibition. In the confirmatory dose response
screen, active compounds were scored on a scale
of 41-80 based on the IC50 result in the Mtb assay
while compounds that did not confirm as actives
were given the score 0.
...
69%
similar (?)
Protocol Similarity
5
semantic
annotations
≡
fingerprints
Matrix: Assays vs. Compounds
◈ Curated ~3500 assays from PubChem:
▷ mostly from Molecular Libraries project
▷ natural language/machine learning helped
▷ carefully analyzed by expert biologists
◈ Enough data for proof of concept assay informatics
◈ Adding the assay dimension to structure-activity
relationship analysis
6
Now With Compounds
◈ Each assay has compounds & data from PubChem:
7
Now With Compounds
◈ Each assay has compounds & data from PubChem:
7
Now With Compounds
◈ Each assay has compounds & data from PubChem:
7
Looking at Primary Screens
◈ Browse the curated content from PubChem:
8
Looking at Primary Screens
◈ Browse the curated content from PubChem:
8
Looking at Primary Screens
◈ Browse the curated content from PubChem:
8
Matching Assays
Matching Assays
Frequent Hitters
◈ Rank by score = # hits vs. selected assays
10
Assay Grid (1°)
11
Assay Grid (1°)
11
Assay Clustering
◈ Leverage hierarchical structure of ontology terms:
12
Assay Clustering
◈ Leverage hierarchical structure of ontology terms:
12
Group by Disease
13
Group by Disease
13
Group by Disease
13
Selectivity
◈ Using secondary assays
◈ Ranking score =
14
# inactives
# actives
Assay Grid (2°)
15
Assay Grid (2°)
15
Assay Grid (2°)
15
Similarity
◈ Can select compounds by similarity to a reference
structure (ECFP6/Tanimoto):
16
Similarity
◈ Can select compounds by similarity to a reference
structure (ECFP6/Tanimoto):
16
paste/drag structure
Assay Grid
17
liver
cirrhosis
Assay Grid
17
liver
cirrhosis
Assay Grid
17
parasitic
helminthiasis
liver
cirrhosis
Conclusion
◈ Cheminformatics & bioinformatics are well
established, but assay informatics is new
◈ Data curation has begun: much yet remains (>1M)
◈ Content curation is the main bottleneck
▷ BioAssay Express is key to solving this problem
◈ Content consumption is at proof of concept stage
▷ ... and this is the fun part.
18
Future Work
◈ Crowd curation of public data: you should want your
assays to be machine readable
◈ Model building: create from structure-activity-assay,
apply all existing knowledge to prospective discovery
◈ ELN context: use semantic annotations instead of
text, describe assays comprehensively
▷ convert annotations to text, for communication
▷ translate protocols to import formats for robots
19
Assay Data
◈ All features, and curated data, available at:
◦ http://www.bioassayexpress.com
◦ http://beta.bioassayexpress.com
◈ Ready for crowd curation - free to use in public
◈ Commercial product: private installations, with
custom settings, are being deployed
20
Acknowledgments
◈ Biologists
▷ Janice Kranz
▷ Haifa Ghandour
▷ Karen Featherstone
21
◈ Collaborative Drug Discovery
⇨ Barry Bunin
⇨ Hande Kücük
⇨ and the rest of the team
◈ More information
http://github.com/cdd/bioassay-template
http://www.bioassayexpress.com
http://collaborativedrug.com
PeerJ
Comp Sci
2:e61
(2016)
1 of 37

Recommended

BioAssay Express by
BioAssay ExpressBioAssay Express
BioAssay ExpressAlex Clark
782 views18 slides
Virtual Screening and Hit Prioritization by
Virtual Screening and Hit PrioritizationVirtual Screening and Hit Prioritization
Virtual Screening and Hit PrioritizationPuneet Kacker
1.9K views25 slides
Ligbuilder V2: overview and tutorial. by
Ligbuilder V2: overview and tutorial.Ligbuilder V2: overview and tutorial.
Ligbuilder V2: overview and tutorial.Ashish Pratim Mahanta
347 views12 slides
Session ii g2 lab modeling mmc by
Session ii g2 lab modeling mmcSession ii g2 lab modeling mmc
Session ii g2 lab modeling mmcUSD Bioinformatics
315 views5 slides
Virtual screening techniques by
Virtual screening techniquesVirtual screening techniques
Virtual screening techniquesROHIT PAL
7.4K views28 slides
CINF66 Visualizing Molecules In and Out of Context by
CINF66 Visualizing Molecules In and Out of ContextCINF66 Visualizing Molecules In and Out of Context
CINF66 Visualizing Molecules In and Out of ContextJeff White
53 views29 slides

More Related Content

Similar to CDD BioAssay Express: Expanding the target dimension: How to visualize a lot of SAR

Drug Target Interaction (DTI) prediction (MSc. thesis) by
Drug Target Interaction (DTI) prediction (MSc. thesis) Drug Target Interaction (DTI) prediction (MSc. thesis)
Drug Target Interaction (DTI) prediction (MSc. thesis) Dimitris Papadopoulos
232 views28 slides
Bioassay by
BioassayBioassay
BioassayDr Renju Ravi
227.2K views49 slides
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge by
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekingeProf. Wim Van Criekinge
3K views104 slides
Network analysis of cancer metabolism: A novel route to precision medicine by
Network analysis of cancer metabolism: A novel route to precision medicineNetwork analysis of cancer metabolism: A novel route to precision medicine
Network analysis of cancer metabolism: A novel route to precision medicineVarshit Dusad
49 views31 slides
Bioinformatics t9-t10-biocheminformatics v2014 by
Bioinformatics t9-t10-biocheminformatics v2014Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014Prof. Wim Van Criekinge
3.2K views65 slides
Sorting bioactive wheat from database chaff by
Sorting bioactive wheat from database chaffSorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaffChris Southan
289 views1 slide

Similar to CDD BioAssay Express: Expanding the target dimension: How to visualize a lot of SAR(20)

Drug Target Interaction (DTI) prediction (MSc. thesis) by Dimitris Papadopoulos
Drug Target Interaction (DTI) prediction (MSc. thesis) Drug Target Interaction (DTI) prediction (MSc. thesis)
Drug Target Interaction (DTI) prediction (MSc. thesis)
Network analysis of cancer metabolism: A novel route to precision medicine by Varshit Dusad
Network analysis of cancer metabolism: A novel route to precision medicineNetwork analysis of cancer metabolism: A novel route to precision medicine
Network analysis of cancer metabolism: A novel route to precision medicine
Varshit Dusad49 views
Sorting bioactive wheat from database chaff by Chris Southan
Sorting bioactive wheat from database chaffSorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaff
Chris Southan289 views
Sorting bioactive wheat from database chaff: Challenges of discerning correct... by Guide to PHARMACOLOGY
Sorting bioactive wheat from database chaff: Challenges of discerning correct...Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj... by Kamel Mansouri
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri105 views
USFDA guidelines for bioanalytical method validation by bhatiaji123
USFDA guidelines for bioanalytical method validationUSFDA guidelines for bioanalytical method validation
USFDA guidelines for bioanalytical method validation
bhatiaji12321.5K views
Scoring and ranking of metabolic trees to computationally prioritize chemical... by Kamel Mansouri
Scoring and ranking of metabolic trees to computationally prioritize chemical...Scoring and ranking of metabolic trees to computationally prioritize chemical...
Scoring and ranking of metabolic trees to computationally prioritize chemical...
Kamel Mansouri254 views
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013 by Prof. Wim Van Criekinge
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
A_Pope_RQRM_LeadDisc_June_2016 by Andrew Pope
A_Pope_RQRM_LeadDisc_June_2016A_Pope_RQRM_LeadDisc_June_2016
A_Pope_RQRM_LeadDisc_June_2016
Andrew Pope440 views
Bringing bioassay protocols to the world of informatics, using semantic annot... by Alex Clark
Bringing bioassay protocols to the world of informatics, using semantic annot...Bringing bioassay protocols to the world of informatics, using semantic annot...
Bringing bioassay protocols to the world of informatics, using semantic annot...
Alex Clark322 views
Autonomous model building with a preponderance of well annotated assay protocols by Alex Clark
Autonomous model building with a preponderance of well annotated assay protocolsAutonomous model building with a preponderance of well annotated assay protocols
Autonomous model building with a preponderance of well annotated assay protocols
Alex Clark595 views
Protein Remote Homology Detection by Alia Hamwi
Protein Remote Homology DetectionProtein Remote Homology Detection
Protein Remote Homology Detection
Alia Hamwi80 views
Reaxys medicinal chemistry oct2014 by Reaxys
Reaxys medicinal chemistry oct2014Reaxys medicinal chemistry oct2014
Reaxys medicinal chemistry oct2014
Reaxys760 views
Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA) by Kamel Mansouri
Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
Chemical prioritization using in silico modeling. SOT 2018 (San Antonio, USA)
Kamel Mansouri53 views

More from Alex Clark

Mixtures QSAR: modelling collections of chemicals by
Mixtures QSAR: modelling collections of chemicalsMixtures QSAR: modelling collections of chemicals
Mixtures QSAR: modelling collections of chemicalsAlex Clark
259 views24 slides
Mixtures InChI: a story of how standards drive upstream products by
Mixtures InChI: a story of how standards drive upstream productsMixtures InChI: a story of how standards drive upstream products
Mixtures InChI: a story of how standards drive upstream productsAlex Clark
600 views19 slides
Mixtures as first class citizens in the realm of informatics by
Mixtures as first class citizens in the realm of informaticsMixtures as first class citizens in the realm of informatics
Mixtures as first class citizens in the realm of informaticsAlex Clark
422 views19 slides
Mixtures: informatics for formulations and consumer products by
Mixtures: informatics for formulations and consumer productsMixtures: informatics for formulations and consumer products
Mixtures: informatics for formulations and consumer productsAlex Clark
125 views27 slides
Coordination InChI (2019) by
Coordination InChI (2019)Coordination InChI (2019)
Coordination InChI (2019)Alex Clark
223 views15 slides
Chemical mixtures: File format, open source tools, example data, and mixtures... by
Chemical mixtures: File format, open source tools, example data, and mixtures...Chemical mixtures: File format, open source tools, example data, and mixtures...
Chemical mixtures: File format, open source tools, example data, and mixtures...Alex Clark
777 views20 slides

More from Alex Clark(20)

Mixtures QSAR: modelling collections of chemicals by Alex Clark
Mixtures QSAR: modelling collections of chemicalsMixtures QSAR: modelling collections of chemicals
Mixtures QSAR: modelling collections of chemicals
Alex Clark259 views
Mixtures InChI: a story of how standards drive upstream products by Alex Clark
Mixtures InChI: a story of how standards drive upstream productsMixtures InChI: a story of how standards drive upstream products
Mixtures InChI: a story of how standards drive upstream products
Alex Clark600 views
Mixtures as first class citizens in the realm of informatics by Alex Clark
Mixtures as first class citizens in the realm of informaticsMixtures as first class citizens in the realm of informatics
Mixtures as first class citizens in the realm of informatics
Alex Clark422 views
Mixtures: informatics for formulations and consumer products by Alex Clark
Mixtures: informatics for formulations and consumer productsMixtures: informatics for formulations and consumer products
Mixtures: informatics for formulations and consumer products
Alex Clark125 views
Coordination InChI (2019) by Alex Clark
Coordination InChI (2019)Coordination InChI (2019)
Coordination InChI (2019)
Alex Clark223 views
Chemical mixtures: File format, open source tools, example data, and mixtures... by Alex Clark
Chemical mixtures: File format, open source tools, example data, and mixtures...Chemical mixtures: File format, open source tools, example data, and mixtures...
Chemical mixtures: File format, open source tools, example data, and mixtures...
Alex Clark777 views
ACS CINF Luncheon talk (Boston 2018) by Alex Clark
ACS CINF Luncheon talk (Boston 2018)ACS CINF Luncheon talk (Boston 2018)
ACS CINF Luncheon talk (Boston 2018)
Alex Clark298 views
Representing molecules with minimalism: A solution to the entropy of informatics by Alex Clark
Representing molecules with minimalism: A solution to the entropy of informaticsRepresenting molecules with minimalism: A solution to the entropy of informatics
Representing molecules with minimalism: A solution to the entropy of informatics
Alex Clark614 views
SLAS2016: Why have one model when you could have thousands? by Alex Clark
SLAS2016: Why have one model when you could have thousands?SLAS2016: Why have one model when you could have thousands?
SLAS2016: Why have one model when you could have thousands?
Alex Clark749 views
The anatomy of a chemical reaction: Dissection by machine learning algorithms by Alex Clark
The anatomy of a chemical reaction: Dissection by machine learning algorithmsThe anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithms
Alex Clark1.3K views
Compact models for compact devices: Visualisation of SAR using mobile apps by Alex Clark
Compact models for compact devices: Visualisation of SAR using mobile appsCompact models for compact devices: Visualisation of SAR using mobile apps
Compact models for compact devices: Visualisation of SAR using mobile apps
Alex Clark1.3K views
Green chemistry in chemical reactions: informatics by design by Alex Clark
Green chemistry in chemical reactions: informatics by designGreen chemistry in chemical reactions: informatics by design
Green chemistry in chemical reactions: informatics by design
Alex Clark2.6K views
ICCE 2014: The Green Lab Notebook by Alex Clark
ICCE 2014: The Green Lab NotebookICCE 2014: The Green Lab Notebook
ICCE 2014: The Green Lab Notebook
Alex Clark1.1K views
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014) by Alex Clark
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
Alex Clark914 views
Building a mobile reaction lab notebook (ACS Dallas 2014) by Alex Clark
Building a mobile reaction lab notebook (ACS Dallas 2014)Building a mobile reaction lab notebook (ACS Dallas 2014)
Building a mobile reaction lab notebook (ACS Dallas 2014)
Alex Clark1.4K views
Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013 by Alex Clark
Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013
Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013
Alex Clark1.8K views
Alex Clark : NETTAB 2013 by Alex Clark
Alex Clark : NETTAB 2013Alex Clark : NETTAB 2013
Alex Clark : NETTAB 2013
Alex Clark1.3K views
Open Drug Discovery Teams @ Hacking Health Montreal by Alex Clark
Open Drug Discovery Teams @ Hacking Health MontrealOpen Drug Discovery Teams @ Hacking Health Montreal
Open Drug Discovery Teams @ Hacking Health Montreal
Alex Clark932 views
Pistoia Alliance App Strategy by Alex Clark
Pistoia Alliance App StrategyPistoia Alliance App Strategy
Pistoia Alliance App Strategy
Alex Clark1.1K views
Mobile+Cloud: a viable replacement for desktop cheminformatics? by Alex Clark
Mobile+Cloud: a viable replacement for desktop cheminformatics?Mobile+Cloud: a viable replacement for desktop cheminformatics?
Mobile+Cloud: a viable replacement for desktop cheminformatics?
Alex Clark891 views

Recently uploaded

A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...InsideScientific
58 views62 slides
1978 NASA News Release Log by
1978 NASA News Release Log1978 NASA News Release Log
1978 NASA News Release Logpurrterminator
10 views146 slides
Distinct distributions of elliptical and disk galaxies across the Local Super... by
Distinct distributions of elliptical and disk galaxies across the Local Super...Distinct distributions of elliptical and disk galaxies across the Local Super...
Distinct distributions of elliptical and disk galaxies across the Local Super...Sérgio Sacani
31 views12 slides
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl... by
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...GIFT KIISI NKIN
26 views31 slides
Disinfectants & Antiseptic by
Disinfectants & AntisepticDisinfectants & Antiseptic
Disinfectants & AntisepticSanket P Shinde
48 views36 slides
MILK LIPIDS 2.pptx by
MILK LIPIDS 2.pptxMILK LIPIDS 2.pptx
MILK LIPIDS 2.pptxabhinambroze18
7 views15 slides

Recently uploaded(20)

A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
InsideScientific58 views
Distinct distributions of elliptical and disk galaxies across the Local Super... by Sérgio Sacani
Distinct distributions of elliptical and disk galaxies across the Local Super...Distinct distributions of elliptical and disk galaxies across the Local Super...
Distinct distributions of elliptical and disk galaxies across the Local Super...
Sérgio Sacani31 views
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl... by GIFT KIISI NKIN
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...
Synthesis and Characterization of Magnetite-Magnesium Sulphate-Sodium Dodecyl...
GIFT KIISI NKIN26 views
PRINCIPLES-OF ASSESSMENT by rbalmagro
PRINCIPLES-OF ASSESSMENTPRINCIPLES-OF ASSESSMENT
PRINCIPLES-OF ASSESSMENT
rbalmagro12 views
별헤는 사람들 2023년 12월호 전명원 교수 자료 by sciencepeople
별헤는 사람들 2023년 12월호 전명원 교수 자료별헤는 사람들 2023년 12월호 전명원 교수 자료
별헤는 사람들 2023년 12월호 전명원 교수 자료
sciencepeople41 views
MODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdf by KerryNuez1
MODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdfMODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdf
MODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdf
KerryNuez125 views
CSF -SHEEBA.D presentation.pptx by SheebaD7
CSF -SHEEBA.D presentation.pptxCSF -SHEEBA.D presentation.pptx
CSF -SHEEBA.D presentation.pptx
SheebaD714 views
Experimental animal Guinea pigs.pptx by Mansee Arya
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptx
Mansee Arya17 views
Pollination By Nagapradheesh.M.pptx by MNAGAPRADHEESH
Pollination By Nagapradheesh.M.pptxPollination By Nagapradheesh.M.pptx
Pollination By Nagapradheesh.M.pptx
MNAGAPRADHEESH16 views
Nitrosamine & NDSRI.pptx by NileshBonde4
Nitrosamine & NDSRI.pptxNitrosamine & NDSRI.pptx
Nitrosamine & NDSRI.pptx
NileshBonde417 views
A giant thin stellar stream in the Coma Galaxy Cluster by Sérgio Sacani
A giant thin stellar stream in the Coma Galaxy ClusterA giant thin stellar stream in the Coma Galaxy Cluster
A giant thin stellar stream in the Coma Galaxy Cluster
Sérgio Sacani10 views
How to be(come) a successful PhD student by Tom Mens
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD student
Tom Mens491 views
application of genetic engineering 2.pptx by SankSurezz
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptx
SankSurezz10 views

CDD BioAssay Express: Expanding the target dimension: How to visualize a lot of SAR

  • 1. Expanding the target dimension: How to visualize a lot of SAR Alex M. Clark http://www.bioassayexpress.com ⚡ COLLABORATIVE DRUG DISCOVERY
  • 2. Protocols & Datasets ◈ Structure-activity data usually exists in isolation: ▷ one target, one protocol ▷ many compounds & measurements ◈ Datasets often related: ▷ same target? ▷ same protocol? ◈ How would we know? 2
  • 3. Prerequisite ◈ Protocols normally described using scientific English ◈ Alternatively, many ontologies are available: ▷ BAO (BioAssay Ontology) ▷ DTO (Drug Target Ontology) ▷ CLO (Cell Line Ontology) ▷ GO (Gene Ontology) ▷ ... and many others ◈ Each term has a URI: global meaning, hierarchical 3
  • 4. Common Assay Template ◈ Pulls together ontology categories ◈ ~22 assignments ◈ Captures key summary information 4
  • 5. Common Assay Template ◈ Pulls together ontology categories ◈ ~22 assignments ◈ Captures key summary information 4
  • 6. Protocol Similarity 5 504854 449764 A Cell Based Secondary Assay To Explore Vero Cell Cytotoxicity of Purified and Synthesized Compounds that Inhibit Mycobacterium Tuberculosis (4) This functional assay was developed for detection of compounds inhibiting Vero E6 cells viability as a secondary screen to the beta- lactam sensitizing M. tuberculosis bacteriocidal assay. In this assay, we treated Vero E6 cells with compounds selected as hits in the M. tuberculosis assay for 72 hours over a 10 point 2-fold dilution series, ranging from 0.195 uM to 100 uM. Following 72 hours of treatment, relative viable cell number was determined using Cell Titer Glo from Promega. Each plate contained 64 replicates of vehicle treated cells which served as negative controls. Outcome: Compounds that showed <70% cell viability for at least one concentration were defined as "Active". If the % viability at all doses was >70%, the compound was defined as "Inactive". ... A High Throughput Confirmatory Assay used to Identify Novel Compounds that Inhibit Mycobacterium Tuberculosis in the absence of Glycerol Outcome: Compounds that showed >30% inhibition for at least one concentration in the Mtb dose response were defined as "Active". If the inhibition at all doses was <30% in the Mtb assay, the compound was defined as "Inactive". In the primary screen a compound was deemed "Inactive" if it had a Percent Inhibition <70.31%. Compounds with a Percent Inhibition >70.31% but were not selected for follow up dose response were labeled "Inconclusive." The following tiered system has been implemented at Southern Research Institute for use with the PubChem Score. Compounds in the primary screen are scored on a scale of 0-40 based on inhibitory activity where a score of 40 corresponds to 100% inhibition. In the confirmatory dose response screen, active compounds were scored on a scale of 41-80 based on the IC50 result in the Mtb assay while compounds that did not confirm as actives were given the score 0. ... 69% similar (?)
  • 8. Matrix: Assays vs. Compounds ◈ Curated ~3500 assays from PubChem: ▷ mostly from Molecular Libraries project ▷ natural language/machine learning helped ▷ carefully analyzed by expert biologists ◈ Enough data for proof of concept assay informatics ◈ Adding the assay dimension to structure-activity relationship analysis 6
  • 9. Now With Compounds ◈ Each assay has compounds & data from PubChem: 7
  • 10. Now With Compounds ◈ Each assay has compounds & data from PubChem: 7
  • 11. Now With Compounds ◈ Each assay has compounds & data from PubChem: 7
  • 12. Looking at Primary Screens ◈ Browse the curated content from PubChem: 8
  • 13. Looking at Primary Screens ◈ Browse the curated content from PubChem: 8
  • 14. Looking at Primary Screens ◈ Browse the curated content from PubChem: 8
  • 17. Frequent Hitters ◈ Rank by score = # hits vs. selected assays 10
  • 20. Assay Clustering ◈ Leverage hierarchical structure of ontology terms: 12
  • 21. Assay Clustering ◈ Leverage hierarchical structure of ontology terms: 12
  • 25. Selectivity ◈ Using secondary assays ◈ Ranking score = 14 # inactives # actives
  • 29. Similarity ◈ Can select compounds by similarity to a reference structure (ECFP6/Tanimoto): 16
  • 30. Similarity ◈ Can select compounds by similarity to a reference structure (ECFP6/Tanimoto): 16 paste/drag structure
  • 34. Conclusion ◈ Cheminformatics & bioinformatics are well established, but assay informatics is new ◈ Data curation has begun: much yet remains (>1M) ◈ Content curation is the main bottleneck ▷ BioAssay Express is key to solving this problem ◈ Content consumption is at proof of concept stage ▷ ... and this is the fun part. 18
  • 35. Future Work ◈ Crowd curation of public data: you should want your assays to be machine readable ◈ Model building: create from structure-activity-assay, apply all existing knowledge to prospective discovery ◈ ELN context: use semantic annotations instead of text, describe assays comprehensively ▷ convert annotations to text, for communication ▷ translate protocols to import formats for robots 19
  • 36. Assay Data ◈ All features, and curated data, available at: ◦ http://www.bioassayexpress.com ◦ http://beta.bioassayexpress.com ◈ Ready for crowd curation - free to use in public ◈ Commercial product: private installations, with custom settings, are being deployed 20
  • 37. Acknowledgments ◈ Biologists ▷ Janice Kranz ▷ Haifa Ghandour ▷ Karen Featherstone 21 ◈ Collaborative Drug Discovery ⇨ Barry Bunin ⇨ Hande Kücük ⇨ and the rest of the team ◈ More information http://github.com/cdd/bioassay-template http://www.bioassayexpress.com http://collaborativedrug.com PeerJ Comp Sci 2:e61 (2016)