Molecular scaffolds are special
and useful guides for discovery
Jeremy Yang, UNM & IU
Cristian Bologa, UNM
David Wild, IU
Tudor Oprea, UNM
ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN
CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science
What is a molecular scaffold?
• "Ring-system"
• "Ring"
• "Core"
• "Framework"
Can you identify this famous scaffold?
Some famous scaffolds
• beta-lactam (penicillins, cephalosporins): millions saved, billions earned
• steroid (testosterone, hydrocortisone, etc.): wonder drugs and hormones
• benzodiazepine (Valium, flurazepam, etc.): “Mother’s little helper”
Scaffolds are special because:
3D shape
Estradiol docked into ER-α (OpenEye Fred, Vida)
Scaffold scholarship & software
• Bemis & Murcko, “Molecular frameworks”, 1996.
• Lewell et al., “Drug rings database”, 2003.
• Wilkens et al., “HierS: hierarchical scaffolds”, 2005.
• Ertl et al., “Quest for the Rings”, 2006.
• Clustering, indexing databases.
• Navigation of chemical space.
• Data reduction, visualization.
• R-group / SAR analyses.
• Bioactivity prediction.
• Promiscuity prediction.
Scaffold Applications
What can be done with scaffolds?
"The only rules that really matter are these: what a man can do and what a man can't do."
- Jack Sparrow
Scaffold Applications: Scaffold Hunter
Interactive exploration of chemical space with Scaffold Hunter,
S Wetzel, K Klein, S Renner, D Rauh, T Oprea, P Mutzel, H
Waldmann, Nat Chem Bio, 5, 2009, 581-583.
Scaffold Applications: Scaffold Hopper
Scaffold Hopper, NCATS/NCGC, http://tripod.nih.gov,
http://tripod.nih.gov/files/ACS_apr8_2013.pdf.
Scaffold Applications: CARLSBAD
CARLSBAD: The Power to Explore Biological Networks via Chemical Patterns
The CARLSBAD Database: A Confederated Database of Chemical Bioactivities,
S. L. Mathias, J. Hines-Kay, J. J. Yang, G. Zahoransky-Kohalmi, C. G. Bologa, O.
Ursu and T. I. Oprea, Database, 2013, bat044. http://carlsbad.health.unm.edu
Scaffold Applications: Molecule Cloud
The Molecule Cloud - compact visualization of large collections of
molecules, P Ertl and B Rohde, J. Cheminfo, 2012, 4:12.
Scaffold Applications: Badapple
(BioActivity Data Associative Promiscuity Pattern Learning Engine)
Translational Informatics Public Webapps:
http://pasilla.health.unm.edu/
See also my Badapple talk in CINF session "Integrative Chemogenomics Knowledge Mining Using NIH
Open Access Resources", Tues. Sept. 9, 10:45am, Rm. 140.
Scaffold Applications: Badapple Promiscuity Plugin
Badapple Promiscuity Plugin for BARD, http://bard.nih.gov
Scaffold software: UNM-Biocomp-HScaf
(Open-source Google Code project)
http://code.google.com/p/unm-biocomp-hscaf/
UNM Translational Informatics Public Web Apps:
http://pasilla.health.unm.edu/
Demo web app: HScaf
Scaffold analysis algorithm
• Remove non-linking chains
• Keep linking chains
• Keep atoms multiply-bonded to rings and chains
• Special case: ignore solo-benzene.
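The pruning steps listed above can be sketched on a bare molecular graph. This is a minimal illustration, not the HScaf implementation: the function name scaffold_atoms, the (i, j, order) bond tuples, and the example molecule are all assumptions made for this sketch, and the solo-benzene special case is deliberately left out.

```python
def scaffold_atoms(n_atoms, bonds):
    """Return the set of atom indices kept in the scaffold (illustrative only).

    bonds is a list of (i, j, order) tuples over atoms 0..n_atoms-1.
    """
    neighbors = {a: set() for a in range(n_atoms)}
    for i, j, _order in bonds:
        neighbors[i].add(j)
        neighbors[j].add(i)

    # Remove non-linking chains, keep linking chains: repeatedly strip atoms
    # that have at most one remaining neighbor. Rings and the chains that
    # connect rings survive; everything else is peeled away.
    kept = set(range(n_atoms))
    changed = True
    while changed:
        changed = False
        for a in sorted(kept):
            if len(neighbors[a] & kept) <= 1:
                kept.discard(a)
                changed = True

    # Keep atoms multiply-bonded to rings and (linking) chains: add back any
    # pruned atom attached to the framework by a bond of order 2 or higher.
    framework = frozenset(kept)
    for i, j, order in bonds:
        if order >= 2:
            if i in framework and j not in framework:
                kept.add(j)
            elif j in framework and i not in framework:
                kept.add(i)
    return kept


def six_ring(start):
    """Bonds of a six-membered ring on atoms start..start+5 (orders set to 1)."""
    return [(start + k, start + (k + 1) % 6, 1) for k in range(6)]


# Hypothetical test molecule: two six-membered rings (atoms 0-5 and 7-12)
# joined through linker atom 6, which carries a double-bonded atom 13;
# atom 14 is a single-bonded substituent on ring atom 3.
EXAMPLE_BONDS = six_ring(0) + six_ring(7) + [
    (0, 6, 1), (6, 7, 1), (6, 13, 2), (3, 14, 1),
]
print(sorted(scaffold_atoms(15, EXAMPLE_BONDS)))
```

In this example the substituent (atom 14) is dropped, the linker (atom 6) is kept because it joins two rings, and atom 13 is kept because it is double-bonded to the linker. A fully acyclic molecule prunes down to nothing.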
HierS scaffold hierarchy
quinine
Bemis-Murcko
framework
scaffolds
Cheminformatics and scaffolds:
Relevant methods
• SSSR (Smallest Set of Smallest Rings)
• Canonicalization (e.g. Morgan, CanSMILES)
• Scaffolds vs. MCS (max common subgraph)
• Fingerprints, descriptors, similarity
• Proposed new method: scaffold-based similarity
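As a small companion to the SSSR bullet above: the number of rings in an SSSR is fixed by the graph alone, as bonds minus atoms plus connected components. The sketch below only counts the rings, it does not perceive them; the function name sssr_size and the bond-pair input format are assumptions for illustration, and each bond is assumed to be listed once.

```python
def sssr_size(n_atoms, bonds):
    """Number of rings in an SSSR: bonds - atoms + connected components.

    bonds is a list of (i, j) atom-index pairs, each bond listed once.
    """
    parent = list(range(n_atoms))

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    components = n_atoms
    for i, j in bonds:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj
            components -= 1
    return len(bonds) - n_atoms + components


# Benzene: one six-membered ring.
BENZENE = [(k, (k + 1) % 6) for k in range(6)]
# Naphthalene: a second six-membered ring fused on the 4-5 bond.
NAPHTHALENE = BENZENE + [(5, 6), (6, 7), (7, 8), (8, 9), (9, 4)]

print(sssr_size(6, BENZENE), sssr_size(10, NAPHTHALENE))
```

Which particular rings make up the set is a separate and harder question, and the choice is not always unique, which is part of why ring perception needs care.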
More scaffold charms
• Patents, Markush, $$$.
• Lead discovery ~ scaffold discovery.
• Organic chemists like scaffolds.
• Scaffolds can be "privileged".
Scaffolds & drug-scaffolds, the privileged few
explaining a lot of activity...
Dataset: BARD, MLSMR, MLP HTS
Totals: compounds: 373,802; scaffolds: 146,024; assays: 528; wells/results: 30,612,714; drugs: 283; drugscafs: 1,958

Set     % total activity   # scaffolds   % scaffolds
All     50%                1,979         1.4%
All     75%                11,645        8%
Drugs   50%                54            2.8%
Drugs   90%                327           16.7%
“activity of DB” ~ # active scaffold-instances
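The coverage figures above answer a question of the form "how few scaffolds account for X% of all activity?". A minimal sketch of that calculation follows, assuming activity is counted as active scaffold-instances per scaffold. The function name, the input dictionary, and the toy counts are hypothetical and are not the BARD/MLSMR data.

```python
def scaffolds_for_coverage(active_counts, fraction):
    """Smallest number of scaffolds whose active instances reach the given
    fraction of all active scaffold-instances (most active scaffolds first)."""
    total = sum(active_counts.values())
    if total == 0:
        return 0
    running = 0
    ranked = sorted(active_counts.values(), reverse=True)
    for n, count in enumerate(ranked, start=1):
        running += count
        if running >= fraction * total:
            return n
    return len(ranked)


# Toy data (hypothetical): active scaffold-instances per scaffold.
TOY_COUNTS = {"scafA": 50, "scafB": 30, "scafC": 15, "scafD": 5}
for frac in (0.5, 0.75, 0.9):
    n = scaffolds_for_coverage(TOY_COUNTS, frac)
    print(f"{frac:.0%} of activity: {n} of {len(TOY_COUNTS)} scaffolds")
```

Dividing the returned count by the total number of scaffolds gives the "% scaffolds" column; for example, 1,979 of 146,024 scaffolds is about 1.4%.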
Privileged scaffolds concept
Nature favors a few privileged scaffolds, a.k.a.
"privileged structures", for multiple receptors.
"What is clear is that certain “privileged structures” are capable
of providing useful ligands for more than one receptor and that
judicious modification of such structures could be a viable
alternative in the search for new receptor agonists and
antagonists."*
*Methods for drug discovery: development of potent, selective, orally effective
cholecystokinin antagonists, Evans et al., J. Med. Chem., 1988, 31, 2235.
News: antibiotic, scaffold:
Anthracimycin
Anthracimycin, a Potent Anthrax Antibiotic from a Marine-Derived Actinomycete,
Kyoung Hwa Jang et al., Angewandte Chemie, vol. 52, no 30, 2013, pp7822–7824; doi:
10.1002/anie.201302749.
Problems with scaffolds
• Definition of "scaffold" not consistent & rigorous
among chemists & cheminformaticians.
Testosterone
Estradiol
Danazol
Cyproterone acetate
"We think in generalities, but we live in detail." - Alfred North Whitehead
http://en.wikipedia.org/wiki/Steroids
Steroidogenesis
Steroid pattern definition via SMARTS:
[#8]~[#6;R1]~1~[#6;R1]~[#6;R1]~[#6;R2]~2~[#6;R2]~1~[#6;R2]~[#6;R1]~[#6;R2]~1~[#6;R2]~2~[#6;R1]~[#6;R1]~[#6;R2]~2~[#6;R1]~[#6;R1](~[#8])~[#6;R1]~[#6;R1]~[#6;R2]~1~2
Problems solved by Cheminformatics
Conclusion:
Molecular scaffolds (like cheminformatics itself) are special and useful guides for discovery in chemical biology, chemogenomics, and drug discovery
Thank Yous:
Cristian Bologa, UNM
Tudor Oprea, UNM
Oleg Ursu, UNM
David Wild, IU
Gary Wiggins, IU
Happy Explorations!

Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

Molecular scaffolds are special and useful guides to discovery

  • 1. Molecular scaffolds are special and useful guides for discovery. Jeremy Yang, UNM & IU; Cristian Bologa, UNM; David Wild, IU; Tudor Oprea, UNM. ACS National Meeting - Sept. 8-12, 2013 - Indianapolis, IN. CINF Graduate Student Research Symposium in Cheminformatics, Information Science, and Library Science.
  • 3. What is a molecular scaffold? • "Ring-system" • "Ring" • "Core" • "Framework" Can you identify this famous scaffold?
  • 4. Some famous scaffolds: beta-lactam (penicillins, cephalosporins): millions saved, billions earned. Steroid (testosterone, hydrocortisone, etc.): wonder drugs and hormones. Benzodiazepine (Valium, flurazepam, etc.): “Mother’s little helper”.
  • 5. Scaffolds are special because: 3D shape. Estradiol docked into ER-α (OpenEye Fred, Vida).
  • 6. Scaffold scholarship & software • Bemis & Murcko, “Molecular frameworks”, 1996. • Lewell et al., “Drug rings database”, 2003. • Wilkens et al., “HierS: hierarchical scaffolds”, 2005. • Ertl et al., “Quest for the Rings”, 2006.
  • 7. Scaffold Applications: What can be done with scaffolds? • Clustering, indexing databases. • Navigation of chemical space. • Data reduction, visualization. • R-group / SAR analyses. • Bioactivity prediction. • Promiscuity prediction. "The only rules that really matter are these: what a man can do and what a man can't do." - Jack Sparrow
  • 8. Scaffold Applications: Scaffold Hunter Interactive exploration of chemical space with Scaffold Hunter, S Wetzel, K Klein, S Renner, D Rauh, T Oprea, P Mutzel, H Waldmann, Nat Chem Bio, 5, 2009, 581-583.
  • 9. Scaffold Applications: Scaffold Hopper Scaffold Hopper, NCATS/NCGC, http://tripod.nih.gov, http://tripod.nih.gov/files/ACS_apr8_2013.pdf.
  • 10. Scaffold Applications: CARLSBAD CARLSBAD: The Power to Explore Biological Networks via Chemical Patterns The CARLSBAD Database: A Confederated Database of Chemical Bioactivities, S. L. Mathias, J. Hines-Kay, J. J. Yang, G. Zahoransky-Kohalmi, C. G. Bologa, O. Ursu and T. I. Oprea, Database, 2013, bat044. http://carlsbad.health.unm.edu
  • 11. Scaffold Applications: Molecule Cloud The Molecule Cloud - compact visualization of large collections of molecules, P Ertl and B Rohde, J. Cheminfo, 2012, 4:12.
  • 12. Scaffold Applications: Badapple (BioActivity Data Associative Promiscuity Pattern Learning Engine) Translational Informatics Public Webapps: http://pasilla.health.unm.edu/
  • 13. Scaffold Applications: Badapple Promiscuity Plugin. Badapple Promiscuity Plugin for BARD, http://bard.nih.gov. See also my Badapple talk in CINF session "Integrative Chemogenomics Knowledge Mining Using NIH Open Access Resources", Tues. Sept. 9, 10:45am, Rm. 140.
  • 14. Scaffold software: UNM-Biocomp-HScaf (Open-source Google Code project) http://code.google.com/p/unm-biocomp-hscaf/
  • 15. UNM Translational Informatics Public Web Apps: http://pasilla.health.unm.edu/ Demo web app: HScaf
  • 16. Scaffold analysis algorithm • Remove non-linking chains • Keep linking chains • Keep atoms multiply-bonded to rings and chains • Special case: ignore solo-benzene.
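The steps listed on slide 16 can be sketched on a plain molecular graph. This is only an illustrative reading of the bullets, not the UNM-Biocomp-HScaf implementation: the function names, the `(i, j, order)` bond tuples, and the use of order 1.5 to mark aromatic bonds are all assumptions made for this example.

```python
from collections import defaultdict


def scaffold_atoms(n_atoms, bonds):
    """Return the atom indices kept in the scaffold.

    bonds: list of (i, j, order) tuples; order 1.5 marks an aromatic bond
    (a convention assumed for this sketch).
    """
    neighbors = defaultdict(set)
    for i, j, _ in bonds:
        neighbors[i].add(j)
        neighbors[j].add(i)

    # Remove non-linking chains: strip terminal atoms until none remain.
    # Rings and the chains that link them survive this pruning.
    core = set(range(n_atoms))
    changed = True
    while changed:
        changed = False
        for atom in sorted(core):
            if len(neighbors[atom] & core) <= 1:
                core.discard(atom)
                changed = True

    # Keep atoms multiply bonded to ring or linker atoms (e.g. a carbonyl O).
    kept = set(core)
    for i, j, order in bonds:
        if order >= 2:
            if i in core:
                kept.add(j)
            if j in core:
                kept.add(i)
    return kept


def is_solo_benzene(symbols, bonds, kept):
    """True if the scaffold is a single six-carbon aromatic ring."""
    inner = [o for i, j, o in bonds if i in kept and j in kept]
    return (
        len(kept) == 6
        and all(symbols[a] == "C" for a in kept)
        and len(inner) == 6
        and all(o == 1.5 for o in inner)
    )


def ring(start):
    """Aromatic six-membered ring on atoms start..start+5."""
    return [(start + k, start + (k + 1) % 6, 1.5) for k in range(6)]


# Hypothetical input: a 4-methylbenzophenone-like graph
# (ring 0-5, carbonyl C 6, O 7, ring 8-13, methyl 14).
bzp_symbols = ["C"] * 7 + ["O"] + ["C"] * 7
bzp_bonds = ring(0) + [(0, 6, 1), (6, 7, 2), (6, 8, 1)] + ring(8) + [(11, 14, 1)]
bzp_scaffold = scaffold_atoms(len(bzp_symbols), bzp_bonds)

# Hypothetical input: a toluene-like graph (ring 0-5, methyl 6).
toluene_symbols = ["C"] * 7
toluene_bonds = ring(0) + [(0, 6, 1)]
toluene_scaffold = scaffold_atoms(len(toluene_symbols), toluene_bonds)
```

In the first molecule the methyl group is pruned while the carbonyl oxygen is retained because it is double-bonded to a linker atom. In the second, the only scaffold left is a lone benzene ring, which the special case on the slide says to ignore.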
  • 18. Cheminformatics and scaffolds: Relevant methods • SSSR (Smallest Set of Smallest Rings) • Canonicalization (e.g. Morgan, CanSMILES) • Scaffolds vs. MCS (max common subgraph) • Fingerprints, descriptors, similarity • Proposed new method: scaffold-based similarity
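Slide 18 names scaffold-based similarity as a proposed method but does not define it. One possible reading, offered here purely as an assumption, is to describe each molecule by its set of scaffolds and compare the sets with the Tanimoto (Jaccard) coefficient that is commonly used for fingerprints. The scaffold sets below are hypothetical and written as SMILES strings.

```python
def tanimoto(a, b):
    """Tanimoto (Jaccard) coefficient of two sets: |A & B| / |A | B|."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0  # convention chosen for this sketch when both sets are empty
    return len(a & b) / len(a | b)


# Hypothetical scaffold sets for two molecules.
mol_a = {"c1ccncc1", "c1ccc2[nH]ccc2c1"}  # pyridine, indole
mol_b = {"c1ccncc1", "c1ccc2ncccc2c1"}    # pyridine, quinoline

similarity = tanimoto(mol_a, mol_b)  # one shared scaffold out of three distinct
```

Comparing sets of strings in this way only makes sense if the scaffolds have been canonicalized first, which is why canonicalization appears in the same list of relevant methods.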
  • 19. More scaffold charms • Patents, Markush, $$$. • Lead discovery ~ scaffold discovery. • Organic chemists like scaffolds. • Scaffolds can be "privileged".
  • 20. Scaffolds & drug-scaffolds, the privileged few explaining a lot of activity... Dataset: BARD, MLSMR, MLP HTS. Totals: compounds: 373,802; scaffolds: 146,024; assays: 528; wells/results: 30,612,714; drugs: 283; drugscafs: 1,958. Table (set, % total activity, # scaffolds, % scaffolds): All, 50%, 1,979, 1.4%; All, 75%, 11,645, 8%; Drugs, 50%, 54, 2.8%; Drugs, 90%, 327, 16.7%. “activity of DB” ~ # active scaffold-instances
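The coverage figures on slide 20 can be read as "how many top-ranked scaffolds are needed to account for a given share of the activity". The sketch below shows that calculation on toy counts and rechecks the slide's percentages from its own totals. The function name, the toy data, and the ranking by active scaffold-instances are assumptions; the slide does not say how the ranking was done.

```python
def scaffolds_needed(active_counts, fraction):
    """Smallest number of top-ranked scaffolds whose active instances
    reach `fraction` of the total across all scaffolds."""
    total = sum(active_counts.values())
    running = 0
    ranked = sorted(active_counts.values(), reverse=True)
    for rank, count in enumerate(ranked, start=1):
        running += count
        if running >= fraction * total:
            return rank
    return len(active_counts)


def percent(part, whole):
    """Percentage rounded to one decimal place, as shown on the slide."""
    return round(100 * part / whole, 1)


# Toy data (hypothetical): active instances per scaffold.
toy = {"s1": 50, "s2": 30, "s3": 10, "s4": 5, "s5": 5}
half = scaffolds_needed(toy, 0.5)             # top scaffold alone covers 50 of 100
three_quarters = scaffolds_needed(toy, 0.75)  # top two cover 80 of 100

# Consistency check of the slide's figures against its totals.
all_50 = percent(1979, 146024)
all_75 = percent(11645, 146024)
drugs_50 = percent(54, 1958)
drugs_90 = percent(327, 1958)
```

The recomputed percentages agree with the rounded values given on the slide.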
  • 21. Privileged scaffolds concept Nature favors a few privileged scaffolds, a.k.a. "privileged structures", for multiple receptors. "What is clear is that certain “privileged structures” are capable of providing useful ligands for more than one receptor and that judicious modification of such structures could be a viable alternative in the search for new receptor agonists and antagonists."* *Methods for drug discovery: development of potent, selective, orally effective cholecystokinin antagonists, Evans et al., J. Med. Chem., 1988, 31, 2235.
  • 22. News: antibiotic, scaffold: Anthracimycin. Anthracimycin, a Potent Anthrax Antibiotic from a Marine-Derived Actinomycete, Kyoung Hwa Jang et al., Angewandte Chemie, vol. 52, no. 30, 2013, pp. 7822–7824; doi: 10.1002/anie.201302749.
  • 23. Problems with scaffolds • The definition of "scaffold" is neither consistent nor rigorous among chemists & cheminformaticians. Examples shown: Testosterone, Estradiol, Danazol, Cyproterone acetate. "We think in generalities, but we live in detail." - Alfred North Whitehead
  • 25. Conclusion: Molecular scaffolds (like cheminformatics itself) are special and useful guides for discovery in chemical biology, chemogenomics, and drug discovery.
  • 26. Thank Yous: Cristian Bologa, UNM; Tudor Oprea, UNM; Oleg Ursu, UNM; David Wild, IU; Gary Wiggins, IU. Happy Explorations!