SlideShare a Scribd company logo
1 of 41
Download to read offline
Exploiting Bigger Data and Collaborative
Tools for Predictive Drug Discovery
Sean Ekins, CSO
CDD
CDD website 2010-2013
Perkin Elmer in
Laboratory Informatics Guide 2014
This guy deserves a coffee
“by provisioning the right amount of storage and compute resources, cost can be significantly
reduced with no significant impact on application performance”
CDD’s Influence spreads
beyond the cloud
2014
2004 - present
Small Data circa late 1990’s
Big Data circa 2010’s
~193,000 cpds
Drug Metab Dispos, 38: 2083-2090, 2010
-3
-2.5
-2
-1.5
-1
-0.5
0
0.5
1
1.5
-3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5
logIC50-tips
log IC50-acoustic
Hydrophobi
c features
(HPF)
Hydrogen
bond
acceptor
(HBA)
Hydrogen
bond
donor
(HBD)
Observed
vs.
predicted
IC50 r
Acoustic mediated process 2 1 1 0.92
Tip-based process 0 2 1 0.80
Acoustic Tip based
Generated with Discovery Studio
(Accelrys)
Cyan = hydrophobic
Green = hydrogen bond acceptor
Purple = hydrogen bond donor
Each model shows most potent
molecule mapping
How you dispense liquids may be important: insights from small data
PLoS ONE 8(5): e62325 (2013)
Future: sharing chemical relationships without
structures
Matlock and Swamidass J. Chem. Inf. Model. 2014, 54, 37−48
Drug Discovery Archeology
• Still a heavy emphasis on
“testing” “doing “ rather
than ‘learning’
• Mining data and historic
data will increase in value
• Data becomes a
repurposing opportunity
• How do we position
databases for this?
• What about neglected
diseases?
Tuberculosis kills 1.6-1.7m/yr (~1 every
8 seconds)
1/3rd of worlds population infected!!!!
streptomycin (1943)
para-aminosalicyclic acid (1949)
isoniazid (1952)
pyrazinamide (1954)
cycloserine (1955)
ethambutol (1962)
rifampicin (1967)
Multi drug resistance in
4.3% of cases
Extensively drug resistant
increasing incidence
one new drug
(bedaquiline) in 40 yrs
Ponder et al., Pharm Res 31: 271-277, 2014
Source G-Finder
Freundlich Laboratory Collaborations Rely on CDD for Data Tracking!
• Three collaborations within Rutgers–NJMS
• Collaboration with Johns Hopkins, SRI, and CDD
• Collaboration with Johns Hopkins
• Collaboration with CDD
Supported by
7 Active NIH Grants
Chemical Probe
Evolution
Drug Discovery
Compound
Evolution
Target
Identification
& Validation
Godbole et al., Biochem Biophys Res Comm 2014, in press
24 groups in this project use a single
Biotin biosynthesis
dethiobiotin
Pharmacophore
Searching Maybridge (57K)
gives 72 molecules – many
of them hydrophobic so
they stand a chance of in
vitro activity
Take substrate
and generate 3D
conformers and
build a
pharmacophore
Use the
pharmacophore
to search vendor
libraries in 3D
Buy and test
compounds
Fishing: Example of mimic strategy for bioB Rv1589
 Sarker et al., Pharm Res 2012, 29:2115-27
Over 5 years analyzed in vitro data and built models
Top scoring molecules
assayed for
Mtb growth inhibition
Mtb screening
molecule
database/s
High-throughput
phenotypic
Mtb screening
Descriptors + Bioactivity (+Cytotoxicity)
Bayesian Machine Learning classification Mtb Model
Molecule Database
(e.g. GSK malaria
actives)
virtually scored
using Bayesian Models
New bioactivity data
may enhance models
Identify in vitro hits and test models
3 x published prospective tests >20% hit rate
Multiple retrospective tests 3-10 fold enrichment
N
H
S
N
Ekins et al., Pharm Res 31: 414-435, 2014
Ekins, et al., Tuberculosis 94; 162-169, 2014
Ekins, et al., PLOSONE 8; e63240, 2013
Ekins, et al., Chem Biol 20: 370-378, 2013
Ekins, et al., JCIM, 53: 3054−3063, 2013
Ekins and Freundlich, Pharm Res, 28, 1859-1869, 2011
Ekins et al., Mol BioSyst, 6: 840-851, 2010
Ekins, et al., Mol. Biosyst. 6, 2316-2324, 2010,
A summary of some of the numbers involved – filtering for
hits.
>250,000 molecules screened through Bayesian models
~750 molecules were tested in vitro
198 actives were identified
>20 % hit rate
Identified several novel potent hit series with good
cytotoxicity & selectivity Identified known human kinase
inhibitors and FDA approved drugs as hits
Ekins et al., PLOSONE 2013 May 7;8(5):e63240;
Ekins et al.,Chem Biol 20, 370–378, 2013
Ekins et al., Tuberculosis 94: 162-169
• BAS00521003/ TCMDC-125802 reported to be a P.
falciparum lactate dehydrogenase inhibitor
• Only one report of antitubercular activity from 1969
- solid agar MIC = 1 mg/mL (“wild strain”)
- “no activity” in mouse model up to 400 mg/kg
- however, activity was solely judged by
extension of survival!
Bruhin, H. et al., J. Pharm. Pharmac. 1969, 21, 423-433.
.
MIC of 0.0625 ug/mL
• 64X MIC affords 6 logs of
kill
• Resistance and/or drug
instability beyond 14 d
Vero cells : CC50 = 4.0
mg/mL
Selectivity Index SI =
CC50/MICMtb = 16 – 64
In mouse no toxicity but
also no efficacy in GKO
model – probably
metabolized.
Ekins et al.,Chem Biol 20, 370–378, 2013
Taking a compound in vivo identifies issues
Filling out the triazine matrix using SARtable:
A new kind of map
Green = good activity, Red = bad; colored dots are predictions
Tested >350,000 molecules Tested ~2M 2M >300,000
>1500 active and non toxic Published 177 100s 800
Big Data: Screening for New Tuberculosis Treatments
How many will become a new drug?
How do we learn from this big data?
Others have likely screened another 500,000
Hunting High and Low for new
molecules to test
We need to search
sources..
From the Oceans…
To the ground
To the trees
To the air..
And do it virtually
Find new libraries
to screen virtually
and test
Take everything out of CDD public
• Run through TB Bayesian models
• Score
• Test
What is the next bottleneck?
$ $ $ $ $Five-fold increase in the publication of TB mouse model studies from 1997 to 2009
Franco, PLoS One 7, e47723 (2012).
Billions of $ of your money spent on TB
but no database of mouse in vivo data !
Hunting for the in vivo data
It’s out there.. be patient
Building the mouse TB database
Manually curated, structures sketched Mobile Molecular
DataSheet (MMDS) iOS app or ChemDraw (Perkin Elmer)
Downloaded from www.chemspider.com
Combined with pertinent data fields
1 log10 reduction in Mtb colony-forming units (CFUs) in the lungs
Publically available CDD TB database (In process)
30 years with little TB mouse in vivo data
MIND THE TB GAP
JCIM In Press 2014
Ekins, Nuermberger & Freundlich Submitted
Where are the New TB drugs to be found?
PCA of in vivo
(yellow)
and compounds
with known
targets
(blue)
PCA of TB in
vitro actives
(blue)
TB in vivo
(yellow)
PCA of In vivo
actives and
inactives
Inactive (blue)
Active (yellow)
JCIM In Press 2014
Machine Learning Models
Bayesian
Support Vector Machine
Recursive partitioning (single and multiple trees)
Using Accelrys Discovery Studio and R.
RP Forest RP Single
Tree
SVM Bayesian
0.75 0.71 0.77 0.73ROC 5 fold cross validation
JCIM In Press 2014
RP Forest RP Single
Tree
SVM Bayesian
3 /11
(27.2%)
4/11
(36.4%)
7/11
(63.6%)
8/11
(72.7%)
External Test set
Studio and R.
11 additional active molecules obtained from 1953-2013
JCIM In Press 2014
A much higher ratio of compounds were tested in vivo to
in vitro in the 1940s-1960s rather than now
Infrastructure to provide a clear understanding of the
position of compounds in the pipeline is essentially
lacking
Shortage of new candidates suggest we may lack the
commitment and resources we had 6o years ago
Use machine learning in vivo models to prioritize Mouse
studies
Ekins, Nuermberger & Freundlich Submitted
The Clock is ticking
MoDELS RESIDE IN PAPERS
NOT ACCESSIBLE…THIS IS
UNDESIRABLE
How do we share them?
How do we use Them?
ECFP_6 FCFP_6
• Collected,
deduplicated,
hashed
• Sparse integers
• Invented for Pipeline Pilot: public method, proprietary details
• Often used with Bayesian models: many published papers
• Built a new implementation: open source, Java, CDK
– stable: fingerprints don't change with each new toolkit release
– well defined: easy to document precise steps
– easy to port: already migrated to iOS (Objective-C) for TB Mobile app
• Provides core basis feature for CDD open source model service
Dataset Leave one out
ROC Published
Reference Leave one out
ROC Open
fingerprints
In vivo data (773
molecules) FCFP_6
fingerprints
0.77 this study 0.75
Combined model (5304
molecules) FCFP_6
fingerprints
0.71 J Chem Inf
Model
53:3054-
3063.
0.77
MLSMR dual event model
(2273 molecules) and
FCFP_6 fingerprints
0.86 PLOSONE
8:e63240
0.83
Same datasets –
Versus published data
Clark et al., submitted 2014
Open fingerprints and bayesian method used in TB Mobile Vers.2
Could we add in vivo prediction
models to this?
Ekins et al., J Cheminform 5:13, 2013
Predict targets
Cluster molecules
http://goo.gl/vPOKS
http://goo.gl/iDJFR
In vitro data In vivo data
Target data
ADME/Tox data & Models
Drug-like scaffold creation
TB Prediction Tools TB Publications
Data sources and tools we could integrate
Future: How
can we tackle
more
diseases?
Chagas Disease
Reverse the mimic
approach to predict
targets of hits
Use pharmacophpores
for targets e.g. CYP51
Use machine learning
models to identify novel
compounds
Test in vitro
The new faces of
personalized medicine:
children with rare diseases
The Rare Disease Parent Odyssey
• Diagnosis of child
• Try to find out about disease – papers behind paywall
• Try to connect with scientists
• Form not-for-profit
• Raise funds
• Fund Scientific research on disease
• Advocate for support from NIH, FDA etc
• Start a company
• Try to find a cure before its too late
Could we create a rare disease community for scientists & foundations ?
Wood J, Drug Disc Today, 18: 1043–1051, 2013
Rare diseases inspired an App that
may be a new kind of database
upload molecules by tweeting them-
1 tweet upload
Take our data with us anywhere
Bring data off the cloud into device
Advantages you get to analyze it in
the Cloud on a plane
Future: how do
we deliver data
All at CDD and many others …Funding: 1R41AI088893-01, 2R42AI088893-02, R43
LM011152-01, 9R44TR000942-02, 1R41AI108003-01, MM4TB, Software: Accelrys

More Related Content

What's hot

Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1Jean-Claude Bradley
 
Using Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical DevelopmentsUsing Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical DevelopmentsTao You
 
New target prediction and vizualization tools incorporating open source molec...
New target prediction and vizualization tools incorporating open source molec...New target prediction and vizualization tools incorporating open source molec...
New target prediction and vizualization tools incorporating open source molec...Sean Ekins
 
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug DiscoveryCollaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug DiscoverySean Ekins
 
A comparative in silico study finds a functional co-relation between human hs...
A comparative in silico study finds a functional co-relation between human hs...A comparative in silico study finds a functional co-relation between human hs...
A comparative in silico study finds a functional co-relation between human hs...ijbbjournal
 
Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)Tania Acuna
 
dual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverydual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverySean Ekins
 
Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...
Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...
Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...U.S. EPA Office of Research and Development
 
Application of Computational and High-Throughput in vitro Screening for Prior...
Application of Computational and High-Throughput in vitro Screening for Prior...Application of Computational and High-Throughput in vitro Screening for Prior...
Application of Computational and High-Throughput in vitro Screening for Prior...U.S. EPA Office of Research and Development
 
49.Antitumour and Radioprotective activity of Alloe vera
49.Antitumour and Radioprotective activity of Alloe vera49.Antitumour and Radioprotective activity of Alloe vera
49.Antitumour and Radioprotective activity of Alloe veraAnnadurai B
 
Identification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesIdentification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesYoann Pageaud
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleLaurence Dawkins-Hall
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataChirag Patel
 
A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...Laurence Dawkins-Hall
 
mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...
mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...
mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...Levi Shapiro
 
Isac 2010 Poster Jay 042910 Final
Isac 2010 Poster Jay 042910 FinalIsac 2010 Poster Jay 042910 Final
Isac 2010 Poster Jay 042910 Finaljoropab1
 
Searching for predictors of male fecundity
Searching for predictors of male fecunditySearching for predictors of male fecundity
Searching for predictors of male fecundityChirag Patel
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 

What's hot (20)

Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1
 
Using Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical DevelopmentsUsing Multi-Scale Modeling to Support Preclinical Developments
Using Multi-Scale Modeling to Support Preclinical Developments
 
New target prediction and vizualization tools incorporating open source molec...
New target prediction and vizualization tools incorporating open source molec...New target prediction and vizualization tools incorporating open source molec...
New target prediction and vizualization tools incorporating open source molec...
 
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug DiscoveryCollaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
 
A comparative in silico study finds a functional co-relation between human hs...
A comparative in silico study finds a functional co-relation between human hs...A comparative in silico study finds a functional co-relation between human hs...
A comparative in silico study finds a functional co-relation between human hs...
 
Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)
 
dual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverydual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discovery
 
Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...
Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...
Using Computational Toxicology to Enable Risk-Based Chemical Safety Decision ...
 
Application of Computational and High-Throughput in vitro Screening for Prior...
Application of Computational and High-Throughput in vitro Screening for Prior...Application of Computational and High-Throughput in vitro Screening for Prior...
Application of Computational and High-Throughput in vitro Screening for Prior...
 
Computational Toxicity in 21st Century Safety Sciences
Computational Toxicity in 21st Century Safety SciencesComputational Toxicity in 21st Century Safety Sciences
Computational Toxicity in 21st Century Safety Sciences
 
49.Antitumour and Radioprotective activity of Alloe vera
49.Antitumour and Radioprotective activity of Alloe vera49.Antitumour and Radioprotective activity of Alloe vera
49.Antitumour and Radioprotective activity of Alloe vera
 
Identification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesIdentification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databases
 
Full Thesis
Full ThesisFull Thesis
Full Thesis
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...
 
mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...
mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...
mHealth Israel_Ryo Kosaka_AIST_National Institute of Advanced Industrial Scie...
 
Isac 2010 Poster Jay 042910 Final
Isac 2010 Poster Jay 042910 FinalIsac 2010 Poster Jay 042910 Final
Isac 2010 Poster Jay 042910 Final
 
Searching for predictors of male fecundity
Searching for predictors of male fecunditySearching for predictors of male fecundity
Searching for predictors of male fecundity
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 

Viewers also liked

Emerging technology analysis visualization based data discovery tools
Emerging technology analysis visualization based data discovery toolsEmerging technology analysis visualization based data discovery tools
Emerging technology analysis visualization based data discovery toolsrectoc27
 
Visualization for Discovery
Visualization for DiscoveryVisualization for Discovery
Visualization for DiscoveryTuri, Inc.
 
Data drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistryData drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistryAnn-Marie Roche
 
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...Salford Systems
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsSean Ekins
 
Chemical Spaces: Modeling, Exploration & Understanding
Chemical Spaces: Modeling, Exploration & UnderstandingChemical Spaces: Modeling, Exploration & Understanding
Chemical Spaces: Modeling, Exploration & UnderstandingRajarshi Guha
 
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and HadoopGoogle Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoophuguk
 
Role of Tableau on the Data Discovery Market
Role of Tableau on the Data Discovery MarketRole of Tableau on the Data Discovery Market
Role of Tableau on the Data Discovery MarketDmitry Anoshin
 
Alteryx Architecture
Alteryx ArchitectureAlteryx Architecture
Alteryx ArchitectureVivek Mohan
 
Agile large-scale machine-learning pipelines in drug discovery
Agile large-scale machine-learning pipelines in drug discoveryAgile large-scale machine-learning pipelines in drug discovery
Agile large-scale machine-learning pipelines in drug discoveryOla Spjuth
 
Cheminformatics in drug design
Cheminformatics in drug designCheminformatics in drug design
Cheminformatics in drug designSurmil Shah
 
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
Data Wrangling on Hadoop - Olivier De Garrigues, TrifactaData Wrangling on Hadoop - Olivier De Garrigues, Trifacta
Data Wrangling on Hadoop - Olivier De Garrigues, Trifactahuguk
 
Chemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleChemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleRajarshi Guha
 

Viewers also liked (16)

Dispensing error
Dispensing errorDispensing error
Dispensing error
 
VDD Tools
VDD ToolsVDD Tools
VDD Tools
 
Emerging technology analysis visualization based data discovery tools
Emerging technology analysis visualization based data discovery toolsEmerging technology analysis visualization based data discovery tools
Emerging technology analysis visualization based data discovery tools
 
Visualization for Discovery
Visualization for DiscoveryVisualization for Discovery
Visualization for Discovery
 
Data drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistryData drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistry
 
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
Improved Predictions in Structure-Based Drug Design Using CART and Bayesian M...
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
 
Chemical Spaces: Modeling, Exploration & Understanding
Chemical Spaces: Modeling, Exploration & UnderstandingChemical Spaces: Modeling, Exploration & Understanding
Chemical Spaces: Modeling, Exploration & Understanding
 
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and HadoopGoogle Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
 
Role of Tableau on the Data Discovery Market
Role of Tableau on the Data Discovery MarketRole of Tableau on the Data Discovery Market
Role of Tableau on the Data Discovery Market
 
Alteryx Architecture
Alteryx ArchitectureAlteryx Architecture
Alteryx Architecture
 
Agile large-scale machine-learning pipelines in drug discovery
Agile large-scale machine-learning pipelines in drug discoveryAgile large-scale machine-learning pipelines in drug discovery
Agile large-scale machine-learning pipelines in drug discovery
 
Cheminformatics in drug design
Cheminformatics in drug designCheminformatics in drug design
Cheminformatics in drug design
 
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
Data Wrangling on Hadoop - Olivier De Garrigues, TrifactaData Wrangling on Hadoop - Olivier De Garrigues, Trifacta
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
 
Chemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleChemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & Reproducible
 
Medication error
Medication errorMedication error
Medication error
 

Similar to Exploiting bigger data and collaborative tools for predictive drug discovery

Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...Sean Ekins
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoverySean Ekins
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judesSean Ekins
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16Sean Ekins
 
Unc slides on computational toxicology
Unc slides on computational toxicologyUnc slides on computational toxicology
Unc slides on computational toxicologySean Ekins
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016Sean Ekins
 
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryMel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryJean-Claude Bradley
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologySean Ekins
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...Sean Ekins
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsSean Ekins
 
BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011Sean Ekins
 
ICIC 2013 New Product Introductions GVK Bio Informatics
ICIC 2013 New Product Introductions GVK Bio InformaticsICIC 2013 New Product Introductions GVK Bio Informatics
ICIC 2013 New Product Introductions GVK Bio InformaticsDr. Haxel Consult
 
GVK BIO Informatics ICIC 2013
GVK BIO Informatics ICIC 2013GVK BIO Informatics ICIC 2013
GVK BIO Informatics ICIC 2013GVK Biosciences
 
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...Fundación Ramón Areces
 
Finland Helsinki Drug Research slides 2011
Finland Helsinki Drug Research slides 2011Finland Helsinki Drug Research slides 2011
Finland Helsinki Drug Research slides 2011Sean Ekins
 
Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...
Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...
Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...Ivan Brukner
 
Machine Learning in Healthcare by Mehrdad Yazdani
Machine Learning in Healthcare by Mehrdad YazdaniMachine Learning in Healthcare by Mehrdad Yazdani
Machine Learning in Healthcare by Mehrdad YazdaniData Con LA
 
Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014LushPrize
 
Analytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liuAnalytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liuQian Liu, phD
 

Similar to Exploiting bigger data and collaborative tools for predictive drug discovery (20)

Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug Discovery
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judes
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16
 
Unc slides on computational toxicology
Unc slides on computational toxicologyUnc slides on computational toxicology
Unc slides on computational toxicology
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016
 
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryMel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational Toxicology
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
 
BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011
 
ICIC 2013 New Product Introductions GVK Bio Informatics
ICIC 2013 New Product Introductions GVK Bio InformaticsICIC 2013 New Product Introductions GVK Bio Informatics
ICIC 2013 New Product Introductions GVK Bio Informatics
 
GVK BIO Informatics ICIC 2013
GVK BIO Informatics ICIC 2013GVK BIO Informatics ICIC 2013
GVK BIO Informatics ICIC 2013
 
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
 
Finland Helsinki Drug Research slides 2011
Finland Helsinki Drug Research slides 2011Finland Helsinki Drug Research slides 2011
Finland Helsinki Drug Research slides 2011
 
Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...
Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...
Assay-for-estimating-total-bacterial-load-relative-qPCR-normalisation-of-bact...
 
Machine Learning in Healthcare by Mehrdad Yazdani
Machine Learning in Healthcare by Mehrdad YazdaniMachine Learning in Healthcare by Mehrdad Yazdani
Machine Learning in Healthcare by Mehrdad Yazdani
 
Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014
 
Analytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liuAnalytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liu
 

More from Sean Ekins

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptxSean Ekins
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Sean Ekins
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...Sean Ekins
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Sean Ekins
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseSean Ekins
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Sean Ekins
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueSean Ekins
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesSean Ekins
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchSean Ekins
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation Sean Ekins
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2Sean Ekins
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3 Sean Ekins
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2 Sean Ekins
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1 Sean Ekins
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...Sean Ekins
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b igSean Ekins
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - Sean Ekins
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientistsSean Ekins
 
Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...Sean Ekins
 
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...Sean Ekins
 

More from Sean Ekins (20)

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptx
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas Disease
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issue
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b ig
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models -
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientists
 
Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...
 
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
 

Recently uploaded

Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Servicevidya singh
 
Bangalore Call Girls Nelamangala Number 9332606886 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 9332606886  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 9332606886  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 9332606886 Meetin With Bangalore Esc...narwatsonia7
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...narwatsonia7
 
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomdiscovermytutordmt
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Call Girls in Nagpur High Profile
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...astropune
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...astropune
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...jageshsingh5554
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Dipal Arora
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiAlinaDevecerski
 
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...hotbabesbook
 
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...narwatsonia7
 
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...tanya dube
 
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...narwatsonia7
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeCall Girls Delhi
 
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service AvailableDipal Arora
 

Recently uploaded (20)

Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
 
Bangalore Call Girls Nelamangala Number 9332606886 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 9332606886  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 9332606886  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 9332606886 Meetin With Bangalore Esc...
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
 
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
 
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
 
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
 
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kochi Just Call 8250077686 Top Class Call Girl Service Available
 

Exploiting bigger data and collaborative tools for predictive drug discovery

  • 1. Exploiting Bigger Data and Collaborative Tools for Predictive Drug Discovery Sean Ekins, CSO CDD
  • 2. CDD website 2010-2013 Perkin Elmer in Laboratory Informatics Guide 2014 This guy deserves a coffee
  • 3. “by provisioning the right amount of storage and compute resources, cost can be significantly reduced with no significant impact on application performance”
  • 4. CDD’s Influence spreads beyond the cloud 2014 2004 - present
  • 5.
  • 6. Small Data circa late 1990’s Big Data circa 2010’s ~193,000 cpds Drug Metab Dispos, 38: 2083-2090, 2010
  • 7. -3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 -3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 logIC50-tips log IC50-acoustic Hydrophobi c features (HPF) Hydrogen bond acceptor (HBA) Hydrogen bond donor (HBD) Observed vs. predicted IC50 r Acoustic mediated process 2 1 1 0.92 Tip-based process 0 2 1 0.80 Acoustic Tip based Generated with Discovery Studio (Accelrys) Cyan = hydrophobic Green = hydrogen bond acceptor Purple = hydrogen bond donor Each model shows most potent molecule mapping How you dispense liquids may be important: insights from small data PLoS ONE 8(5): e62325 (2013)
  • 8. Future: sharing chemical relationships without structures Matlock and Swamidass J. Chem. Inf. Model. 2014, 54, 37−48
  • 9. Drug Discovery Archeology • Still a heavy emphasis on “testing” “doing “ rather than ‘learning’ • Mining data and historic data will increase in value • Data becomes a repurposing opportunity • How do we position databases for this? • What about neglected diseases?
  • 10. Tuberculosis kills 1.6-1.7m/yr (~1 every 8 seconds) 1/3rd of worlds population infected!!!! streptomycin (1943) para-aminosalicyclic acid (1949) isoniazid (1952) pyrazinamide (1954) cycloserine (1955) ethambutol (1962) rifampicin (1967) Multi drug resistance in 4.3% of cases Extensively drug resistant increasing incidence one new drug (bedaquiline) in 40 yrs
  • 11. Ponder et al., Pharm Res 31: 271-277, 2014 Source G-Finder
  • 12. Freundlich Laboratory Collaborations Rely on CDD for Data Tracking! • Three collaborations within Rutgers–NJMS • Collaboration with Johns Hopkins, SRI, and CDD • Collaboration with Johns Hopkins • Collaboration with CDD Supported by 7 Active NIH Grants Chemical Probe Evolution Drug Discovery Compound Evolution Target Identification & Validation
  • 13. Godbole et al., Biochem Biophys Res Comm 2014, in press 24 groups in this project use a single
  • 14. Biotin biosynthesis dethiobiotin Pharmacophore Searching Maybridge (57K) gives 72 molecules – many of them hydrophobic so they stand a chance of in vitro activity Take substrate and generate 3D conformers and build a pharmacophore Use the pharmacophore to search vendor libraries in 3D Buy and test compounds Fishing: Example of mimic strategy for bioB Rv1589  Sarker et al., Pharm Res 2012, 29:2115-27
  • 15. Over 5 years analyzed in vitro data and built models Top scoring molecules assayed for Mtb growth inhibition Mtb screening molecule database/s High-throughput phenotypic Mtb screening Descriptors + Bioactivity (+Cytotoxicity) Bayesian Machine Learning classification Mtb Model Molecule Database (e.g. GSK malaria actives) virtually scored using Bayesian Models New bioactivity data may enhance models Identify in vitro hits and test models 3 x published prospective tests >20% hit rate Multiple retrospective tests 3-10 fold enrichment N H S N Ekins et al., Pharm Res 31: 414-435, 2014 Ekins, et al., Tuberculosis 94; 162-169, 2014 Ekins, et al., PLOSONE 8; e63240, 2013 Ekins, et al., Chem Biol 20: 370-378, 2013 Ekins, et al., JCIM, 53: 3054−3063, 2013 Ekins and Freundlich, Pharm Res, 28, 1859-1869, 2011 Ekins et al., Mol BioSyst, 6: 840-851, 2010 Ekins, et al., Mol. Biosyst. 6, 2316-2324, 2010,
  • 16. A summary of some of the numbers involved – filtering for hits. >250,000 molecules screened through Bayesian models ~750 molecules were tested in vitro 198 actives were identified >20 % hit rate Identified several novel potent hit series with good cytotoxicity & selectivity Identified known human kinase inhibitors and FDA approved drugs as hits Ekins et al., PLOSONE 2013 May 7;8(5):e63240; Ekins et al.,Chem Biol 20, 370–378, 2013 Ekins et al., Tuberculosis 94: 162-169
  • 17. • BAS00521003/ TCMDC-125802 reported to be a P. falciparum lactate dehydrogenase inhibitor • Only one report of antitubercular activity from 1969 - solid agar MIC = 1 mg/mL (“wild strain”) - “no activity” in mouse model up to 400 mg/kg - however, activity was solely judged by extension of survival! Bruhin, H. et al., J. Pharm. Pharmac. 1969, 21, 423-433. . MIC of 0.0625 ug/mL • 64X MIC affords 6 logs of kill • Resistance and/or drug instability beyond 14 d Vero cells : CC50 = 4.0 mg/mL Selectivity Index SI = CC50/MICMtb = 16 – 64 In mouse no toxicity but also no efficacy in GKO model – probably metabolized. Ekins et al.,Chem Biol 20, 370–378, 2013 Taking a compound in vivo identifies issues
  • 18. Filling out the triazine matrix using SARtable: A new kind of map Green = good activity, Red = bad; colored dots are predictions
  • 19. Tested >350,000 molecules Tested ~2M 2M >300,000 >1500 active and non toxic Published 177 100s 800 Big Data: Screening for New Tuberculosis Treatments How many will become a new drug? How do we learn from this big data? Others have likely screened another 500,000
  • 20. Hunting High and Low for new molecules to test We need to search sources.. From the Oceans… To the ground To the trees To the air.. And do it virtually Find new libraries to screen virtually and test
  • 21. Take everything out of CDD public • Run through TB Bayesian models • Score • Test
  • 22. What is the next bottleneck? $ $ $ $ $Five-fold increase in the publication of TB mouse model studies from 1997 to 2009 Franco, PLoS One 7, e47723 (2012).
  • 23. Billions of $ of your money spent on TB but no database of mouse in vivo data !
  • 24. Hunting for the in vivo data It’s out there.. be patient
  • 25. Building the mouse TB database Manually curated, structures sketched Mobile Molecular DataSheet (MMDS) iOS app or ChemDraw (Perkin Elmer) Downloaded from www.chemspider.com Combined with pertinent data fields 1 log10 reduction in Mtb colony-forming units (CFUs) in the lungs Publically available CDD TB database (In process)
  • 26. 30 years with little TB mouse in vivo data MIND THE TB GAP JCIM In Press 2014 Ekins, Nuermberger & Freundlich Submitted
  • 27. Where are the New TB drugs to be found? PCA of in vivo (yellow) and compounds with known targets (blue) PCA of TB in vitro actives (blue) TB in vivo (yellow) PCA of In vivo actives and inactives Inactive (blue) Active (yellow) JCIM In Press 2014
  • 28. Machine Learning Models Bayesian Support Vector Machine Recursive partitioning (single and multiple trees) Using Accelrys Discovery Studio and R. RP Forest RP Single Tree SVM Bayesian 0.75 0.71 0.77 0.73ROC 5 fold cross validation JCIM In Press 2014
  • 29. RP Forest RP Single Tree SVM Bayesian 3 /11 (27.2%) 4/11 (36.4%) 7/11 (63.6%) 8/11 (72.7%) External Test set Studio and R. 11 additional active molecules obtained from 1953-2013 JCIM In Press 2014
  • 30. A much higher ratio of compounds were tested in vivo to in vitro in the 1940s-1960s rather than now Infrastructure to provide a clear understanding of the position of compounds in the pipeline is essentially lacking Shortage of new candidates suggest we may lack the commitment and resources we had 6o years ago Use machine learning in vivo models to prioritize Mouse studies Ekins, Nuermberger & Freundlich Submitted The Clock is ticking
  • 31. MoDELS RESIDE IN PAPERS NOT ACCESSIBLE…THIS IS UNDESIRABLE How do we share them? How do we use Them?
  • 32. ECFP_6 FCFP_6 • Collected, deduplicated, hashed • Sparse integers • Invented for Pipeline Pilot: public method, proprietary details • Often used with Bayesian models: many published papers • Built a new implementation: open source, Java, CDK – stable: fingerprints don't change with each new toolkit release – well defined: easy to document precise steps – easy to port: already migrated to iOS (Objective-C) for TB Mobile app • Provides core basis feature for CDD open source model service
  • 33. Dataset Leave one out ROC Published Reference Leave one out ROC Open fingerprints In vivo data (773 molecules) FCFP_6 fingerprints 0.77 this study 0.75 Combined model (5304 molecules) FCFP_6 fingerprints 0.71 J Chem Inf Model 53:3054- 3063. 0.77 MLSMR dual event model (2273 molecules) and FCFP_6 fingerprints 0.86 PLOSONE 8:e63240 0.83 Same datasets – Versus published data Clark et al., submitted 2014
  • 34. Open fingerprints and bayesian method used in TB Mobile Vers.2 Could we add in vivo prediction models to this? Ekins et al., J Cheminform 5:13, 2013 Predict targets Cluster molecules http://goo.gl/vPOKS http://goo.gl/iDJFR
  • 35. In vitro data In vivo data Target data ADME/Tox data & Models Drug-like scaffold creation TB Prediction Tools TB Publications Data sources and tools we could integrate
  • 36. Future: How can we tackle more diseases?
  • 37. Chagas Disease Reverse the mimic approach to predict targets of hits Use pharmacophpores for targets e.g. CYP51 Use machine learning models to identify novel compounds Test in vitro
  • 38. The new faces of personalized medicine: children with rare diseases
  • 39. The Rare Disease Parent Odyssey • Diagnosis of child • Try to find out about disease – papers behind paywall • Try to connect with scientists • Form not-for-profit • Raise funds • Fund Scientific research on disease • Advocate for support from NIH, FDA etc • Start a company • Try to find a cure before its too late Could we create a rare disease community for scientists & foundations ? Wood J, Drug Disc Today, 18: 1043–1051, 2013
  • 40. Rare diseases inspired an App that may be a new kind of database upload molecules by tweeting them- 1 tweet upload Take our data with us anywhere Bring data off the cloud into device Advantages you get to analyze it in the Cloud on a plane Future: how do we deliver data
  • 41. All at CDD and many others …Funding: 1R41AI088893-01, 2R42AI088893-02, R43 LM011152-01, 9R44TR000942-02, 1R41AI108003-01, MM4TB, Software: Accelrys