SlideShare a Scribd company logo
Automated QSAR Modelling  David E Leahy Newcastle University, UK & Damjan Krstajic Research Centre for Cheminformatics, Serbia
Discovery Bus ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],www.discoverybus.com   “ The Discovery Bus is not a tool for users. It is a system for deriving QSAR models independent of any user”
Discovery Bus QSAR
Chemical structure & response data Transform response 1/X logX X class Split and stratify ? Calculate descriptors D E H L R A Combine descriptors Filter features A&D A&L L&H&R A&E E&D A&D&R ... no filter cfs1 cfs2 cfs4 cfs5 cfs3 Cross validate Build models Test model Rnnet Rrpart Rlin Rpls GARMLR NetlabNN GUIDE GAWRMLR 4 x 8 x 6 x 8 = 1536 models ?&? ? new ff New method? 4 x 8 = 32 filter feature requests 32 filter feature requests x 8 = 256 models 10%
Solubility
Solubility Results  Learner Filter  Reduction Types Linear Fit Training (1167) Test (130) Filter Learner Rel.MSE r 2, Rel.MSE r 2, GUIDE         H 1990 -> 558 -> 54 R,D 1.46 0.11 0.89 0.12 0.89 H 170 -> 26 -> 14 A,E,H,D 0.13 0.11 0.89 0.13 0.88 H 80 -> 16 -> 12 A,H,D 0.14 0.11 0.88 0.12 0.87 C 250 -> 2 -> 2 A,R 0.18 0.13 0.87 0.16 0.84 C 8 -> 2 -> 2 A,L 0.16 0.13 0.87 0.16 0.86 GA1   H 80 -> 16 -> 16 A,H,D 0.14 0.14 0.86 0.18 0.83 C 8 -> 2 -> 2 A,L 0.16 0.17 0.84 0.17 0.83 NN1     H 250 -> 54 -> 54 A,R 0.12 0.09 0.91 0.08 0.92 H 80 -> 16 -> 16 A,H,D 0.14 0.10 0.90 0.12 0.88 H 326 -> 46 -> 46 H,R,D 0.18 0.10 0.90 0.12 0.89
HSA Binding
HSA Binding Learner Filter Reduction Types Linear Fit Training (82) Test (9) Filter Learner Rel.MSE r 2 Rel.MSE r 2 Guide Hh2 332 -> 39 -> 8 A,E,R 0.92 0.40 0.62 0.25 0.81 H 250 -> 59 -> 12 A,R 1.62 0.47 0.56 0.30 0.76 Hh4 382 -> 20 -> 1 A 0.25 0.50 0.50 0.57 0.49 GA1 Hh2 1998 -> 39 -> 26 A,R,D 0.42 0.23 0.77 0.20 0.85 Hh4 344 -> 20 -> 19 H,R,D 0.42 0.26 0.74 0.28 0.78 Hh10 302 -> 9 -> 9 H,R 0.27 0.27 0.73 0.40 0.64 NN1 H 8 -> 5 -> 5 A,L 0.37 0.17 0.83 0.15 0.87 Hh10 346 -> 8 -> 8 A,R,D 0.30 0.30 0.70 0.16 0.84 H 302 -> 19 -> 19 H,R 0.27 0.32 0.70 0.39 0.71
P-Glycoprotein Technique % Correctly Classified  Training Set % Correctly Classified  Test Set Neural  Net Classifier 95.6 69.7 R Part 90.4 81.0
Discovery Bus Architecture
Summary ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Current & Future Work in QSAR ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Reverse QSAR Engineering
Forager: A PSO for Reverse QSAR  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Forager Optimisation Thanks to Tudor Oprea for a copy of Wombat
Colonist
Acknowledgements

More Related Content

Viewers also liked

QSAR Study on Antitubercular Drug Derivatives
QSAR Study on Antitubercular Drug DerivativesQSAR Study on Antitubercular Drug Derivatives
QSAR Study on Antitubercular Drug Derivatives
Lydia Yeshitla
 
Data Analysis in QSAR
Data Analysis in QSARData Analysis in QSAR
Data Analysis in QSAR
baoilleach
 
25.qsar
25.qsar25.qsar
Effect of substituents and functions on drug structure activity relationships
Effect of substituents and functions on drug structure activity relationshipsEffect of substituents and functions on drug structure activity relationships
Effect of substituents and functions on drug structure activity relationships
Omar Sokkar
 
Computer aided drug design
Computer aided drug designComputer aided drug design
Computer aided drug design
Ali Ahsan
 
Introduction to Quantitative Structure Activity Relationships
Introduction to Quantitative Structure Activity RelationshipsIntroduction to Quantitative Structure Activity Relationships
Introduction to Quantitative Structure Activity Relationships
Omar Sokkar
 
Meta QSAR
Meta QSARMeta QSAR
Meta QSAR
David Leahy
 
Computer Aided Drug Design
Computer Aided Drug DesignComputer Aided Drug Design
Computer Aided Drug Design
pooja sabarinathan
 
Qsar
QsarQsar
Qsar
nehla313
 
Qsar lecture
Qsar lectureQsar lecture
Qsar lecture
shishirkawde
 
QSAR
QSARQSAR
Qsar by hansch analysis
Qsar by hansch analysisQsar by hansch analysis
Qsar by hansch analysis
bhavnesh munjal
 
Computational Drug Design
Computational Drug DesignComputational Drug Design
Computational Drug Design
baoilleach
 
Free wilson analysis qsar
Free wilson analysis qsarFree wilson analysis qsar
Free wilson analysis qsar
Rahul B S
 
Computer aided drug designing
Computer aided drug designing Computer aided drug designing
Computer aided drug designing
Ayesha Aftab
 
QSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative StructureQSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative Structure
Saramita De Chakravarti
 
Qsar
QsarQsar
Qsar
Rahul B S
 
Structure activity relation ship
Structure activity relation shipStructure activity relation ship
Structure activity relation ship
Akshil Mehta
 
Qsar and drug design ppt
Qsar and drug design pptQsar and drug design ppt
Qsar and drug design ppt
Abhik Seal
 

Viewers also liked (19)

QSAR Study on Antitubercular Drug Derivatives
QSAR Study on Antitubercular Drug DerivativesQSAR Study on Antitubercular Drug Derivatives
QSAR Study on Antitubercular Drug Derivatives
 
Data Analysis in QSAR
Data Analysis in QSARData Analysis in QSAR
Data Analysis in QSAR
 
25.qsar
25.qsar25.qsar
25.qsar
 
Effect of substituents and functions on drug structure activity relationships
Effect of substituents and functions on drug structure activity relationshipsEffect of substituents and functions on drug structure activity relationships
Effect of substituents and functions on drug structure activity relationships
 
Computer aided drug design
Computer aided drug designComputer aided drug design
Computer aided drug design
 
Introduction to Quantitative Structure Activity Relationships
Introduction to Quantitative Structure Activity RelationshipsIntroduction to Quantitative Structure Activity Relationships
Introduction to Quantitative Structure Activity Relationships
 
Meta QSAR
Meta QSARMeta QSAR
Meta QSAR
 
Computer Aided Drug Design
Computer Aided Drug DesignComputer Aided Drug Design
Computer Aided Drug Design
 
Qsar
QsarQsar
Qsar
 
Qsar lecture
Qsar lectureQsar lecture
Qsar lecture
 
QSAR
QSARQSAR
QSAR
 
Qsar by hansch analysis
Qsar by hansch analysisQsar by hansch analysis
Qsar by hansch analysis
 
Computational Drug Design
Computational Drug DesignComputational Drug Design
Computational Drug Design
 
Free wilson analysis qsar
Free wilson analysis qsarFree wilson analysis qsar
Free wilson analysis qsar
 
Computer aided drug designing
Computer aided drug designing Computer aided drug designing
Computer aided drug designing
 
QSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative StructureQSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative Structure
 
Qsar
QsarQsar
Qsar
 
Structure activity relation ship
Structure activity relation shipStructure activity relation ship
Structure activity relation ship
 
Qsar and drug design ppt
Qsar and drug design pptQsar and drug design ppt
Qsar and drug design ppt
 

Similar to Discovery Bus: UK QSAR meeting at GSK

Caret max kuhn
Caret max kuhnCaret max kuhn
Caret max kuhn
kmettler
 
Caret Package for R
Caret Package for RCaret Package for R
Caret Package for R
kmettler
 
FPGA Implementation of a GA
FPGA Implementation of a GAFPGA Implementation of a GA
FPGA Implementation of a GA
Hocine Merabti
 
Automated QSAR
Automated QSAR Automated QSAR
Automated QSAR
David Leahy
 
aserra_phdthesis_ppt
aserra_phdthesis_pptaserra_phdthesis_ppt
aserra_phdthesis_ppt
aserrapages
 
Unsupervised selection of mother wavelets and parameter optimization
Unsupervised selection of mother wavelets and parameter optimizationUnsupervised selection of mother wavelets and parameter optimization
Unsupervised selection of mother wavelets and parameter optimization
Md Kafiul Islam
 
The caret Package: A Unified Interface for Predictive Models
The caret Package: A Unified Interface for Predictive ModelsThe caret Package: A Unified Interface for Predictive Models
The caret Package: A Unified Interface for Predictive Models
NYC Predictive Analytics
 
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
Kamel Mansouri
 
Lab 2: Classification and Regression Prediction Models, training and testing ...
Lab 2: Classification and Regression Prediction Models, training and testing ...Lab 2: Classification and Regression Prediction Models, training and testing ...
Lab 2: Classification and Regression Prediction Models, training and testing ...
Yao Yao
 
Introduction to Genetic algorithm and its significance in VLSI design and aut...
Introduction to Genetic algorithm and its significance in VLSI design and aut...Introduction to Genetic algorithm and its significance in VLSI design and aut...
Introduction to Genetic algorithm and its significance in VLSI design and aut...
Centre for Electronics, Computer, Self development
 
Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...
Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...
Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...
Informatikai Intézet
 
Open Science Data Repository - Dataledger
Open Science Data Repository - DataledgerOpen Science Data Repository - Dataledger
Open Science Data Repository - Dataledger
Alexandru Korotcov
 
Prediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source toolsPrediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source tools
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Forecast of long term wind speed based on optimized support vector regression...
Forecast of long term wind speed based on optimized support vector regression...Forecast of long term wind speed based on optimized support vector regression...
Forecast of long term wind speed based on optimized support vector regression...
Aboul Ella Hassanien
 
Algorithm Selection for Preferred Extensions Enumeration
Algorithm Selection for Preferred Extensions EnumerationAlgorithm Selection for Preferred Extensions Enumeration
Algorithm Selection for Preferred Extensions Enumeration
Federico Cerutti
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
ChemAxon
 
Automated parameter optimization should be included in future 
defect predict...
Automated parameter optimization should be included in future 
defect predict...Automated parameter optimization should be included in future 
defect predict...
Automated parameter optimization should be included in future 
defect predict...
Chakkrit (Kla) Tantithamthavorn
 
Svd filtered temporal usage clustering
Svd filtered temporal usage clusteringSvd filtered temporal usage clustering
Svd filtered temporal usage clustering
Liang Xie, PhD
 
Machine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpointsMachine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpoints
Valery Tkachenko
 
Using HOG Descriptors on Superpixels for Human Detection of UAV Imagery
Using HOG Descriptors on Superpixels for Human Detection of UAV ImageryUsing HOG Descriptors on Superpixels for Human Detection of UAV Imagery
Using HOG Descriptors on Superpixels for Human Detection of UAV Imagery
Wai Nwe Tun
 

Similar to Discovery Bus: UK QSAR meeting at GSK (20)

Caret max kuhn
Caret max kuhnCaret max kuhn
Caret max kuhn
 
Caret Package for R
Caret Package for RCaret Package for R
Caret Package for R
 
FPGA Implementation of a GA
FPGA Implementation of a GAFPGA Implementation of a GA
FPGA Implementation of a GA
 
Automated QSAR
Automated QSAR Automated QSAR
Automated QSAR
 
aserra_phdthesis_ppt
aserra_phdthesis_pptaserra_phdthesis_ppt
aserra_phdthesis_ppt
 
Unsupervised selection of mother wavelets and parameter optimization
Unsupervised selection of mother wavelets and parameter optimizationUnsupervised selection of mother wavelets and parameter optimization
Unsupervised selection of mother wavelets and parameter optimization
 
The caret Package: A Unified Interface for Predictive Models
The caret Package: A Unified Interface for Predictive ModelsThe caret Package: A Unified Interface for Predictive Models
The caret Package: A Unified Interface for Predictive Models
 
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
 
Lab 2: Classification and Regression Prediction Models, training and testing ...
Lab 2: Classification and Regression Prediction Models, training and testing ...Lab 2: Classification and Regression Prediction Models, training and testing ...
Lab 2: Classification and Regression Prediction Models, training and testing ...
 
Introduction to Genetic algorithm and its significance in VLSI design and aut...
Introduction to Genetic algorithm and its significance in VLSI design and aut...Introduction to Genetic algorithm and its significance in VLSI design and aut...
Introduction to Genetic algorithm and its significance in VLSI design and aut...
 
Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...
Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...
Blanka Láng, László Kovács and László Mohácsi: Linear regression model select...
 
Open Science Data Repository - Dataledger
Open Science Data Repository - DataledgerOpen Science Data Repository - Dataledger
Open Science Data Repository - Dataledger
 
Prediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source toolsPrediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source tools
 
Forecast of long term wind speed based on optimized support vector regression...
Forecast of long term wind speed based on optimized support vector regression...Forecast of long term wind speed based on optimized support vector regression...
Forecast of long term wind speed based on optimized support vector regression...
 
Algorithm Selection for Preferred Extensions Enumeration
Algorithm Selection for Preferred Extensions EnumerationAlgorithm Selection for Preferred Extensions Enumeration
Algorithm Selection for Preferred Extensions Enumeration
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
 
Automated parameter optimization should be included in future 
defect predict...
Automated parameter optimization should be included in future 
defect predict...Automated parameter optimization should be included in future 
defect predict...
Automated parameter optimization should be included in future 
defect predict...
 
Svd filtered temporal usage clustering
Svd filtered temporal usage clusteringSvd filtered temporal usage clustering
Svd filtered temporal usage clustering
 
Machine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpointsMachine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpoints
 
Using HOG Descriptors on Superpixels for Human Detection of UAV Imagery
Using HOG Descriptors on Superpixels for Human Detection of UAV ImageryUsing HOG Descriptors on Superpixels for Human Detection of UAV Imagery
Using HOG Descriptors on Superpixels for Human Detection of UAV Imagery
 

More from David Leahy

AI is the Future of Drug Discovery
AI is the Future of Drug DiscoveryAI is the Future of Drug Discovery
AI is the Future of Drug Discovery
David Leahy
 
Most Drug Discovery Scientists could be replaced by Software Systems
Most Drug Discovery Scientists could be replaced by Software SystemsMost Drug Discovery Scientists could be replaced by Software Systems
Most Drug Discovery Scientists could be replaced by Software Systems
David Leahy
 
From Hammett to the Semantic Web
From Hammett to the Semantic WebFrom Hammett to the Semantic Web
From Hammett to the Semantic Web
David Leahy
 
InkSpot Science presentation at Open Science Meeting
InkSpot Science presentation at Open Science MeetingInkSpot Science presentation at Open Science Meeting
InkSpot Science presentation at Open Science Meeting
David Leahy
 
PBPK simulation as an alternative to animal testing
PBPK simulation as an alternative to animal testingPBPK simulation as an alternative to animal testing
PBPK simulation as an alternative to animal testing
David Leahy
 
Forager Poster
Forager PosterForager Poster
Forager Poster
David Leahy
 
Colonist
ColonistColonist
Colonist
David Leahy
 

More from David Leahy (7)

AI is the Future of Drug Discovery
AI is the Future of Drug DiscoveryAI is the Future of Drug Discovery
AI is the Future of Drug Discovery
 
Most Drug Discovery Scientists could be replaced by Software Systems
Most Drug Discovery Scientists could be replaced by Software SystemsMost Drug Discovery Scientists could be replaced by Software Systems
Most Drug Discovery Scientists could be replaced by Software Systems
 
From Hammett to the Semantic Web
From Hammett to the Semantic WebFrom Hammett to the Semantic Web
From Hammett to the Semantic Web
 
InkSpot Science presentation at Open Science Meeting
InkSpot Science presentation at Open Science MeetingInkSpot Science presentation at Open Science Meeting
InkSpot Science presentation at Open Science Meeting
 
PBPK simulation as an alternative to animal testing
PBPK simulation as an alternative to animal testingPBPK simulation as an alternative to animal testing
PBPK simulation as an alternative to animal testing
 
Forager Poster
Forager PosterForager Poster
Forager Poster
 
Colonist
ColonistColonist
Colonist
 

Recently uploaded

5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
Data Hops
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 

Recently uploaded (20)

5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3FREE A4 Cyber Security Awareness  Posters-Social Engineering part 3
FREE A4 Cyber Security Awareness Posters-Social Engineering part 3
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 

Discovery Bus: UK QSAR meeting at GSK

  • 1. Automated QSAR Modelling David E Leahy Newcastle University, UK & Damjan Krstajic Research Centre for Cheminformatics, Serbia
  • 2.
  • 4. Chemical structure & response data Transform response 1/X logX X class Split and stratify ? Calculate descriptors D E H L R A Combine descriptors Filter features A&D A&L L&H&R A&E E&D A&D&R ... no filter cfs1 cfs2 cfs4 cfs5 cfs3 Cross validate Build models Test model Rnnet Rrpart Rlin Rpls GARMLR NetlabNN GUIDE GAWRMLR 4 x 8 x 6 x 8 = 1536 models ?&? ? new ff New method? 4 x 8 = 32 filter feature requests 32 filter feature requests x 8 = 256 models 10%
  • 6. Solubility Results Learner Filter Reduction Types Linear Fit Training (1167) Test (130) Filter Learner Rel.MSE r 2, Rel.MSE r 2, GUIDE         H 1990 -> 558 -> 54 R,D 1.46 0.11 0.89 0.12 0.89 H 170 -> 26 -> 14 A,E,H,D 0.13 0.11 0.89 0.13 0.88 H 80 -> 16 -> 12 A,H,D 0.14 0.11 0.88 0.12 0.87 C 250 -> 2 -> 2 A,R 0.18 0.13 0.87 0.16 0.84 C 8 -> 2 -> 2 A,L 0.16 0.13 0.87 0.16 0.86 GA1   H 80 -> 16 -> 16 A,H,D 0.14 0.14 0.86 0.18 0.83 C 8 -> 2 -> 2 A,L 0.16 0.17 0.84 0.17 0.83 NN1     H 250 -> 54 -> 54 A,R 0.12 0.09 0.91 0.08 0.92 H 80 -> 16 -> 16 A,H,D 0.14 0.10 0.90 0.12 0.88 H 326 -> 46 -> 46 H,R,D 0.18 0.10 0.90 0.12 0.89
  • 8. HSA Binding Learner Filter Reduction Types Linear Fit Training (82) Test (9) Filter Learner Rel.MSE r 2 Rel.MSE r 2 Guide Hh2 332 -> 39 -> 8 A,E,R 0.92 0.40 0.62 0.25 0.81 H 250 -> 59 -> 12 A,R 1.62 0.47 0.56 0.30 0.76 Hh4 382 -> 20 -> 1 A 0.25 0.50 0.50 0.57 0.49 GA1 Hh2 1998 -> 39 -> 26 A,R,D 0.42 0.23 0.77 0.20 0.85 Hh4 344 -> 20 -> 19 H,R,D 0.42 0.26 0.74 0.28 0.78 Hh10 302 -> 9 -> 9 H,R 0.27 0.27 0.73 0.40 0.64 NN1 H 8 -> 5 -> 5 A,L 0.37 0.17 0.83 0.15 0.87 Hh10 346 -> 8 -> 8 A,R,D 0.30 0.30 0.70 0.16 0.84 H 302 -> 19 -> 19 H,R 0.27 0.32 0.70 0.39 0.71
  • 9. P-Glycoprotein Technique % Correctly Classified Training Set % Correctly Classified Test Set Neural Net Classifier 95.6 69.7 R Part 90.4 81.0
  • 11.
  • 12.
  • 14.
  • 15. Forager Optimisation Thanks to Tudor Oprea for a copy of Wombat