Data Mining Using Consensus Fingerprint Searches
James R. Arnold *, Charles L. Lerman *, William F. Michne *, David A. Cosgrove ^, James R. Damewood *
AstraZeneca: CNS Chemistry, Wilmington, USA (*); Cancer and Infection Chemistry, Alderley Park, UK (^)
Development Goals
Fingerprint Based Classification and Data Mining
Classification is based on 400 medicinally relevant functional groups.
Example: Imigran (1), GSK's treatment for migraine, a 1.07 billion dollar product in 2000.
Pattern Matching Rules
* The exclusions make the functional group definitions specific and make the entire set as orthogonal as possible.
Classification Quality: Coverage and Overlap of Definitions
Coverage: all heteroatoms in the molecule are classified.
Overlap: a heteroatom in the molecule is classified in more than one functional group.
Panels: Ideal Coverage; Ideal Overlap.
Testing in medicinally relevant databases: roughly 90% coverage and 10% overlap.
Databases: CMC = 8,545; MDDR = 135,342; MedCh = 145,158
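The coverage and overlap definitions above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the slides do not say how the percentages were aggregated, so the sketch assumes they are fractions of molecules, and the function names, the input layout (atom-index sets per functional group), and the toy molecules are all hypothetical.

```python
from collections import Counter


def classify_molecule(heteroatoms, group_matches):
    """Return (fully_covered, has_overlap) for one molecule.

    heteroatoms   -- set of heteroatom indices in the molecule
    group_matches -- dict: functional group name -> set of matched atom indices
    (Both inputs are a hypothetical layout, not taken from the slides.)
    """
    hits_per_atom = Counter()
    for atoms in group_matches.values():
        for atom in atoms & heteroatoms:
            hits_per_atom[atom] += 1
    # Coverage: every heteroatom is classified by at least one group.
    fully_covered = all(hits_per_atom[a] >= 1 for a in heteroatoms)
    # Overlap: some heteroatom is classified by more than one group.
    has_overlap = any(hits_per_atom[a] > 1 for a in heteroatoms)
    return fully_covered, has_overlap


def database_rates(molecules):
    """Fraction of molecules that are fully covered, and that show overlap."""
    results = [classify_molecule(h, g) for h, g in molecules]
    n = len(results)
    coverage = sum(1 for covered, _ in results if covered) / n
    overlap = sum(1 for _, overlapped in results if overlapped) / n
    return coverage, overlap


# Toy data (invented for illustration only).
toy_database = [
    ({1, 4}, {"amide": {1}, "amine": {4}}),      # covered, no overlap
    ({2, 5}, {"ester": {2, 5}, "ether": {5}}),   # covered, atom 5 overlaps
    ({0, 3}, {"alcohol": {0}}),                  # atom 3 is unclassified
    ({7}, {"nitrile": {7}}),                     # covered, no overlap
]
coverage, overlap = database_rates(toy_database)
```

On the toy data, three of four molecules are fully covered and one of four shows overlap.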
Validation: 538 Classes and 300,000 Compounds
Plots: # Functional Groups, MDDR (Cumulative); # Cpds and # Clust in Tgt. Classes
* Clusters generated with Daylight fingerprints at Tanimoto = 0.3
Average Percentage Actives Recovered: 538 Target Classes in MDDR 2003
Recovery Rates (%)
        Top 100   Top 500   Top 1,000   Top 5,000
Bin     25.7      49.6      59.4        75.8
Ct      31.4      54.3      63.1        78.1
Day     38.2      56.4      68.3        82.2
Cons    37.7      65.0      74.5        87.9
> 60% of actives in the top 1% of the database (MDDR 2003, > 135,000 cpds).
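A recovery rate of this kind can be computed from a ranked hit list as the percentage of a class's known actives that appear in the top N. The sketch below assumes that reading of the table; the function name, compound IDs, and ranking are hypothetical and are not the data behind the slide.

```python
def percent_actives_recovered(ranked_ids, actives, top_n):
    """Percentage of known actives found in the first top_n ranked compounds."""
    if not actives:
        raise ValueError("the target class needs at least one active")
    found = set(ranked_ids[:top_n]) & actives
    return 100.0 * len(found) / len(actives)


# Toy ranking, most similar compound first (invented for illustration).
ranked = ["c3", "c9", "c1", "c7", "c2", "c8"]
class_actives = {"c1", "c2", "c3", "c4"}

recovered_top3 = percent_actives_recovered(ranked, class_actives, 3)
recovered_top6 = percent_actives_recovered(ranked, class_actives, 6)
```

Averaging this value over all target classes would give one cell of a table like the one above.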
Tanimoto Enrichment Rate Analysis: 538 Target Classes in MDDR 2003
Enrichment Rate Equation
E = (A / B) / (ADB / NDB)
A = # actives at Tanimoto
B = # cpds total at Tanimoto
ADB = total actives in DBase
NDB = total cpds in DBase
Enrichments normalized for the number of actives in target class.
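The enrichment rate equation translates directly into code. The sketch below is a minimal illustration; the function name and the example counts are hypothetical, and only the formula itself comes from the slide.

```python
def enrichment_rate(a, b, adb, ndb):
    """E = (A / B) / (ADB / NDB).

    a   -- number of actives retrieved at the Tanimoto cutoff (A)
    b   -- total number of compounds retrieved at the cutoff (B)
    adb -- total actives in the database (ADB)
    ndb -- total compounds in the database (NDB)
    """
    if b == 0 or adb == 0 or ndb == 0:
        raise ValueError("B, ADB and NDB must all be non-zero")
    hit_rate = a / b              # fraction of retrieved compounds that are active
    background_rate = adb / ndb   # fraction of the whole database that is active
    return hit_rate / background_rate


# Hypothetical counts: 40 actives among 200 retrieved compounds,
# for a class with 500 actives in a database of 135,000 compounds.
example_e = enrichment_rate(40, 200, 500, 135_000)
```

With these invented counts the retrieved set is about 54 times richer in actives than a random selection from the database; E = 1 would mean no enrichment.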
Consensus Approach: Overlap of True Positives from FG Count and Daylight
The circles are drawn to scale according to the number of actives recovered at given Tanimoto distances.
Performance of the FG Count, Daylight and Consensus Approaches in Terms of True and False Positives
Number of true and false positives for the functional group fingerprint count, Daylight fingerprint, and consensus (logical "AND") approaches for the 538 biological target classes at Tanimoto distances of 0.0, 0.1, 0.2, and 0.3. The three methods are binned at each Tanimoto distance and reported in the order counts, Daylight, consensus, labeled FG, D, and C, respectively.
50% reduction in false positives.
FG = FG Count; D = Daylight; C = Consensus
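The consensus (logical "AND") step can be sketched as the intersection of two hit lists, each taken at the same Tanimoto distance cutoff. Everything below is an assumption made for illustration: Tanimoto distance is taken as 1 minus similarity, the count fingerprints use a min/max generalization of Tanimoto that the slides do not specify, plain sets of bit positions stand in for Daylight fingerprints, and all names and compounds are invented.

```python
def tanimoto_bits(a, b):
    """Tanimoto similarity of two bit fingerprints given as sets of set bits."""
    union = len(a | b)
    return len(a & b) / union if union else 1.0


def tanimoto_counts(x, y):
    """Min/max Tanimoto for count fingerprints (dict: feature -> count).
    This generalization is an assumption, not taken from the slides."""
    keys = set(x) | set(y)
    num = sum(min(x.get(k, 0), y.get(k, 0)) for k in keys)
    den = sum(max(x.get(k, 0), y.get(k, 0)) for k in keys)
    return num / den if den else 1.0


def hits(query, library, similarity, max_distance):
    """Names of library compounds within max_distance (1 - similarity) of query."""
    return {name for name, fp in library.items()
            if 1.0 - similarity(query, fp) <= max_distance}


def true_false_positives(hit_set, actives):
    """(true positives, false positives) for a hit set against known actives."""
    return len(hit_set & actives), len(hit_set - actives)


# Toy data, invented for illustration.
fg_query = {"amine": 1, "indole": 1, "sulfonamide": 1}
fg_library = {
    "cpd1": {"amine": 1, "indole": 1, "sulfonamide": 1},
    "cpd2": {"amine": 1, "indole": 1, "sulfonamide": 1, "ether": 1},
    "cpd3": {"amine": 2, "indole": 1, "sulfonamide": 1},
    "cpd4": {"ester": 1},
}
bit_query = {1, 2, 3, 4}
bit_library = {
    "cpd1": {1, 2, 3, 4},
    "cpd2": {1, 2, 7, 8},
    "cpd3": {1, 2, 3, 4, 5},
    "cpd4": {1, 2, 3, 4},
}
actives = {"cpd1", "cpd3"}

fg_hits = hits(fg_query, fg_library, tanimoto_counts, 0.3)
d_hits = hits(bit_query, bit_library, tanimoto_bits, 0.3)
consensus_hits = fg_hits & d_hits  # logical AND of the two methods

fg_tp_fp = true_false_positives(fg_hits, actives)
d_tp_fp = true_false_positives(d_hits, actives)
c_tp_fp = true_false_positives(consensus_hits, actives)
```

In this toy case each single method returns one false positive, but a different one, so the intersection keeps both true positives and drops both false positives. Real data would not separate this cleanly.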
Conclusions: Data Mining Using Consensus Searches

More Related Content

What's hot

3 principal components analysis
3  principal components analysis3  principal components analysis
3 principal components analysisDmitry Grapov
 
Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Dmitry Grapov
 
Connecting Metabolomic Data with Context
Connecting Metabolomic Data with ContextConnecting Metabolomic Data with Context
Connecting Metabolomic Data with ContextDmitry Grapov
 
Ligand efficiency: nice concept shame about the metrics
Ligand efficiency: nice concept shame about the metricsLigand efficiency: nice concept shame about the metrics
Ligand efficiency: nice concept shame about the metricsPeter Kenny
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Dmitry Grapov
 
Accelerating lead optimisation with active learning by exploiting MMPA based ...
Accelerating lead optimisation with active learning by exploiting MMPA based ...Accelerating lead optimisation with active learning by exploiting MMPA based ...
Accelerating lead optimisation with active learning by exploiting MMPA based ...Ed Griffen
 
Data analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingData analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingRichard Lewis
 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Dmitry Grapov
 
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...IRJET Journal
 
High Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationHigh Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationDmitry Grapov
 
molecular docking
molecular dockingmolecular docking
molecular dockingKOUSHIK DEB
 
5 data analysis case study
5  data analysis case study5  data analysis case study
5 data analysis case studyDmitry Grapov
 
Omic Data Integration Strategies
Omic Data Integration StrategiesOmic Data Integration Strategies
Omic Data Integration StrategiesDmitry Grapov
 
Metabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsMetabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsDmitry Grapov
 
Data analysis workflows part 1 2015
Data analysis workflows part 1 2015Data analysis workflows part 1 2015
Data analysis workflows part 1 2015Dmitry Grapov
 
Determining stable ligand orientation
Determining stable ligand orientationDetermining stable ligand orientation
Determining stable ligand orientationijaia
 

What's hot (20)

Day2_Plenary_Peter R Steward
Day2_Plenary_Peter R Steward Day2_Plenary_Peter R Steward
Day2_Plenary_Peter R Steward
 
3 principal components analysis
3  principal components analysis3  principal components analysis
3 principal components analysis
 
Data analysis workflows part 2 2015
Data analysis workflows part 2 2015Data analysis workflows part 2 2015
Data analysis workflows part 2 2015
 
Connecting Metabolomic Data with Context
Connecting Metabolomic Data with ContextConnecting Metabolomic Data with Context
Connecting Metabolomic Data with Context
 
Ligand efficiency: nice concept shame about the metrics
Ligand efficiency: nice concept shame about the metricsLigand efficiency: nice concept shame about the metrics
Ligand efficiency: nice concept shame about the metrics
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
 
Accelerating lead optimisation with active learning by exploiting MMPA based ...
Accelerating lead optimisation with active learning by exploiting MMPA based ...Accelerating lead optimisation with active learning by exploiting MMPA based ...
Accelerating lead optimisation with active learning by exploiting MMPA based ...
 
Data analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingData analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination Modelling
 
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
Metabolomics and Beyond Challenges and Strategies for Next-gen Omic Analyses
 
2 cluster analysis
2  cluster analysis2  cluster analysis
2 cluster analysis
 
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...IRJET-  	  Classification of Chemical Medicine or Drug using K Nearest Neighb...
IRJET- Classification of Chemical Medicine or Drug using K Nearest Neighb...
 
High Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and VisualizationHigh Dimensional Biological Data Analysis and Visualization
High Dimensional Biological Data Analysis and Visualization
 
molecular docking
molecular dockingmolecular docking
molecular docking
 
Machine learning in computational docking
Machine learning in computational dockingMachine learning in computational docking
Machine learning in computational docking
 
5 data analysis case study
5  data analysis case study5  data analysis case study
5 data analysis case study
 
Omic Data Integration Strategies
Omic Data Integration StrategiesOmic Data Integration Strategies
Omic Data Integration Strategies
 
Metabolomic data analysis and visualization tools
Metabolomic data analysis and visualization toolsMetabolomic data analysis and visualization tools
Metabolomic data analysis and visualization tools
 
MOLECULAR DOCKING
MOLECULAR DOCKINGMOLECULAR DOCKING
MOLECULAR DOCKING
 
Data analysis workflows part 1 2015
Data analysis workflows part 1 2015Data analysis workflows part 1 2015
Data analysis workflows part 1 2015
 
Determining stable ligand orientation
Determining stable ligand orientationDetermining stable ligand orientation
Determining stable ligand orientation
 

Viewers also liked

Managing Virtual Teams
Managing Virtual TeamsManaging Virtual Teams
Managing Virtual Teamspquinn1
 
Mc Kirkley Oral Defense 11122009 V2
Mc Kirkley Oral Defense 11122009 V2Mc Kirkley Oral Defense 11122009 V2
Mc Kirkley Oral Defense 11122009 V2mckirkley
 
Managing Virtual Teams
Managing Virtual TeamsManaging Virtual Teams
Managing Virtual TeamsJohn Zachary
 
Matrix Structure Navigating & Making It Work
Matrix Structure   Navigating & Making It WorkMatrix Structure   Navigating & Making It Work
Matrix Structure Navigating & Making It Workngaungsan
 
TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...
TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...
TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...TMA World
 
Surviving in a Matrix: Simple Techniques for Effective Project Management in ...
Surviving in a Matrix: Simple Techniques for Effective Project Management in ...Surviving in a Matrix: Simple Techniques for Effective Project Management in ...
Surviving in a Matrix: Simple Techniques for Effective Project Management in ...Dr. Thomas Juli
 
Kim Kardashian Fleet Media
Kim Kardashian Fleet MediaKim Kardashian Fleet Media
Kim Kardashian Fleet MediaJeffMoore
 
كيف يعامل المسنون في دين الإسلام
كيف يعامل المسنون في دين الإسلامكيف يعامل المسنون في دين الإسلام
كيف يعامل المسنون في دين الإسلامAbdullah Baspren
 
толкование десяти последних джузов благородного корана, и следует за ним Last...
толкование десяти последних джузов благородного корана, и следует за ним Last...толкование десяти последних джузов благородного корана, и следует за ним Last...
толкование десяти последних джузов благородного корана, и следует за ним Last...Abdullah Baspren
 
The Anti Islam Dutch Film – Comment, Criticism And Advice
The Anti Islam Dutch Film – Comment, Criticism And AdviceThe Anti Islam Dutch Film – Comment, Criticism And Advice
The Anti Islam Dutch Film – Comment, Criticism And AdviceAbdullah Baspren
 
Disability Insurance
Disability InsuranceDisability Insurance
Disability Insurancefpcksc
 
Amazing Creatures (Tiger & Cheetah)
Amazing Creatures (Tiger & Cheetah)Amazing Creatures (Tiger & Cheetah)
Amazing Creatures (Tiger & Cheetah)Abdullah Baspren
 
رسالة الى الباحثين عن الحقيقة
رسالة الى الباحثين عن الحقيقةرسالة الى الباحثين عن الحقيقة
رسالة الى الباحثين عن الحقيقةAbdullah Baspren
 
Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011
Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011
Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011irunguh
 
Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...
Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...
Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...Abdullah Baspren
 

Viewers also liked (20)

Managing Virtual Teams
Managing Virtual TeamsManaging Virtual Teams
Managing Virtual Teams
 
Managing in virtual environment . Part 1. Becoming virtual.
Managing in virtual environment. Part 1. Becoming virtual.Managing in virtual environment. Part 1. Becoming virtual.
Managing in virtual environment . Part 1. Becoming virtual.
 
Mc Kirkley Oral Defense 11122009 V2
Mc Kirkley Oral Defense 11122009 V2Mc Kirkley Oral Defense 11122009 V2
Mc Kirkley Oral Defense 11122009 V2
 
Managing Virtual Teams
Managing Virtual TeamsManaging Virtual Teams
Managing Virtual Teams
 
Matrix Structure Navigating & Making It Work
Matrix Structure   Navigating & Making It WorkMatrix Structure   Navigating & Making It Work
Matrix Structure Navigating & Making It Work
 
Managing virtual teams
Managing virtual teamsManaging virtual teams
Managing virtual teams
 
TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...
TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...
TMA World Viewpoint 26: Matrix Organization Best Practices - Extending Your C...
 
Surviving in a Matrix: Simple Techniques for Effective Project Management in ...
Surviving in a Matrix: Simple Techniques for Effective Project Management in ...Surviving in a Matrix: Simple Techniques for Effective Project Management in ...
Surviving in a Matrix: Simple Techniques for Effective Project Management in ...
 
Matrix management
Matrix managementMatrix management
Matrix management
 
Kim Kardashian Fleet Media
Kim Kardashian Fleet MediaKim Kardashian Fleet Media
Kim Kardashian Fleet Media
 
كيف يعامل المسنون في دين الإسلام
كيف يعامل المسنون في دين الإسلامكيف يعامل المسنون في دين الإسلام
كيف يعامل المسنون في دين الإسلام
 
толкование десяти последних джузов благородного корана, и следует за ним Last...
толкование десяти последних джузов благородного корана, и следует за ним Last...толкование десяти последних джузов благородного корана, и следует за ним Last...
толкование десяти последних джузов благородного корана, и следует за ним Last...
 
The Anti Islam Dutch Film – Comment, Criticism And Advice
The Anti Islam Dutch Film – Comment, Criticism And AdviceThe Anti Islam Dutch Film – Comment, Criticism And Advice
The Anti Islam Dutch Film – Comment, Criticism And Advice
 
Disability Insurance
Disability InsuranceDisability Insurance
Disability Insurance
 
401 K
401 K401 K
401 K
 
Amazing Creatures (Tiger & Cheetah)
Amazing Creatures (Tiger & Cheetah)Amazing Creatures (Tiger & Cheetah)
Amazing Creatures (Tiger & Cheetah)
 
رسالة الى الباحثين عن الحقيقة
رسالة الى الباحثين عن الحقيقةرسالة الى الباحثين عن الحقيقة
رسالة الى الباحثين عن الحقيقة
 
Digiforum
DigiforumDigiforum
Digiforum
 
Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011
Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011
Ihoughton Strengthening Accountability Against Multi Lateral Agreements V031011
 
Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...
Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...
Muongozo Wa Kuufahamu Uislamu Kwa Ufupi Na Kwa Kutumia Vielelezo Vya Picha _ ...
 

Similar to Data Mining Using a Consensus Algorithm

Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsValery Tkachenko
 
2007 Pharmasug, Promotion Response Analysis
2007 Pharmasug, Promotion Response Analysis2007 Pharmasug, Promotion Response Analysis
2007 Pharmasug, Promotion Response AnalysisAlejandro Jaramillo
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...ahmad abdelhafeez
 
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
 ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO... ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...cscpconf
 
An experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forestAn experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forestIJDKP
 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slidespannicle
 
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection:  Comparative StudyA Threshold Fuzzy Entropy Based Feature Selection:  Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection: Comparative StudyIJMER
 
Virtual screening of chemicals for endocrine disrupting activity through CER...
Virtual screening of chemicals for endocrine disrupting activity through  CER...Virtual screening of chemicals for endocrine disrupting activity through  CER...
Virtual screening of chemicals for endocrine disrupting activity through CER...Kamel Mansouri
 
Classification of Breast Cancer Tissues using Decision Tree Algorithms
Classification of Breast Cancer Tissues using Decision Tree AlgorithmsClassification of Breast Cancer Tissues using Decision Tree Algorithms
Classification of Breast Cancer Tissues using Decision Tree AlgorithmsLovely Professional University
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...Kamel Mansouri
 
Advance KNN classification of brain tumor
Advance KNN classification of brain tumorAdvance KNN classification of brain tumor
Advance KNN classification of brain tumorVikas Mahurkar
 
How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarAnn-Marie Roche
 
NMR, deep learning and molecular structure: a call for data
NMR, deep learning and molecular structure: a call for dataNMR, deep learning and molecular structure: a call for data
NMR, deep learning and molecular structure: a call for dataJeff White
 
Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Damian R. Mingle, MBA
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...ahmad abdelhafeez
 
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...cscpconf
 

Similar to Data Mining Using a Consensus Algorithm (20)

Deep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpointsDeep learning methods applied to physicochemical and toxicological endpoints
Deep learning methods applied to physicochemical and toxicological endpoints
 
2007 Pharmasug, Promotion Response Analysis
2007 Pharmasug, Promotion Response Analysis2007 Pharmasug, Promotion Response Analysis
2007 Pharmasug, Promotion Response Analysis
 
Hc3413121317
Hc3413121317Hc3413121317
Hc3413121317
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
 
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
 ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO... ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
ENHANCED BREAST CANCER RECOGNITION BASED ON ROTATION FOREST FEATURE SELECTIO...
 
An experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forestAn experimental study on hypothyroid using rotation forest
An experimental study on hypothyroid using rotation forest
 
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
 
Seminar Slides
Seminar SlidesSeminar Slides
Seminar Slides
 
CAD v2
CAD v2CAD v2
CAD v2
 
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection:  Comparative StudyA Threshold Fuzzy Entropy Based Feature Selection:  Comparative Study
A Threshold Fuzzy Entropy Based Feature Selection: Comparative Study
 
Virtual screening of chemicals for endocrine disrupting activity through CER...
Virtual screening of chemicals for endocrine disrupting activity through  CER...Virtual screening of chemicals for endocrine disrupting activity through  CER...
Virtual screening of chemicals for endocrine disrupting activity through CER...
 
Classification of Breast Cancer Tissues using Decision Tree Algorithms
Classification of Breast Cancer Tissues using Decision Tree AlgorithmsClassification of Breast Cancer Tissues using Decision Tree Algorithms
Classification of Breast Cancer Tissues using Decision Tree Algorithms
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...In-silico structure activity relationship study of toxicity endpoints by QSAR...
In-silico structure activity relationship study of toxicity endpoints by QSAR...
 
Advance KNN classification of brain tumor
Advance KNN classification of brain tumorAdvance KNN classification of brain tumor
Advance KNN classification of brain tumor
 
How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinar
 
NMR, deep learning and molecular structure: a call for data
NMR, deep learning and molecular structure: a call for dataNMR, deep learning and molecular structure: a call for data
NMR, deep learning and molecular structure: a call for data
 
Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
 
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
PROGRAM TEST DATA GENERATION FOR BRANCH COVERAGE WITH GENETIC ALGORITHM: COMP...
 

Recently uploaded

Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 

Data Mining Using a Consensus Algorithm

  • 1. Data Mining Using Consensus Fingerprint Searches. James R. Arnold *, Charles L. Lerman *, William F. Michne *, David A. Cosgrove ^, James R. Damewood *. Affiliations: * AstraZeneca CNS Chemistry, Wilmington, USA; ^ Cancer and Infection Chemistry, Alderley Park, UK.
  • 2.
  • 3. Fingerprint-Based Classification and Data Mining. Classification is based on 400 medicinally relevant functional groups. Example: Imigran (1), GSK, a 1.07 billion dollar treatment for migraine in 2000.
  • 4. Pattern Matching Rules. * The exclusions make the functional group definitions specific and make the entire set as orthogonal as possible.
  • 5. Classification Quality: Coverage and Overlap of Definitions. Coverage: all heteroatoms in the molecule are classified. Overlap: a heteroatom in the molecule is classified in more than one functional group. Testing in medicinally relevant databases (CMC = 8,545; MDDR = 135,342; MedCh = 145,158) gave roughly 90% coverage and 10% overlap. [Figure panels: Ideal Coverage, Ideal Overlap]
  • 6.
  • 7.
  • 8.
  • 9. Consensus Approach: Overlap of True Positives from FG Count and Daylight. The circles are drawn to scale according to the number of actives recovered at given Tanimoto distances.
  • 10. Performance of the FG Count, Daylight and Consensus Approaches in Terms of True and False Positives. Number of true and false positives for the functional group fingerprint count, Daylight fingerprint, and consensus (logical “AND”) approaches for the 538 biological target classes at Tanimoto distances of 0.0, 0.1, 0.2, and 0.3. The three methods are binned at each Tanimoto distance and reported in the order counts, Daylight, consensus, labeled FG, D, and C, respectively (FG = FG Count, D = Daylight, C = Consensus). Annotation on the chart: 50% reduction in false positives.
  • 11.
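
The consensus (logical "AND") idea on slides 9 and 10 can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the fingerprints are toy sets of on-bits, the compound IDs, the `fg_library` and `day_library` dictionaries, and all function names are hypothetical, and the Tanimoto distance is assumed to be 1 minus the Tanimoto coefficient on bit sets (the slides do not spell out how the FG Count distance is computed).

```python
# Toy sketch of a consensus (logical AND) fingerprint search.
# All data and names below are hypothetical and for illustration only.

def tanimoto_distance(a, b):
    """Assumed definition: 1 - |a & b| / |a | b| on sets of on-bits."""
    union = len(a | b)
    if union == 0:
        return 0.0
    return 1.0 - len(a & b) / union

def hits_within(query, library, max_distance):
    """IDs of library compounds within max_distance of the query."""
    return {cid for cid, fp in library.items()
            if tanimoto_distance(query, fp) <= max_distance}

def consensus_hits(hits_a, hits_b):
    """Logical AND: keep only compounds found by both searches."""
    return hits_a & hits_b

def count_positives(hits, actives):
    """Return (true positives, false positives) for a hit set."""
    true_pos = len(hits & actives)
    return true_pos, len(hits) - true_pos

# Two toy fingerprint representations of the same four compounds.
fg_library = {"c1": {1, 2, 3, 4}, "c2": {1, 2, 3},
              "c3": {7, 8, 9}, "c4": {1, 2, 3, 4, 5}}
day_library = {"c1": {10, 11, 12, 13}, "c2": {20, 21, 22},
               "c3": {10, 11, 12}, "c4": {10, 11, 12, 13, 14}}
actives = {"c1", "c4"}  # hypothetical known actives

fg_hits = hits_within({1, 2, 3, 4}, fg_library, 0.3)
day_hits = hits_within({10, 11, 12, 13}, day_library, 0.3)
both = consensus_hits(fg_hits, day_hits)

fg_counts = count_positives(fg_hits, actives)
day_counts = count_positives(day_hits, actives)
consensus_counts = count_positives(both, actives)
```

In this toy data each single method returns one inactive compound that the other method does not, so the intersection keeps both actives and drops both false positives. The numbers are contrived to show the mechanism and say nothing about the rates reported in the slides.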