SlideShare a Scribd company logo
1 of 3
S1: Let me introduce a work titled “Anticancer Thiazolidinones Design: Mining
of 60-Cell Lines Experimental Data”. A search for anticancer agents
containing thiazolidinone scaffold is a promising trend in modern medicinal
chemistry. Computational chemistry, particularly QSAR and docking, is one
of the success factors in this direction. In the present research we tried to
extract all valuable knowledge hidden in 60-cell line anticancer screen results
in-home database, that will be useful in further QSAR-studies.
S2: The in vitro cell line screening is implemented under the developmental
therapeutics program of National Cancer Institute (USA). The operation of
this screen utilizes 60 different human tumor cell lines, representing leukemia,
melanoma and cancers of the lung, colon, brain, ovary, breast, prostate, and
kidney. The screening is a two-stage process, beginning with the evaluation of
all compounds against the 60 cell lines at a single dose of 10 uM. The output
from the single dose screen is reported as a mean graph. Compounds which
exhibit significant growth inhibition are evaluated against the 60 cell panel at
five concentration levels (with 10 uM as one of them). The results of both
stages are compared to control test and are represented by a Growth percent.
S3: The next problems had to be solved during current computational study:
- Have the same dose results of this two stages enough statistical similarity
to be treated together in future QSAR modelling?
- Where is a rational border between active and inactive compounds?
- Is there different mechanisms of antitumor action associated with
investigated compounds?
S4: A hypothesis, stated that same dose results are homogenous was
investigated. If it is true, the next conclusions will be useful for further
investigations:
Primarily, this results can be combined together to increase overall data
amount.
Secondly, deviation in the results for same compounds is an error of the
experiment
And this experimental error is a minimal error for any QSAR model based
on this data
S5: According to statistical concepts, if same dose results are homogenous then
deviations in the results for same compounds is a normally distributed random
sample with zero mean and unknown variance. This null-hypothesis was
evaluated by Student’s t-test with 60 cell lines results for 73 pairs of compounds,
and was rejected for 41 cell lines with default statistical significance 0,05. In case
of other 19 cell lines we cannot reject null-hypothesis, what means that either it
is true, either it is insufficient data to reject this hypothesis. That is why we reject
the investigated hypothesis in general.
S6: Looking at the mean deviations of growth percents for different cell lines
distribution, a shift to positive numbers can be pointed. It means that the results
of the second testing stage are more optimistic than the results of the first one.
S7: The distribution of mean deviations of growth percents for different
compounds indicates the presence of extreme errors, that are still not corrected
after averaging. Considering a case of 100% deviation of single pair results
values as an outlier, extreme errors rate above 4% was found.
S8: Testing results for non-active compounds have to be normally distributed
with mean growth percent of cancer cells = 100% and unknown variance.
Making an assumption about an abscence of the tumor growth enhancers among
the investigated compounds, it can be stated that all mean growth percent values
above 100% form a right tail of this distribution. So the left tail can be found
statistically. For this purpose multiple evaluating of t-test with slow change of
cut-off was carried out and resulted in the first failure to reject null-hypothesis
with minimum growth percent = 86%. Simply saying, all compounds with mean
growth percent above 86% have to be treated as non-active. Such introduction of
the border between active and non-active compounds let us to form rational data
arrays for further QSAR investigations.
S9: Principal component analysis finds such linear combinations of variables that
the projection of initial data on the obtained vectors will have maximal
dispersion. Using principal component analysis is possible under the assumption
that experimantal error is less than difference in sensitiity patterns for various
mechanisms. It is expected that the first principal component incorporates an
information about mean growth percent, and the others principal components
cover differences in mechanisms and errors of the experiment. A change in
explained variances of two next principal components was selected as a
separation criteria between mechanisms and errors. Prior to calculations data was
normalized by cell lines to provide equal influence of every analyzed cell line.
Since change in explained variance with the second principal component is 5
times greater than next one, the presence of two different mechanisms are
indicated. Approximate clusters of compounds with different modes of action are
outlined by ellipses in the figure. As you can see, it is difficult to establish exact
borders and a role of intermediate compounds remains unclear. That is why
neural network modelling as more powerful computational approach was
utilized.
S10: Cohonen’s self-organizing 6 for 6 map was used for unsupervised learning
durig 5 000 epochs. Prior to calculations data was normalized by compounds, so
mean activity information was removed. And clusters with active compounds
contain also inactive because of experimental error randomness. The distribution
of whole compounds set in the neural network is showed on the left figure, and
the distribution of only active molecules with the distances between neurons are
on the right figure. We still cannot clearly separate different mechanisms
becouse importance of a cluster depends not on the number of active compounds,
but on the values of growth percents.
S11: So an integrated activity measure, calculated as a sum of same cluster
compounds contributions, was introduced. A surface of integrated activity over
the neural network is presented in the figure. It gives a possibility to distinguish
three classes: two different mechanisms (A and C) and mixed one (B) and to
separate them clearly.
S12: Weight plains of every cell line in the neural network allow to analyze
selectivity patterns for active compounds. In this figures cell line is more
sensitive to compounds from dark clusters and less sensitive to compounds from
light clusters. It have to be pointed that NCI-H460 is more sensitive than other
lines to all three classes, the second line is sensitive mostly to mechanism C,
M14 and SK-MEL-5 – to mechanism A. M14 is rather insensitive to cluster C.
S13: In the other hand, all lines on this slide are less sensitive to class A, and
OVCAR-5 and SNB-19 – to class C. We can see also that weights of mechanism
B are mostly in the interval formed by A and C weights. That confirms a
hypothesis about mixed mechanism of tumor growth inhibition by compounds
from cluster B.
S14. Conclusions:
The homogeneity of humat tumor cell line screen results obtained from different
testing stages is rejected so they cannot be combined together in further
computational investigations
It is found that about 4% of testing results are extreme errors, what is useful with
outliers detection in future QSAR models
Rational border between active and non-active compounds is introduced and
proper data arrays for further QSAR are formed.
Two independent and one mixed mechanisms of 4-thiazolidinones antitumor
activity are identified
Some selectivity linked with different modes of action for separate cell lines is
highlighted

More Related Content

What's hot

Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...
Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...
Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...AM Publications
 
Standard error-Biostatistics
Standard error-BiostatisticsStandard error-Biostatistics
Standard error-BiostatisticsSudha Rameshwari
 
McCarthy_TermPaperSpring
McCarthy_TermPaperSpringMcCarthy_TermPaperSpring
McCarthy_TermPaperSpringMarc McCarthy
 
Descriptive versus Mechanistic Modeling
Descriptive versus Mechanistic ModelingDescriptive versus Mechanistic Modeling
Descriptive versus Mechanistic ModelingAshwani Dhingra
 
An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2
An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2
An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2CORE Group
 
Classification and Predication of Breast Cancer Risk Factors Using Id3
Classification and Predication of Breast Cancer Risk Factors Using Id3Classification and Predication of Breast Cancer Risk Factors Using Id3
Classification and Predication of Breast Cancer Risk Factors Using Id3theijes
 
Analysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective ViewAnalysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective Viewijtsrd
 
Data analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingData analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingRichard Lewis
 
Analysis of kinetic data
Analysis of kinetic dataAnalysis of kinetic data
Analysis of kinetic dataVineetha Menon
 
Simulation Study of Hurdle Model Performance on Zero Inflated Count Data
Simulation Study of Hurdle Model Performance on Zero Inflated Count DataSimulation Study of Hurdle Model Performance on Zero Inflated Count Data
Simulation Study of Hurdle Model Performance on Zero Inflated Count DataIan Camacho
 
MultipleLinearRegressionPaper
MultipleLinearRegressionPaperMultipleLinearRegressionPaper
MultipleLinearRegressionPaperKatie Harvey
 
Computer simulation in pharmacokinetics and pharmacodynamics
Computer simulation in pharmacokinetics and pharmacodynamicsComputer simulation in pharmacokinetics and pharmacodynamics
Computer simulation in pharmacokinetics and pharmacodynamicsMOHAMMAD ASIM
 
Breast cancer classification
Breast cancer classificationBreast cancer classification
Breast cancer classificationAshwan Abdulmunem
 

What's hot (17)

Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...
Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...
Innovative Technique for Gene Selection in Microarray Based on Recursive Clus...
 
Standard error-Biostatistics
Standard error-BiostatisticsStandard error-Biostatistics
Standard error-Biostatistics
 
McCarthy_TermPaperSpring
McCarthy_TermPaperSpringMcCarthy_TermPaperSpring
McCarthy_TermPaperSpring
 
Descriptive versus Mechanistic Modeling
Descriptive versus Mechanistic ModelingDescriptive versus Mechanistic Modeling
Descriptive versus Mechanistic Modeling
 
final paper
final paperfinal paper
final paper
 
An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2
An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2
An Update of Lot Quality Assurance Sampling (LQAS) Technologies Handout 2
 
Classification and Predication of Breast Cancer Risk Factors Using Id3
Classification and Predication of Breast Cancer Risk Factors Using Id3Classification and Predication of Breast Cancer Risk Factors Using Id3
Classification and Predication of Breast Cancer Risk Factors Using Id3
 
AMP Poster_Final
AMP Poster_FinalAMP Poster_Final
AMP Poster_Final
 
Analysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective ViewAnalysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective View
 
Data analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination ModellingData analysis and Visualisation Techniques for Compound Combination Modelling
Data analysis and Visualisation Techniques for Compound Combination Modelling
 
Analysis of kinetic data
Analysis of kinetic dataAnalysis of kinetic data
Analysis of kinetic data
 
V5I3_IJERTV5IS031157
V5I3_IJERTV5IS031157V5I3_IJERTV5IS031157
V5I3_IJERTV5IS031157
 
Simulation Study of Hurdle Model Performance on Zero Inflated Count Data
Simulation Study of Hurdle Model Performance on Zero Inflated Count DataSimulation Study of Hurdle Model Performance on Zero Inflated Count Data
Simulation Study of Hurdle Model Performance on Zero Inflated Count Data
 
B45020308
B45020308B45020308
B45020308
 
MultipleLinearRegressionPaper
MultipleLinearRegressionPaperMultipleLinearRegressionPaper
MultipleLinearRegressionPaper
 
Computer simulation in pharmacokinetics and pharmacodynamics
Computer simulation in pharmacokinetics and pharmacodynamicsComputer simulation in pharmacokinetics and pharmacodynamics
Computer simulation in pharmacokinetics and pharmacodynamics
 
Breast cancer classification
Breast cancer classificationBreast cancer classification
Breast cancer classification
 

Viewers also liked

Cave of the Heart - Esegesi emotiva - Gruppo "Medea"
Cave of the Heart - Esegesi emotiva - Gruppo "Medea"Cave of the Heart - Esegesi emotiva - Gruppo "Medea"
Cave of the Heart - Esegesi emotiva - Gruppo "Medea"Federica Cicotto
 
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental DataANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Datao_devinyak
 
Descriptive stat
Descriptive statDescriptive stat
Descriptive stato_devinyak
 
презентація медичного факультету УжНУ
презентація медичного факультету УжНУпрезентація медичного факультету УжНУ
презентація медичного факультету УжНУo_devinyak
 
Lecture2 hypothesis testing
Lecture2 hypothesis testingLecture2 hypothesis testing
Lecture2 hypothesis testingo_devinyak
 
Motivation for biostatistics
Motivation for biostatisticsMotivation for biostatistics
Motivation for biostatisticso_devinyak
 
SaltConf 2015: Salt stack at web scale: Better, Stronger, Faster
SaltConf 2015: Salt stack at web scale: Better, Stronger, FasterSaltConf 2015: Salt stack at web scale: Better, Stronger, Faster
SaltConf 2015: Salt stack at web scale: Better, Stronger, FasterThomas Jackson
 
Introduction to biostatistics
Introduction to biostatisticsIntroduction to biostatistics
Introduction to biostatisticso_devinyak
 
Workshop de Inovação em Medical Devices
Workshop de Inovação em Medical DevicesWorkshop de Inovação em Medical Devices
Workshop de Inovação em Medical DevicesEmpreender Saúde
 
Saltconf 2016: Salt stack transport and concurrency
Saltconf 2016: Salt stack transport and concurrencySaltconf 2016: Salt stack transport and concurrency
Saltconf 2016: Salt stack transport and concurrencyThomas Jackson
 
Introduction to Biodesign - Stanford University - Ravi Pamnani
Introduction to Biodesign - Stanford University - Ravi PamnaniIntroduction to Biodesign - Stanford University - Ravi Pamnani
Introduction to Biodesign - Stanford University - Ravi PamnaniEmpreender Saúde
 

Viewers also liked (12)

Cave of the Heart - Esegesi emotiva - Gruppo "Medea"
Cave of the Heart - Esegesi emotiva - Gruppo "Medea"Cave of the Heart - Esegesi emotiva - Gruppo "Medea"
Cave of the Heart - Esegesi emotiva - Gruppo "Medea"
 
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental DataANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
 
Asia’s Growing Influence on Houston 9.16.2015
Asia’s Growing Influence on Houston 9.16.2015Asia’s Growing Influence on Houston 9.16.2015
Asia’s Growing Influence on Houston 9.16.2015
 
Descriptive stat
Descriptive statDescriptive stat
Descriptive stat
 
презентація медичного факультету УжНУ
презентація медичного факультету УжНУпрезентація медичного факультету УжНУ
презентація медичного факультету УжНУ
 
Lecture2 hypothesis testing
Lecture2 hypothesis testingLecture2 hypothesis testing
Lecture2 hypothesis testing
 
Motivation for biostatistics
Motivation for biostatisticsMotivation for biostatistics
Motivation for biostatistics
 
SaltConf 2015: Salt stack at web scale: Better, Stronger, Faster
SaltConf 2015: Salt stack at web scale: Better, Stronger, FasterSaltConf 2015: Salt stack at web scale: Better, Stronger, Faster
SaltConf 2015: Salt stack at web scale: Better, Stronger, Faster
 
Introduction to biostatistics
Introduction to biostatisticsIntroduction to biostatistics
Introduction to biostatistics
 
Workshop de Inovação em Medical Devices
Workshop de Inovação em Medical DevicesWorkshop de Inovação em Medical Devices
Workshop de Inovação em Medical Devices
 
Saltconf 2016: Salt stack transport and concurrency
Saltconf 2016: Salt stack transport and concurrencySaltconf 2016: Salt stack transport and concurrency
Saltconf 2016: Salt stack transport and concurrency
 
Introduction to Biodesign - Stanford University - Ravi Pamnani
Introduction to Biodesign - Stanford University - Ravi PamnaniIntroduction to Biodesign - Stanford University - Ravi Pamnani
Introduction to Biodesign - Stanford University - Ravi Pamnani
 

Similar to Notes for macc

coad_machine_learning
coad_machine_learningcoad_machine_learning
coad_machine_learningFord Sleeman
 
Parmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trailsParmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trailsVinod Pagidipalli
 
AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...
AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...
AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...yuliamax
 
Diagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set TheoryDiagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set TheoryIRJET Journal
 
ASEE-GSW_2015_submission_75
ASEE-GSW_2015_submission_75ASEE-GSW_2015_submission_75
ASEE-GSW_2015_submission_75Sam Yang
 
Srge most important publications 2020
Srge most important  publications 2020Srge most important  publications 2020
Srge most important publications 2020Aboul Ella Hassanien
 
Cell lines breast cancer-project
Cell lines breast cancer-project Cell lines breast cancer-project
Cell lines breast cancer-project Amitha Dasari
 
JSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerJSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerDennis Sweitzer
 
Class24 chi squaretestofindependenceposthoc(1)
Class24 chi squaretestofindependenceposthoc(1)Class24 chi squaretestofindependenceposthoc(1)
Class24 chi squaretestofindependenceposthoc(1)arvindmnnitmsw
 
Class24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthocClass24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthocBetynatha Kb
 
Causal inference lecture to Texas Children's fellows
Causal inference lecture to Texas Children's fellowsCausal inference lecture to Texas Children's fellows
Causal inference lecture to Texas Children's fellowsPavlos Msaouel, MD, PhD
 
Bio statistics Bio statistics Bio statistics
Bio statistics Bio statistics  Bio statisticsBio statistics Bio statistics  Bio statistics
Bio statistics Bio statistics Bio statisticsArwaAbdelHamid1
 
Sample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdfSample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdfstatsanjal
 
Development and Validation of a Nomogram for Predicting Response to Neoadjuva...
Development and Validation of a Nomogram for Predicting Response to Neoadjuva...Development and Validation of a Nomogram for Predicting Response to Neoadjuva...
Development and Validation of a Nomogram for Predicting Response to Neoadjuva...semualkaira
 
Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...
Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...
Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...CSCJournals
 
Melanoma Nancy Shum And Anne Marcy Intro To Clinical Data Management
Melanoma   Nancy Shum And Anne Marcy Intro To Clinical Data ManagementMelanoma   Nancy Shum And Anne Marcy Intro To Clinical Data Management
Melanoma Nancy Shum And Anne Marcy Intro To Clinical Data Managementcunniffe6
 
ARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCER
ARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCERARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCER
ARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCERIJCI JOURNAL
 

Similar to Notes for macc (20)

coad_machine_learning
coad_machine_learningcoad_machine_learning
coad_machine_learning
 
Parmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trailsParmetric and non parametric statistical test in clinical trails
Parmetric and non parametric statistical test in clinical trails
 
AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...
AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...
AACR 2014 Abstract# 3730: A quick and cost effective 12-cell line panel assay...
 
mining
miningmining
mining
 
antiviral coursework
antiviral courseworkantiviral coursework
antiviral coursework
 
Diagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set TheoryDiagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set Theory
 
ASEE-GSW_2015_submission_75
ASEE-GSW_2015_submission_75ASEE-GSW_2015_submission_75
ASEE-GSW_2015_submission_75
 
Srge most important publications 2020
Srge most important  publications 2020Srge most important  publications 2020
Srge most important publications 2020
 
Cell lines breast cancer-project
Cell lines breast cancer-project Cell lines breast cancer-project
Cell lines breast cancer-project
 
JSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerJSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzer
 
Class24 chi squaretestofindependenceposthoc(1)
Class24 chi squaretestofindependenceposthoc(1)Class24 chi squaretestofindependenceposthoc(1)
Class24 chi squaretestofindependenceposthoc(1)
 
Class24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthocClass24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthoc
 
Causal inference lecture to Texas Children's fellows
Causal inference lecture to Texas Children's fellowsCausal inference lecture to Texas Children's fellows
Causal inference lecture to Texas Children's fellows
 
Bio statistics Bio statistics Bio statistics
Bio statistics Bio statistics  Bio statisticsBio statistics Bio statistics  Bio statistics
Bio statistics Bio statistics Bio statistics
 
Sample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdfSample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdf
 
Development and Validation of a Nomogram for Predicting Response to Neoadjuva...
Development and Validation of a Nomogram for Predicting Response to Neoadjuva...Development and Validation of a Nomogram for Predicting Response to Neoadjuva...
Development and Validation of a Nomogram for Predicting Response to Neoadjuva...
 
SCLC CTC paper 2014
SCLC CTC paper 2014SCLC CTC paper 2014
SCLC CTC paper 2014
 
Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...
Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...
Evaluation of Logistic Regression and Neural Network Model With Sensitivity A...
 
Melanoma Nancy Shum And Anne Marcy Intro To Clinical Data Management
Melanoma   Nancy Shum And Anne Marcy Intro To Clinical Data ManagementMelanoma   Nancy Shum And Anne Marcy Intro To Clinical Data Management
Melanoma Nancy Shum And Anne Marcy Intro To Clinical Data Management
 
ARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCER
ARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCERARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCER
ARTIFICIAL NEURAL NETWORK FOR DIAGNOSIS OF PANCREATIC CANCER
 

Recently uploaded

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 

Recently uploaded (20)

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 

Notes for macc

  • 1. S1: Let me introduce a work titled “Anticancer Thiazolidinones Design: Mining of 60-Cell Lines Experimental Data”. A search for anticancer agents containing thiazolidinone scaffold is a promising trend in modern medicinal chemistry. Computational chemistry, particularly QSAR and docking, is one of the success factors in this direction. In the present research we tried to extract all valuable knowledge hidden in 60-cell line anticancer screen results in-home database, that will be useful in further QSAR-studies. S2: The in vitro cell line screening is implemented under the developmental therapeutics program of National Cancer Institute (USA). The operation of this screen utilizes 60 different human tumor cell lines, representing leukemia, melanoma and cancers of the lung, colon, brain, ovary, breast, prostate, and kidney. The screening is a two-stage process, beginning with the evaluation of all compounds against the 60 cell lines at a single dose of 10 uM. The output from the single dose screen is reported as a mean graph. Compounds which exhibit significant growth inhibition are evaluated against the 60 cell panel at five concentration levels (with 10 uM as one of them). The results of both stages are compared to control test and are represented by a Growth percent. S3: The next problems had to be solved during current computational study: - Have the same dose results of this two stages enough statistical similarity to be treated together in future QSAR modelling? - Where is a rational border between active and inactive compounds? - Is there different mechanisms of antitumor action associated with investigated compounds? S4: A hypothesis, stated that same dose results are homogenous was investigated. If it is true, the next conclusions will be useful for further investigations: Primarily, this results can be combined together to increase overall data amount. Secondly, deviation in the results for same compounds is an error of the experiment And this experimental error is a minimal error for any QSAR model based on this data S5: According to statistical concepts, if same dose results are homogenous then deviations in the results for same compounds is a normally distributed random sample with zero mean and unknown variance. This null-hypothesis was evaluated by Student’s t-test with 60 cell lines results for 73 pairs of compounds, and was rejected for 41 cell lines with default statistical significance 0,05. In case of other 19 cell lines we cannot reject null-hypothesis, what means that either it
  • 2. is true, either it is insufficient data to reject this hypothesis. That is why we reject the investigated hypothesis in general. S6: Looking at the mean deviations of growth percents for different cell lines distribution, a shift to positive numbers can be pointed. It means that the results of the second testing stage are more optimistic than the results of the first one. S7: The distribution of mean deviations of growth percents for different compounds indicates the presence of extreme errors, that are still not corrected after averaging. Considering a case of 100% deviation of single pair results values as an outlier, extreme errors rate above 4% was found. S8: Testing results for non-active compounds have to be normally distributed with mean growth percent of cancer cells = 100% and unknown variance. Making an assumption about an abscence of the tumor growth enhancers among the investigated compounds, it can be stated that all mean growth percent values above 100% form a right tail of this distribution. So the left tail can be found statistically. For this purpose multiple evaluating of t-test with slow change of cut-off was carried out and resulted in the first failure to reject null-hypothesis with minimum growth percent = 86%. Simply saying, all compounds with mean growth percent above 86% have to be treated as non-active. Such introduction of the border between active and non-active compounds let us to form rational data arrays for further QSAR investigations. S9: Principal component analysis finds such linear combinations of variables that the projection of initial data on the obtained vectors will have maximal dispersion. Using principal component analysis is possible under the assumption that experimantal error is less than difference in sensitiity patterns for various mechanisms. It is expected that the first principal component incorporates an information about mean growth percent, and the others principal components cover differences in mechanisms and errors of the experiment. A change in explained variances of two next principal components was selected as a separation criteria between mechanisms and errors. Prior to calculations data was normalized by cell lines to provide equal influence of every analyzed cell line. Since change in explained variance with the second principal component is 5 times greater than next one, the presence of two different mechanisms are indicated. Approximate clusters of compounds with different modes of action are outlined by ellipses in the figure. As you can see, it is difficult to establish exact borders and a role of intermediate compounds remains unclear. That is why neural network modelling as more powerful computational approach was utilized.
  • 3. S10: Cohonen’s self-organizing 6 for 6 map was used for unsupervised learning durig 5 000 epochs. Prior to calculations data was normalized by compounds, so mean activity information was removed. And clusters with active compounds contain also inactive because of experimental error randomness. The distribution of whole compounds set in the neural network is showed on the left figure, and the distribution of only active molecules with the distances between neurons are on the right figure. We still cannot clearly separate different mechanisms becouse importance of a cluster depends not on the number of active compounds, but on the values of growth percents. S11: So an integrated activity measure, calculated as a sum of same cluster compounds contributions, was introduced. A surface of integrated activity over the neural network is presented in the figure. It gives a possibility to distinguish three classes: two different mechanisms (A and C) and mixed one (B) and to separate them clearly. S12: Weight plains of every cell line in the neural network allow to analyze selectivity patterns for active compounds. In this figures cell line is more sensitive to compounds from dark clusters and less sensitive to compounds from light clusters. It have to be pointed that NCI-H460 is more sensitive than other lines to all three classes, the second line is sensitive mostly to mechanism C, M14 and SK-MEL-5 – to mechanism A. M14 is rather insensitive to cluster C. S13: In the other hand, all lines on this slide are less sensitive to class A, and OVCAR-5 and SNB-19 – to class C. We can see also that weights of mechanism B are mostly in the interval formed by A and C weights. That confirms a hypothesis about mixed mechanism of tumor growth inhibition by compounds from cluster B. S14. Conclusions: The homogeneity of humat tumor cell line screen results obtained from different testing stages is rejected so they cannot be combined together in further computational investigations It is found that about 4% of testing results are extreme errors, what is useful with outliers detection in future QSAR models Rational border between active and non-active compounds is introduced and proper data arrays for further QSAR are formed. Two independent and one mixed mechanisms of 4-thiazolidinones antitumor activity are identified Some selectivity linked with different modes of action for separate cell lines is highlighted