1. Introduction PCA LDA PLS
Multivariate data analysis - set of statistical models that examine patterns in
multidimensional data by considering, at once, several data variables
• Doesn’t require you to make assumptions
• Can build predictive models to estimate value of unknown sample
Chemometrics is rapidly establishing itself as a tool in Pharmaceutical Industry
Transition from “quality-by-testing” to “quality-by-design”
FDA emphasized using PAT and QbD techniques in ICH Q8 guidelines
Pharmaceutical products and processes are complex systems by nature and can only
be effectively described by multi-factorial relationships
Multivariate data analysis comes into play - Development of calibration models to predict relevant
parameters and quality attributes in real-time
1
http://letsexcel.in
Multivariate Data Analysis
2. Introduction PCA LDA PLS
PCA is a mathematical procedure that transforms a large set of variables into
a lower dimensional set of new variables designated as principal components.
Any data set, it is likely that the key information is contained in
• Dominating sources of variability
• other sources of variability (e.g. Noise)
Application in Pharmaceutical Industry:
• Can be used with other techniques (DoE or PLS) to determine in critical parameter
and study impact with CQA’s
• support investigation of root-cause of dissolution study
• Raw material classification
• Understanding differences in Positive and Negative control samples
• Support Process Analytical Technology Applications
2
http://letsexcel.in
Principal Component Analysis
3. Introduction PCA LDA PLS
3
http://letsexcel.in
Holt Melt Extrusion (HME)
Hot Melt Extrusion is used for
to uniformly disperse an API in a carrier
polymer:
• enhancement of solubility and
bioavailability of poorly soluble drugs
• control the delivery of an API
• taste masking of bitter API’s
Feeding API and polymer into an extruder
results in extrudate that contains stable,
uniformly dispersed, soluble API
HME has been used largely to manufacture
• granules
• pellets
• immediate and modified release tablets
• transmucosal/transdermal films
• implantable reservoir devices
Cautions:
• physicochemical properties of an API
are critical components of HME process
design
• API’s can undergo polymorphic changes
resulting in stability issues
• shear force and the cooling rate also
needs to be controlled
4. Introduction PCA LDA PLS
Paper Title: Data mining of solubility parameters for computational prediction of drug–
excipient miscibility
Predict the miscibility of drug and excipients using Hansen solubility parameters (HSPs)
• The energy from dispersion forces between molecules
• The energy from dipolar intermolecular force between molecules
• The energy from hydrogen bonds between molecules
In the study,
Drug Indomethacin’s miscibility was predicted with chemically diverse excipients using HSP
The miscibility was experimentally determined by Differential Scanning Calorimetry (DSC)
The algorithm results were then evaluated with DSC results and the prediction accuracy was
found at 94%
Paper reference: Alhalaweh, A., Alzghoul, A., & Kaialy, W. (2014). Data mining of solubility
parameters for computational prediction of drug–excipient miscibility. Drug development and
industrial pharmacy, 40(7), 904-909.
4
http://letsexcel.in
Example
18. Introduction PCA LDA PLS
LDA – Method to find linear combination of features that
characterizes or separates two or more classes of objects or events.
• Supervised Learning
• Closely related to ANOVA
Application
• Classification of in-process material based on Physicochemical
properties
• Quantitative structure activity relationship
• Classification of pass-fail batches
18
http://letsexcel.in
Linear Discriminant Analysis
34. Introduction PCA LDA PLS
Technique that reduces the predictors to a smaller set of
uncorrelated components and performs least squares regression
on these components, instead of on the original data
Application in Pharmaceutical Industry :
• Widely used in Chemometrics and applied to spectroscopy data to obtain
measurements that enable real time release testing (PAT applications)
• Optimize dry granulation process – combine CMA and CPP to CQA’s
• Model the relationship between API and intermediate properties of a
formulation process
• Mine the historical data for a product and determine critical variables that
influence such variability
• Quantitative structure activity relationships
34
http://letsexcel.in
Partial Least Square (PLS)
35. Introduction PCA LDA PLS
35
http://letsexcel.in
Example
Calibration and Test Data
of Sample from Raman
analysis of
Multicomponent Mixture
was clubbed together
•Since easyPLS
automatically divides
train and test samples
without bias
Mean Scatter Correction
was performed on the
spectra offline
Granulation drying stage
spectra is used as
unknown data
Bhavana, V., Chavan, R. B., Mannava, M. C., Nangia, A., & Shastri, N. R. (2019).
Quantification of niclosamide polymorphic forms–A comparative study by Raman,
NIR and MIR using chemometric techniques. Talanta, 199, 679-688.
Ferreira, A. P., & Tobyn, M. (2014). Multivariate analysis in the pharmaceutical industry: enabling process understanding and improvement in the PAT and QbD era. Pharmaceutical Development and Technology, 20(5), 513–527.