SlideShare a Scribd company logo
1 of 12
Download to read offline
Journal
of
near
Infrared
SpectroScopy
233
ISSn: 0967-0335 Š IM publications llp 2009
doi: 10.1255/jnirs.854 all rights reserved
A comparison of near infrared method
development approaches using a drug
product on different spectrophotometers
and chemometric software algorithms
Assad Kazeminy,a
Saeed Hashemi,a
Roger L. Williams,b
Gary E. Ritchie,c
Ronald Rubinovitzd
and Sumit Sene,*
a
Irvine pharmaceutical Services, Inc., 10 Vanderbilt, Irvine, ca 92618, uSa
b
united States pharmacopeial convention, 12601 twinbrook parkway, rockville, Maryland 20852, uSa
c
former united States pharmacopeial convention, 12601 twinbrook parkway, rockville, Maryland 20852, uSa
d
BĂźchi corporation, 19 lukens drive, new castle, de 19720, uSa
e
united States food and drug administration, 19701 fairchild, Irvine, ca 92612, uSa. e-mail: sumit_sen@hotmail.com
It is well known that spectral variability in near infrared (NIR) spectroscopy can be attributed to the analyst, sample, sample positioning,
instrument configuration and software (in both algorithm formats and structures used as well as in the execution of data pre-treatment
and analysis). It is often acknowledged that the single largest factor impacting NIR results is sample presentation. However, what is
obvious but not often acknowledged is that there are instrumental and software differences as well. These differences, evident in the
quality of the spectra, may impact the chemometrics that are subsequently performed and, possibly, the results obtained from the
multivariate statistical models. In order to investigate just what are these sources of variability, and just how much these variations
may impact the results of the multivariate models for predicting the identification of pharmaceutical dosage forms, a study has been
conducted. To the authors’ knowledge, no other systematic study of this kind has been published. In this study, we are interested in
learning what variability, if any, the choices for instrument and software have on the development of a NIR method for the identification
of pharmaceutical dosage forms. Furthermore, we would like to learn what and how do the choices made early on in the experimental
design impact the final quality of the spectra and the resulting multivariate models obtained from these spectra. A study protocol was
designed, using a common data set consisting of four formulations of Ibuprofen, involving three investigating parties, namely, US FDA,
USP and Irvine Pharmaceutical Services and using three NIR instruments, namely (listed in alphabetical order), a Bruker spectrometer,
a BĂźchi spectrometer and a Foss spectrometer. Based on the results and despite differences in instrument configuration [dispersive or
Fourier Transform (FT)], number of spectral data points, principal components analysis (PCA) or factorisation algorithms, and valida-
tion modelling approach, exact and accurate spectroscopic results can be achieved using NIR spectroscopy for discriminate analysis.
More importantly, this study shows that the same NIR method spectral range and pre-treatment parameters can be used, and that
nearly the same multivariate models can be obtained, despite instrumental and software differences, to accurately predict the identity
of pharmaceutical dosage forms.
Keywords: near infrared (NIR) spectroscopy, instrument variability, chemometric software algorithms, multivariate discriminant analysis, PCA
A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009)
received: 9 June 2009 n revised: 30 September 2009 n accepted: 11 october 2009 n publication: 30 october 2009
234 An NIR Comparison of Method Development Approaches Using a Drug Product
It has been successfully demonstrated that near infrared
(nIr) spectroscopy can be used for the analysis of pharma-
ceutical products using a single instrument and software
configuration.1–6
the focus of this study was to determine
if the variability observed across different nIr instruments
and chemometric software packages can be an impediment
for the deployment of this technique if different instrument
and software types are to be used for the development of nIr
models.
the united States food and drug administration (uS
fda), the united States pharmacopeia (uSp) and Irvine
pharmaceutical Services, have undertaken a feasibility
study using nIr spectroscopy to identify, by the chemometric
method of multivariate discriminate analysis using principal
component analysis (pca) and factorisation, four formula-
tions of Ibuprofen (200 mg): two branded (advil and Motrin)
and two store-branded (cVS and rite aid) Ibuprofen (200mg)
immediate-release tablets, using three different instrument/
software combinations. the instruments used included
(listed in alphabetical order), a Bruker Vector 22n ft-nIr
spectrometer, Bruker optik GmbH, ettlingen, Germany, a
BĂźchi nIrflex Solids, flawil, Switzerland and a foss XdS
rapid content analyser, laurel, uSa. the software used from
each instrument, respectively, were opuS 5.5, nIrcal 5.2 and
Vision 3.4. the unscrambler 9.7, a stand-alone multivariate
analysis and experimental design software package, was
used as referee software to assist in developing a common
model. the Bruker Vector 22n ftnIr spectrometer and the
foss XdS rapid content analyser used for this study are
both located at the uSp, and the BĂźchi nIrflex Solids spec-
trometer was located at the Irvine pharmaceutical Services.
a study protocol was designed, using a common data set
consisting of these four formulations of Ibuprofen, involving
the three investigating parties, using the above-mentioned
different nIr instrument/software combinations. to our
knowledge, this is the first study of its kind investigating
the variability of nIr instrument/software combinations and
their impact on chemometric models for discriminate drug
analysis.
there were four specific objectives of the study:
(1) demonstrate that by the use of the computerised algo-
rithms of pca and factorisation, three identical, or nearly
identical, spectral libraries can be obtained on three different
instruments. these spectral libraries will be comprised of
the same samples drawn from the same manufactured lots,
scanned the same way and modelled using the same software
settings.
(2) compare four formulations of Ibuprofen: (200 mg)
two branded and two store-branded Ibuprofen (200 mg).
Immediate-release tablets will result in four distinct clusters
in multi-dimensional space.
(3) demonstrate that the calibration models are able to analyse
unknown tablets (validation set) and determine if they are
ibuprofen (200mg) immediate-release tablets from one of the
four known sources of origin that constitute the libraries or
are not from any of these known sources.
(4) the results from each analysis of unknown samples should
give the same predicted results, indifferent of the model used,
and despite instrument, spectra, algorithm, or number of cali-
bration samples, demonstrating that a protocol can be used in
each laboratory, independent of location, device, analyst and
software used to perform the identification.
Experimental
Instrumentation and software
the instruments that were used included a Bruker Vector 22n
ft-nIr spectrometer with a spectral range of 4000–12000cm–1
and spectral resolution set at 8 cm–1
(Bruker optik GmbH,
ettlingen, Germany), BĂźchi nIrflex Solids with a spectral
range 1000–2500nm (10,000–4000cm–1
) and spectral resolu-
tion set at 8cm–1
(BĂźchi, flawil, Switzerland) and foss XdS
rapid content analyser with a spectral range of 400–2500nm
(25,000–4000cm–1
) and a spectral resolution set by the speci-
fied bandpass of 9nm (foss nIrSystems, Inc., laurel, Md, uSa).
the software used for each instrument were, respectively, opuS
5.5, nIrcal 5.2, and Vision 3.4. the unscrambler 9.7, a stand-
alone multivariate analysis and experimental design software
package (caMo technologies Inc., trondheim norway), was
used as referee software to assist in developing a common
model.
Validation of instrument operation
tests for wavelength accuracy, photometric precision, accu-
racy and noise were performed on all of the instruments prior
to tablet measurements as recommended by the uSp general
information chapter <1119> near-Infrared Spectroscopy
(http://www.usp.org/uSpnf/revisions/usp31nf26secondSup-
plement01.html.7
Vendor recommended performance tests for
each instrument were run as well. the tests are routine quality
control tests of the instrument performance and as such were
performed at the prescribed intervals to verify correct instru-
ment performance.
Sample sets
table 1 lists the sources for the samples used for the study. ten
lots of four formulations of Ibuprofen (200mg) were obtained:
two branded (advil, Motrin) and two store-branded (cVS, rite
aid) Ibuprofen (200mg) immediate release tablets. the formu-
lations for each are shown in table 2. three sets of samples
were prepared for each instrument. these sets comprise the
calibration set, test set and validation set. for this study, manu-
facturer lots 1–8 were used for the calibration set and test set
and manufacturer lots 9 and 10 were set aside and used for the
validation sets. While one of the purposes of the protocol was
to control the preparation of the calibration, test, and validation
sample sets, it was found later that the calibration set prepared
Introduction
A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 235
for the foss instrument was missing one tablet (see table 6
footnote) and the validation set for the foss instrument and
the Bruker instrument were different from the validation set
prepared for the BĂźchi instrument, containing only 28 tablets
from the Motrin validation set instead of 40 tablets originally
intended, all due to packaging errors. this error had no effect
on the outcome of the study (model prediction accuracy of vali-
dation samples) as will be shown later in this paper. the sample
sets were prepared as shown in table 3.
Sample collection and analyses
the study protocol was modelled on that described by Scafi
and pasquini.8
the multivariate chemometric approach to
calibrate, validate and maintain the spectral libraries was
based on work proposed by Workman and Brown.9,10
log
1/R (sometimes referred to as absorbance) spectra from the
Bruker and foss instrument, and reflectance spectra from
the BĂźchi instrument, were collected from each tablet for
the calibration and test sets by averaging 32 scans in diffuse
reflectance mode against a 99% reflectance background
reference. the spectra were collected across the following
ranges from each instrument: Bruker samples were
scanned from 11,999cm–1
to 3999cm–1
(833nm to 2500nm),
Büchi samples were scanned from 10,000cm–1
to 4000cm–1
(1000 nm to 2500 nm) and using the foss instrument, each
tablet was scanned from 400nm to 2500nm, photographs of
the tablets are shown in figure 1. the unscrambler 9.7 was
used to investigate the foss calibration set spectral proper-
ties that could be used to distinguish each formulation. tables
4 and 5 show the series of spectral pre-treatments that were
tested for use in order to achieve a pca that could be used
to determine the identity of the samples from the validation
sets. the test sets were used to optimise the prediction of
the final models before being used to predict the validation
sets. the Bruker software, opuS 5.5, uses the factorisa-
tion method to model and so the calibration set from this
instrument, while subjected to pca in the unscrambler 9.7,
required a different approach when constructed in the opuS
5.5 software. table 6 lists the model values for the calibration
modelling approach.
Development of NIR
calibrations
Based on observations from the unscrambler 9.7 exploration,
a Savitzky–Golay, first derivative, 21 point smoothing, 3rd order
polynomial, from 1000nm to 2500nm and a pca model, using
Mahalanobis distance in the wavelength range from 1400–
1500nm, was postulated as the optimal model to apply to the
Bruker, BĂźchi and foss calibration data sets. figures 2(a), 2(b)
and 2(c) show that four clusters can easily be separated from
each other in all three data sets. the following discussions
Lot Brand—Advil Brand—Motrin Generic—CVS Generic—Rite Aid
1 B946681 pca189 6ee0102 p45032
2×200=400 tablets 3×100=300 tablets 2×250=500 tablets 1×500=500 tablets
2 B87154 lla103 6He0515 p44389
2×200=400 tablets 3×100=300 tablets 4×100=400 tablets 1×500=500 tablets
3 B94669 pca112 7Be0119 p424477
3×100=300 tablets 3×100=300 tablets 1×750=750 tablets 1×250=250 tablets
4 B27624 pca226 7ae0039 p44686
4×75=300 tablets 10×24=240 tablets 1×750=750 tablets 1×250=250 tablets
5 B91364 pBa123 7ce0268 p14360
2×150=300 3×100=300 tablets 1×750=750 tablets 3×100=300 tablets
6 B73322 pBa194 7ae0699 p42476
2×150=300 tablets 3×100=300 tablets 1×500=500 tablets 3×100=300 tablets
7 B33863 paa016 6le0478 p42498
3×100=300 tablets 5×50=250 tablets 1×500=500 tablets 2×120=240 tablets
8 B91414 pBa186 7ae0270 p44151
2×200=400 tablets 5×50=250 tablets 3×100=300 tablets 5×50=250 tablets
9 B98483 pea106 6Ge0118 p44688
9×24=216 tablets 2×100=200 tablets 1×500=500 tablets 5×50=250 tablets
10 B91386 lla329 7Be0606 p42058
2×200=400 tablets 2×100=200 tablets 1×500=500 tablets 2×100=200 tablets
Table 1. Sources of Ibuprofen (200mg) samples used for this study.
236 An NIR Comparison of Method Development Approaches Using a Drug Product
track the model development in the opuS 5.5, nIrcal 5.2 and
Vision 3.4 software packages.
Pre-treatment I: baseline correction
Based upon the exploration of data pre-treatments shown in
tables 4 and 5, a baseline correction using the function:
f(x)=x–min(X),
where x is a variable and X denotes all selected variables for
this sample, was first applied in the unscrambler 9.7. a base-
line correction was shown to be the simplest pre-treatment
applied to the calibration data sets to effect separation of all
four formulations, although barely. furthermore, it was deter-
mined that while it is possible to apply a baseline transform
function to the foss calibration set using the Vision 3.4 soft-
ware, the same mathematical transformation was not found
to be available in the Bruker or the BĂźchi software.
using the Bruker opuS 5.5 software, the closest approxima-
tion to the baseline offset function used in the unscrambler 9.7
would be a normalisation function. this would then be followed
by the Savitzky–Golay, first derivative, 21 point smoothing,
3rd order polynomial, from 1000nm to 2500nm, followed by
factorisation (which is a Bruker pca algorithm using euclidian
distance within the opus Ident set-up module where the cali-
bration set resides. It was further noted that the default first
derivative transformation for the opuS 5.5 software is a cubic
polynomial equation.
the BĂźchi nIrcal 5.2 software required the use of several
spectra manipulations; to reverse the x-axis, orient the spectra
in the same direction as they are for the foss and Bruker data,
prior to performing the pca to the data [figures 3(a), 3(b) and
3(c)]. the Savitzky–Golay first derivative transformation for
the nIrcal 5.2 software is also a cubic polynomial equation.
In addition to other derivative functions, the nIrcal software
allows the use of customisable linear filters, which allowed
matching to the same derivative treatment used in the other
software packages. the resulting filters for the BĂźchi calibra-
tion set are listed in table 6.
Advil Motrin CVS-Ibuprofen Rite Aid-Ibuprofen
acelytated monoglycerides
Beeswax
propylene glycol
cellulose
lactose
Sodium starch glycolate
ethoxyethanol
lecithin
pharmaceutical glaze
povidone
Semithicone
Sodium benzoate
Sodium lauryl sulphate
Sucrose
fd&c yellow #6
Magnesium stearate
polydextrose
polyethylene glycol
fd&c yellow #6
Magnesium stearate
polydextrose
polyethylene glycol
croscarmellose sodium
Microcrystalline cellulose
pharmaceutical shellac
pregelatinised starch
carnauba wax
Shellac
ppregelatinised starch
carnauba wax
Hypromellose
carnauba wax
Hypromellose
croscarmellose sodium
Microcrystalline cellulose
Hypromellose
Iron oxides
Silicon dioxide
corn starch
Ibuprofen
Stearic acid
titanium dioxide
Iron oxides
colloidal silicon dioxide
corn starch
Ibuprofen
Stearic acid
titanium dioxide
fumed silica gel
corn starch
Ibuprofen
Stearic acid
titanium dioxide
Iron oxides
colloidal silicon dioxide
corn starch
Ibuprofen
Stearic acid
titanium dioxide
Table 2. Formulations of Ibuprofen (200mg) samples.
A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 237
Pre-treatment II: Savitzky–Golay first
derivative, 21 point smoothing, cubic
polynomial
the smoothing and differentiation of paired data by the proce-
dure of simplified least squares, now called the Savitzky–
Golay filter,11
was developed for the removal of the random
noise from paired data sets, such as that obtained for nIr,
log 1/R versus wavelength. for the data sets from this study,
it can be shown that the calibration by pca or factorisation is
most affected by sampling variation manifested as baseline
offsets. the offsets result in significant overlap of spectra
with closely resembling spectral features, which cannot be
completely resolved in spectral space. tables 4 and 5 show
a systematic approach of pre-treating the calibration sample
spectra. the pre-treatments were applied and investigated
for their ability to remove sampling variation from the spectra.
the plus sign indicates that the samples from these formula-
tions can be separated into distinct clusters from the other
samples of a different formulation using the associated pre-
treatment, whereas the minus sign indicates that samples
from these formulations cannot be separated into distinct
clusters from the other samples of a different formulation
using the associated pre-treatment. the contribution of the
variability from the samples can be seen in figures 3(a), 3(b)
and 3(c). the baseline offset is evident in the exploded plot of
the spectra in figure 4 and the effect of smoothing and deriv-
ative pre-treatment is also evident as well (note: the number
of data points applied for smoothing the spectra during deri-
vatisation were investigated and 21 data points was found
to be optimal for discerning spectral features of the four
formulations in spectral space). particularly noticeable at
the wavelength 1440nm, one can observe a sharp band in the
untreated spectra, which is attributed to Innovator a samples
(advil) [note: this is due to the sucrose in the formulation
as indicated in table 2. the origin of this unique band is
discussed by davies and Miller.12
the impact of band reso-
lution from dispersive and ft is most noticeable in figure
4, which is the expanded view of the first derivative and
smoothed spectra. the result of applying the Savitzky–Golay
first derivative, 21 point smoothing filter is that it changes
the directions of the of the Generic a and Generic B spectra
relative to both the Innovator a and Innovator B spectra in the
region between 1400nm and 1500nm.
Principal component calculations
the pca models [either by Mahalanobis distance (Md) for
the foss and BĂźchi instruments or euclidian distance (ed)
for the Bruker instrument) resulted in correct and accurate
predictions of all three validation sets. Mahalanobis distance
is the statistical distance taking into account the variance of
each variable and the correlation coefficients. In the case of
a single variable, it is the square of the distance (between two
objects, or between an object and the centroid) divided by the
variancea
euclidian distance is simply the distance between
Laboratory/instrument Experimental
samples
Innovator
manufacturer
Advil
Innovator
manufacturer
Motrin
Generic
manufacturer
CVS
Generic
manufacturer
Rite Aid
Total
united States
pharmacopeia/
foss
calibration Set 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 192
test Set 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 64
Validation set 20 tabs×2 lots=40 14 tabs×2 lots=28 20 tabs×2 lots=40 20 tabs×2 lots=40 148
united States
pharmacopeia/
Bruker
calibration Set 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 192
test Set 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 64
Validation set 20 tabs×2 lots=40 14 tabs×2 lots=28 20 tabs×2 lots=40 20 tabs×2 lots=40 148
fda/Irvine pharmaceutical
Services, Inc./
BĂźchi
calibration Set 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 192
test Set 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 64
Validation set 20 tabs×2 lots=40 20 tabs×2 lots=40 20 tabs×2 lots=40 20 tabs×2 lots=40 160
total 312 288 312 312 1224
Table 3. Study design.
Figure 1. Ibuprofen (200mg) tablets. Upper left, Advil, upper
right, Motrin, lower left CVS, and lower right, Rite Aid.
238 An NIR Comparison of Method Development Approaches Using a Drug Product
two variables and is calculated as the arithmetic difference,
i.e. variable 1–variable 2. for a bi-variate model, the squared
distances between two vectors in multi-dimensional space are
the sum of squared differences in their coordinates. as noted
previously, the algorithms used by each software were different
by the calculations employed for score distances. the signifi-
cance of this difference is that, even though the distances were
calculated in the principal component space (pc space) for the
foss and BĂźchi calibration sets, and in wavelength space for
the Bruker calibration set, on similarly pre-treated spectra,
using the same wavelength range, and on different number of
principal components, the models gave identical predictions
for the validation sets (see table 7). the reason that the pca
calculation is invariant in either wavelength space or pc space
is explained by de Maesschalck et al.6
the supervised training by pca, a discriminant analysis
technique, relies on the measurement of distances between
objects in order to achieve classification. as noted by de
Maesschalck et al., this can occur in wavelength space or
in pc space, and on normalised or unnormalised spectra.
another factor that de Maesschalck et al. point out, that may
not be obvious to a casual user of nIr techniques, is that
“the Md and the ed can also be calculated using a smaller
number of latent variables (pcs) obtained after pca analysis
instead of the original variables. In this case, the Md, however,
does not need to correct for the covariance between the vari-
ables, since pcs are by definition orthogonal (uncorrelated).
However, the way each of the residual pcs is weighted in the
computation of the distance must be taken into account.” the
paper goes on to clarify the relationship between the ed and
the Md, particularly how each is calculated in the original
variable space and the pc space. It should be noted here that
the Bruker software, opuS 5.5, utilises an approach referred
to as factorisation, which is explained as a spectral distance
calculation. table 7 gives the model values for each calibra-
tion from each instrument and the camo software. figure 4
shows the expanded view of the derivative spectra,
Spectra treatment
(1000–2500nm)
Cluster
Innovator A Innovator B Generic A Generic B
untreated spectra + + – –
Baseline corrected + + + +
first derivative + – + +
Second derivative + + – –
Baseline corrected
first derivative
+ + – –
Baseline corrected
Second derivative
+ – + +
*
pre-treatments applied to the raw spectra that resulted in a separate and distinct cluster for any member of the four sample sets following principle com-
ponent analysis are denoted by a plus sign, and pre-treatments that did not result in separate and distinct clusters are denoted by a minus sign. plus signs,
in bold, indicates that all four sample sets were successfully separated
Table 4. Data pretreatments from 1000nm to 2500nm.
Spectra treatment
(1400–1500nm)
Cluster
Innovator A Innovator B Generic A Generic B
untreated spectra + – – –
Baseline corrected + + – –
first derivative + + – –
Second derivative + – – –
Baseline corrected
first derivative
+ + + +
Baseline corrected
Second derivative
+ – – –
**
See footnote from table 4
Table 5. Data pre-treatments from 1400nm to 1500nm.
a
from a glossary provided with the permission of Bryan prazen of the
Synovec Group, department of chemistry, university of Washington.
found at http://www.spectroscopynow.com/coi/cda/detail.cda?id=101
18&type=educationfeature&chId=9&page=1
A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 239
Unscrambler 9.7 Vision 3.4 OPUS 5.5 NIRCal 5.2
calibration set 191 191* 192 192
test set 64 64 64 64
Validation set 148 148 148 160
raw spectra range 400–2499.50nm 400–2499.50nm 11999.86–3999.952cm–1
4000–10000cm–1
number of raw
data points
4200 4200 2075 1501
Baseline
transform**
Baseline offset
f(x)=x–min(X)
(1000–2499.50nm)
Math treatments
baseline correction
(1000nm)
Manipulate
offset normalisation†
after transforming
spectra from
wavenumber to
wavelength
(1000–2500nm)
add constant,
•
constant=–1000
Shift neg to
•
0, 2500–1000
(total 1501/1501)
n/a n/a change from
wavenumber to wave-
length
change from
wavenumber to
wavelength
n/a n/a n/a absorbance log10(1/×)
Savitzky–Golay
first derivative
(SG 1st deriv.)
(1000–25000nm),
using a 3rd order
polynomial,
21 point smoothing
Modify /transform/
derivatives/S. Golay/
variables
(1000nm–2500nm)/
1st
derivative/
21 points smoothing/
3rd
order polynomial
Math treatments
Savitzky–Golay
first derivative/
region
minimum/1000nm/
region maxi-
mum/ 2500nm/21
point, cubic spline
polynomial
evaluate/Setup
Identity test method/
load method/
parameters/
preprocessing/first
derivative 21 points/
regions/7695.18cm–1
–
6248.72cm–1
, cubic
spline polynomial
linear filter (84075,
10032, –43284,
–78176, –96947,
–101900, –95338,
–79564, –56881,
–29592, 0, 29592,
56881, 79564, 95338,
101900, 96947, 78176,
43284, –10032,
–84075, 3634092)
pca task Mahalanobis
distance in principal
component
Space/1400–1500nm
Mahalanobis
distance in prin-
cipal component
space/spectral
filtering/wavelength
min.1400nm/
wavelength max.
1500nm
pcs: 5
threshold: probability
level: 0.999
factorisation: six
factors (in wavelength
space)
threshold: 0.25/1400–
1500nm
Mahalanobis
distance in principal
component
Space/1400–1500nm
pcs:3
*
only seven tablets were packaged for sample 6ufGB (lot 6 uSp/foss/Generic B–rite aid), all other lots from ufGB: 1,2,3,4,5,7 and 8 were packaged to
contain eight tablets, six for the calibration set and two for the test set.
**
Since the baseline algorithms were all significantly different and do not give sufficiently similar spectral pre-treatments, it was decided to drop this step
and proceed directly to the derivative/smoothing step on all calibration data.
†
the attempt to use normalisation to match the baseline correction from the Vision 3.4 and unscrambler 9.7 software was met with another unexpected
obstacle which forced the investigators to not use a baseline correction as the first step of pre-treating the calibration spectra. the normalisation pre-
treatment algorithm resides in a different module than the Ident module where the calibration model resides. In order to carryout the pre-treatment on the
calibration and subsequent unknown samples, a macro that would pre-process the spectra and automatically load those spectra into opuS Ident as well
as being able to analyse an unknown sample using offset normalisation spectral pre-processing and automatically output a result, would be required. Since
this is only a requirement of the opuS 5.5 software, it was decided to drop this step and proceed directly to the derivative/smoothing step on all calibration
data.
Table 6. Model values for each calibration.
240 An NIR Comparison of Method Development Approaches Using a Drug Product
2b
Innovator B
Innovator A
Generic B
Generic A
2a
Innovator A
Innovator B
Generic B
Generic A
2c
Innovator A
Innovator B
Generic B
Generic A
Figure 2. (a) BĂźchi PCA Score Plot (Unscrambler 9.7) using, (b) Foss PCA Score Plot (Unscrambler 9.7) and (c) Bruker PCA Score Plot
(Unscrambler 9.7).
(a)
(b)
(c)
A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 241
3
log
(1/R)
Innovator A
(b)
Log
(1/R)
Wavelength
Log
(1/R)
(c)
Figure 3. (a) BĂźchi raw spectra plot (NIRCal 5.2), (b) Foss raw spectra plot (Vision 3.4) and (c) Bruker raw spectra plot (OPUS 5.5).
(a)
(b)
(c)
242 An NIR Comparison of Method Development Approaches Using a Drug Product
Figure 4. Expanded view of BĂźchi, Foss, and Bruker
Derivative Spectra of Calibration and Test Set
BĂźchi
(a)
(c)
(d)
(b)
BĂźchi
18
Figure 4. Expanded view of BĂźchi, Foss, and Bruker
Derivative Spectra of Calibration and Test Set
BĂźchi
Foss
(a)
(b)
(c)
(d)
Foss
.0010
.0005
0
.0005
.0010
.0015
.0020
1400.61nm1411.29nm1422.12nm1433.13nm1444.31nm1455.66nm1467.19nm1478.91nm1490.82nm
Variables
Line Plot
Bruker
(a)
(b)
(c)
(d)
Bruker
Figure 4. Expanded view of BĂźchi, Foss and Bruker derivative spectra of calibration and test set: (a) Innovator A, (b) Generic A, (c)
Generic B and (d) Innovator B.
A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 243
Validation results and
discussion
table 7 lists a summary of the results of the validations sets
from the, Bruker, BĂźchi and foss instruments, respectively.
the results demonstrate that all four objectives of the study
were met. four distinct computerised algorithms of pca
and factorisation were used to construct three separate
spectral libraries from a common calibration and test set,
each residing on different instruments in different laborato-
ries and one residing on stand-alone software, the referee
model. the use of the “referee” model helped establish the
baseline values for the model parameters such as spectral
range, spectra pre-treatment, calibration algorithms, etc.
that could be used across the various software platforms
despite the variations in instrument bandwidth, spectral data
points, algorithms for smoothing, derivative and other calcu-
lations. Specifically, while it was found that a pca model
based on calculating Mahalanobis distance in pc space,
second derivative Gap 20, second order polynomial was suffi-
cient to model the calibration and test set on two of the three
instruments, it became evident early on in the experiment
that by using exactly the same model consisting of the same
algorithm calculation and same pre-treatment routine was
not possible. However, by the use of the referee software
which, when loaded with the calibration set from all three
instruments, one could easily determine how to optimise the
spectral range, and smoothing and derivative pre-treatment
parameters in order to achieve similar calibration and test
set parameters. the limiting calibration setting was found to
be the polynomial function, which effects how many points
are used to calculate the smoothing and derivative function.
as was shown in figure 4, there is a very narrow spectral
range (100 nm or 1446.46 cm–1
on which separation can be
made on the four data sets in spectra space.
Having met the first objective with some deviations, the
second, third and fourth objectives were easily met, as
the three models were successful at predicting the vali-
dation sets for each instrument, resulting in four distinct
clusters in multidimensional space, each cluster repre-
senting the innovator and generic brand Ibuprofen formula-
tions. additionally, samples from Generic a lots 6Ge0118
(cVS Brand) and p42058 (rite aid) were correctly identified
as not belonging to Generic a lots comprising the cali-
bration sets for all three data sets. It was observed that
these samples have a distinct banding pattern in the region
from 1400 nm–1500 nm from all other sample spectra. the
most likely cause of this is assuredly due to the presence
of a component that absorbs in the nIr region not found in
those samples comprising the calibration set. as a result of
the new band in the critical region between 1400–1500 nm,
these samples also produce a fifth separate and distinct
cluster in pc space.
It must be mentioned here that this experiment, being
performed in different laboratories within different organisa-
tional cultures, was totally driven by a protocol that was jointly
crafted and agreed upon prior to execution. the authors feel
that this is a key point since this experiment was designed to
meet specific objectives, despite the fact that the instruments,
software and personnel were at different locations. this, of
course, was not the major factor. the major factor was trying
to coordinate all of the steps within the protocol from within
different organisations. While nIr experiments are generally
described as non-destructive, fast and cost effective, when
done on a large scale they require planning, discussion and
coordination. this is rarely mentioned. It is hoped that this
example may serve as a model for future applications that
involve large sample sets and multi-organisations using
multiple instrument–software combinations. the current
global pharmaceutical counterfeiting problem is one area that
should benefit from examples like the one demonstrated in
this paper.
Conclusions
the specific objective of the study was to obtain log 1/R spectra
of four formulations of Ibuprofen (200mg) from two branded
Instrument Lot Innovator A
tablets
Innovator B
tablets
Generic A Generic B
foss 9 20 20 0 20
10 20 8 0 20
Bruker 9 20 20 0 20
10 20 8 0 20
BĂźchi 9 20 20 0 20
10 20 20 0 20
twenty tablets were tested for each combination of instrument and lot, except for lot 10 of Innovator B where only eight were tested on the foss and Bruker
instruments
Table 7. Number of tablets identified.
244 An NIR Comparison of Method Development Approaches Using a Drug Product
and two store-branded Ibuprofen (200mg) immediate-release
tablets and use them to design, develop, validate and deploy a
calibration model that can subsequently be used to correctly
classify by discriminant analysis using pca, log 1/R spectra
from unknown samples (validation set) on nIr instruments of
varying types and software configurations.
this experiment was designed to study the impact that nIr
instrument hardware and software configurations have on
nIr method development. Several variables were detected
and assessed. Spectrometer types, sample holders, spec-
tral acquisition settings, data pre-treatments and pca algo-
rithms were studied. nIr method development was attempted
by three different analysts on three different instruments
located in two different laboratories. It was determined
that even though identical samples were used for model-
ling and prediction, and the same calibration approach was
tried on the accompanying software, spectra differences were
observed due to the number of data points, and that these
impact the ability to perform the same or similar data pre-
treatments in different software. furthermore, the algorithms
employed in each software platform may limit the ability to
deploy a method developed on any single software platform
to be deployed across different software platforms. However,
despite the differences observed, it was possible to find a
common method using each software that enabled accurate
predictions of the validation samples when each model was
used independent of instrument and software configuration.
Knowing the sources of variability that impact the log 1/R
nIr spectrum will minimise the overall prediction varia-
bility and increase the likelihood of correctly classifying by
discriminant analysis, the log 1/R spectra from unknown
samples subsequently measured and compared to the spec-
tral library and classified by the calibration model, when
model parameters are used on different instrument and
software combinations.
It was found that using Savitzky–Golay, first derivative, 21
point smoothing, third order polynomial, pre-treated spectra
and either pca or factorisation model (either by Md in prin-
cipal component space for the foss and BĂźchi or ed in wave-
length space for the Bruker factorisation method) resulted
in different models but possessing the same accuracy capa-
bilities (100%) for predicting samples comprising similar
validation sets. one validation sample set, a store-branded
Ibuprofen (200 mg) immediate-release tablet, was correctly
identified as not belonging to the samples represented in the
calibration set by all three models. Based on these results,
and despite differences in instrument configuration [disper-
sive or fourier transform (ft)], number of spectral data
points, pca or factorisation algorithms and validation model-
ling approach, exact and accurate spectroscopic results can
be achieved using nIr spectroscopy for discriminate analysis.
More importantly, this study shows that the same nIr method
spectral range and pre-treatment parameters can be used
and that nearly the same multivariate models can be obtained,
despite instrumental and software differences, to accurately
predict the identity of pharmaceutical dosage forms.
Acknowledgements
the authors are grateful for the contributions to this project
from each of the following scientists: darrell abernathy,
rebecca allen, todd cecil, Walter Hauck, andrea Iwanik,
Steven lane, Samir Wahab and patricia White from uSp,
rudy flach, charles petersen and Heather coffin from Irvine,
William Martin from the fda, Michael Surgeary from BĂźchi,
cynthia Kradjel from Integrated technical Solutions and Verne
Hebard from Bruker.
References
1. p. de peinder, M.J. Vredenbregt, t. Visser and d. de
Kaste, “detection of lipitor counterfeits: a comparison of
nIr and raman spectroscopy in combination with che-
mometrics”, J. Pharmaceut. Biomed. Anal. 47, 688 (2008).
doi: 10.1016/j.jpba.2008.02.016
2. J. luypaerta, d.l. Massart and y. Vander Heyden,
“near-infrared spectroscopy applications in pharma-
ceutical analysis”, Talanta 72, 865 (2007). doi: 10.1016/j.
talanta.2006.12.023
3. y. roggo, p. chalusa, l. Maurera, c. lema-Martineza, a.
edmonda and n. Jenta, “a review of near infrared spec-
troscopy and chemometrics in pharmaceutical technolo-
gies”, J. Pharmaceut. Biomed. Anal. 44, 683 (2007). doi:
10.1016/j.jpba.2007.03.023
4. a.K. deisingh, “pharmaceutical counterfeiting”, Analyst
130, 271 (2005). doi: 10.1039/b407759h
5. c.a. anderson, J.K. drennen and e.W. ciurczak,
“pharmaceutical applications of near infrared spec-
troscopy”, in Handbook of Near-Infrared Analysis”, 3rd
edn, (practical Spectroscopy Series Volume 35) ed by
d.a. Burns and e. W. ciurczak. crc press, Boca raton,
florida, uSa, p. 585 (2008).
6. r. de Maesschalck, d. Jouan-rimbaud and d.l.
Massart, “tutorial—the Mahalanobis distance”,
Chemometr. Intell. Lab Syst. 50, 1 (2000). doi: 10.1016/
S0169-7439(99)00047-7
7. USP general information chapter <1119> Near-Infrared
Spectroscopy (http://www.usp.org/uSpnf/revisions/
usp31nf26secondSupplement01.html.
8. S.H.f. Scafi and c. pasquini, “Identification of counterfeit
drugs using near-infrared spectroscopy”, Analyst 126,
2218 (2001). doi: 10.1039/b106744n
9. J. Workman Jr and J. Brown, “a new standard practice
for multivariate, quantitative infrared analysis-part I”,
Spectroscopy 11(2), 48 (1996).
10. J. Workman Jr and J. Brown, “a new standard practice
for multivariate, quantitative infrared analysis-part II”,
Spectroscopy 11(9), 24 (1996).
11. a. Savitzky and M.J.e. Golay, “Smoothing and
differentiation of data by simplified least squares pro-
cedures,” Anal. Chem. 36, 1627 (1964). doi: 10.1021/
ac60214a047

More Related Content

Similar to A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms

NEAFS_DSA_FTIR_Image_NBOMes and LSD_Poster
NEAFS_DSA_FTIR_Image_NBOMes and LSD_PosterNEAFS_DSA_FTIR_Image_NBOMes and LSD_Poster
NEAFS_DSA_FTIR_Image_NBOMes and LSD_Poster
Amanda Moore
 
Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery
Sean Ekins
 
Nanobiosensor for diclofop detection based on chemically modified afm probes ...
Nanobiosensor for diclofop detection based on chemically modified afm probes ...Nanobiosensor for diclofop detection based on chemically modified afm probes ...
Nanobiosensor for diclofop detection based on chemically modified afm probes ...
Grupo de Pesquisa em Nanoneurobiofisica
 
Clasificacion biofarmaceutica y benchmarketing
Clasificacion biofarmaceutica y benchmarketingClasificacion biofarmaceutica y benchmarketing
Clasificacion biofarmaceutica y benchmarketing
Hector Gonzalez
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
Kamel Mansouri
 
Anomaly Detection in Fruits using Hyper Spectral Images
Anomaly Detection in Fruits using Hyper Spectral ImagesAnomaly Detection in Fruits using Hyper Spectral Images
Anomaly Detection in Fruits using Hyper Spectral Images
ijtsrd
 
Drug discovery and evaluation safety and pharmacokinetic assays
Drug discovery and evaluation safety and pharmacokinetic assaysDrug discovery and evaluation safety and pharmacokinetic assays
Drug discovery and evaluation safety and pharmacokinetic assays
Springer
 
Analytical method development and validation of novel Levofloxacin, Pruliflox...
Analytical method development and validation of novel Levofloxacin, Pruliflox...Analytical method development and validation of novel Levofloxacin, Pruliflox...
Analytical method development and validation of novel Levofloxacin, Pruliflox...
Dr. Ravi Sankar
 

Similar to A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms (20)

NEAFS_DSA_FTIR_Image_NBOMes and LSD_Poster
NEAFS_DSA_FTIR_Image_NBOMes and LSD_PosterNEAFS_DSA_FTIR_Image_NBOMes and LSD_Poster
NEAFS_DSA_FTIR_Image_NBOMes and LSD_Poster
 
Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery
 
Nanobiosensor for diclofop detection based on chemically modified afm probes ...
Nanobiosensor for diclofop detection based on chemically modified afm probes ...Nanobiosensor for diclofop detection based on chemically modified afm probes ...
Nanobiosensor for diclofop detection based on chemically modified afm probes ...
 
Vierordt’s method
Vierordt’s methodVierordt’s method
Vierordt’s method
 
Clasificacion biofarmaceutica y benchmarketing
Clasificacion biofarmaceutica y benchmarketingClasificacion biofarmaceutica y benchmarketing
Clasificacion biofarmaceutica y benchmarketing
 
An Investigation Of The RWPE Prostate Derived Family Of Cell Lines Using FTIR...
An Investigation Of The RWPE Prostate Derived Family Of Cell Lines Using FTIR...An Investigation Of The RWPE Prostate Derived Family Of Cell Lines Using FTIR...
An Investigation Of The RWPE Prostate Derived Family Of Cell Lines Using FTIR...
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
 
Anomaly Detection in Fruits using Hyper Spectral Images
Anomaly Detection in Fruits using Hyper Spectral ImagesAnomaly Detection in Fruits using Hyper Spectral Images
Anomaly Detection in Fruits using Hyper Spectral Images
 
C044041723
C044041723C044041723
C044041723
 
An evaluation of machine learning algorithms coupled to an electronic olfact...
An evaluation of machine learning algorithms coupled to an  electronic olfact...An evaluation of machine learning algorithms coupled to an  electronic olfact...
An evaluation of machine learning algorithms coupled to an electronic olfact...
 
UV Spectrophotometric Method Development and Validation for Quantitative Esti...
UV Spectrophotometric Method Development and Validation for Quantitative Esti...UV Spectrophotometric Method Development and Validation for Quantitative Esti...
UV Spectrophotometric Method Development and Validation for Quantitative Esti...
 
Priyakant_Author_IJTLD
Priyakant_Author_IJTLDPriyakant_Author_IJTLD
Priyakant_Author_IJTLD
 
Toxicity studies
Toxicity studiesToxicity studies
Toxicity studies
 
Drug discovery and evaluation safety and pharmacokinetic assays
Drug discovery and evaluation safety and pharmacokinetic assaysDrug discovery and evaluation safety and pharmacokinetic assays
Drug discovery and evaluation safety and pharmacokinetic assays
 
Chromatographic Analysis of Pharmaceuticals Second Edition by John A. Adamovics
Chromatographic Analysis of Pharmaceuticals Second Edition by John A. AdamovicsChromatographic Analysis of Pharmaceuticals Second Edition by John A. Adamovics
Chromatographic Analysis of Pharmaceuticals Second Edition by John A. Adamovics
 
Analytical method development and validation of novel Levofloxacin, Pruliflox...
Analytical method development and validation of novel Levofloxacin, Pruliflox...Analytical method development and validation of novel Levofloxacin, Pruliflox...
Analytical method development and validation of novel Levofloxacin, Pruliflox...
 
Illicit Drugs and Homemade Explosives Identification using FTIR Microscopy
Illicit Drugs and Homemade Explosives Identification using FTIR MicroscopyIllicit Drugs and Homemade Explosives Identification using FTIR Microscopy
Illicit Drugs and Homemade Explosives Identification using FTIR Microscopy
 
Role of instrumentation in Pharmacology
Role of instrumentation in PharmacologyRole of instrumentation in Pharmacology
Role of instrumentation in Pharmacology
 
Computer software used in PHARMCAY AND PHARMACEUTICAL RESEARCH
Computer software used in PHARMCAY AND PHARMACEUTICAL RESEARCHComputer software used in PHARMCAY AND PHARMACEUTICAL RESEARCH
Computer software used in PHARMCAY AND PHARMACEUTICAL RESEARCH
 
Modeling and optimization of nebulizers' performance in non-invasive ventilat...
Modeling and optimization of nebulizers' performance in non-invasive ventilat...Modeling and optimization of nebulizers' performance in non-invasive ventilat...
Modeling and optimization of nebulizers' performance in non-invasive ventilat...
 

More from Simar Neasy

More from Simar Neasy (20)

014 Sample Of College Essays Idas Ponderresear
014 Sample Of College Essays Idas Ponderresear014 Sample Of College Essays Idas Ponderresear
014 Sample Of College Essays Idas Ponderresear
 
A Poster With An Image Of Writin. Online assignment writing service.
A Poster With An Image Of Writin. Online assignment writing service.A Poster With An Image Of Writin. Online assignment writing service.
A Poster With An Image Of Writin. Online assignment writing service.
 
Essay, Essay Writing, Essay Writing Help, Essay Writing T
Essay, Essay Writing, Essay Writing Help, Essay Writing TEssay, Essay Writing, Essay Writing Help, Essay Writing T
Essay, Essay Writing, Essay Writing Help, Essay Writing T
 
A4 Lined Paper Image,Lined Paper With Blue Lines Colleg
A4 Lined Paper Image,Lined Paper With Blue Lines CollegA4 Lined Paper Image,Lined Paper With Blue Lines Colleg
A4 Lined Paper Image,Lined Paper With Blue Lines Colleg
 
Term Paper Bibliography Format - Home - Citation G
Term Paper Bibliography Format - Home - Citation GTerm Paper Bibliography Format - Home - Citation G
Term Paper Bibliography Format - Home - Citation G
 
001 Essay Example Informative Examples For
001 Essay Example Informative Examples For001 Essay Example Informative Examples For
001 Essay Example Informative Examples For
 
Essay On First Day Of School - Brain. Online assignment writing service.
Essay On First Day Of School - Brain. Online assignment writing service.Essay On First Day Of School - Brain. Online assignment writing service.
Essay On First Day Of School - Brain. Online assignment writing service.
 
Research Paper Abstract Templates At Allbusinesste
Research Paper Abstract Templates At AllbusinessteResearch Paper Abstract Templates At Allbusinesste
Research Paper Abstract Templates At Allbusinesste
 
014 Essay Example Comparison And Contrast
014 Essay Example Comparison And Contrast014 Essay Example Comparison And Contrast
014 Essay Example Comparison And Contrast
 
Project Proposal Writing, Project Proposal Example,
Project Proposal Writing, Project Proposal Example,Project Proposal Writing, Project Proposal Example,
Project Proposal Writing, Project Proposal Example,
 
Editing College Essays Matter. Online assignment writing service.
Editing College Essays Matter. Online assignment writing service.Editing College Essays Matter. Online assignment writing service.
Editing College Essays Matter. Online assignment writing service.
 
Underline Italicize Quote Quotations,. Online assignment writing service.
Underline Italicize Quote Quotations,. Online assignment writing service.Underline Italicize Quote Quotations,. Online assignment writing service.
Underline Italicize Quote Quotations,. Online assignment writing service.
 
Best College Essays Examples College Thesis Wr
Best College Essays Examples College Thesis WrBest College Essays Examples College Thesis Wr
Best College Essays Examples College Thesis Wr
 
Volcano Worksheet Volcano Worksheet, Volcan
Volcano Worksheet Volcano Worksheet, VolcanVolcano Worksheet Volcano Worksheet, Volcan
Volcano Worksheet Volcano Worksheet, Volcan
 
No 1 Nursing Essay Help Wr. Online assignment writing service.
No 1 Nursing Essay Help Wr. Online assignment writing service.No 1 Nursing Essay Help Wr. Online assignment writing service.
No 1 Nursing Essay Help Wr. Online assignment writing service.
 
Oxford StudentS Tips For Essay Writing Part 1
Oxford StudentS Tips For Essay Writing Part 1Oxford StudentS Tips For Essay Writing Part 1
Oxford StudentS Tips For Essay Writing Part 1
 
University Essay Writing Guide. Writin. Online assignment writing service.
University Essay Writing Guide. Writin. Online assignment writing service.University Essay Writing Guide. Writin. Online assignment writing service.
University Essay Writing Guide. Writin. Online assignment writing service.
 
Prospectus Template Research Paper - How To
Prospectus Template Research Paper - How ToProspectus Template Research Paper - How To
Prospectus Template Research Paper - How To
 
Compare And Contrast Introduction Paragraph
Compare And Contrast Introduction ParagraphCompare And Contrast Introduction Paragraph
Compare And Contrast Introduction Paragraph
 
College Essay Paper Writing Services Usa
College Essay Paper Writing Services UsaCollege Essay Paper Writing Services Usa
College Essay Paper Writing Services Usa
 

Recently uploaded

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Recently uploaded (20)

Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 

A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms

  • 1. Journal of near Infrared SpectroScopy 233 ISSn: 0967-0335 Š IM publications llp 2009 doi: 10.1255/jnirs.854 all rights reserved A comparison of near infrared method development approaches using a drug product on different spectrophotometers and chemometric software algorithms Assad Kazeminy,a Saeed Hashemi,a Roger L. Williams,b Gary E. Ritchie,c Ronald Rubinovitzd and Sumit Sene,* a Irvine pharmaceutical Services, Inc., 10 Vanderbilt, Irvine, ca 92618, uSa b united States pharmacopeial convention, 12601 twinbrook parkway, rockville, Maryland 20852, uSa c former united States pharmacopeial convention, 12601 twinbrook parkway, rockville, Maryland 20852, uSa d BĂźchi corporation, 19 lukens drive, new castle, de 19720, uSa e united States food and drug administration, 19701 fairchild, Irvine, ca 92612, uSa. e-mail: sumit_sen@hotmail.com It is well known that spectral variability in near infrared (NIR) spectroscopy can be attributed to the analyst, sample, sample positioning, instrument configuration and software (in both algorithm formats and structures used as well as in the execution of data pre-treatment and analysis). It is often acknowledged that the single largest factor impacting NIR results is sample presentation. However, what is obvious but not often acknowledged is that there are instrumental and software differences as well. These differences, evident in the quality of the spectra, may impact the chemometrics that are subsequently performed and, possibly, the results obtained from the multivariate statistical models. In order to investigate just what are these sources of variability, and just how much these variations may impact the results of the multivariate models for predicting the identification of pharmaceutical dosage forms, a study has been conducted. To the authors’ knowledge, no other systematic study of this kind has been published. In this study, we are interested in learning what variability, if any, the choices for instrument and software have on the development of a NIR method for the identification of pharmaceutical dosage forms. Furthermore, we would like to learn what and how do the choices made early on in the experimental design impact the final quality of the spectra and the resulting multivariate models obtained from these spectra. A study protocol was designed, using a common data set consisting of four formulations of Ibuprofen, involving three investigating parties, namely, US FDA, USP and Irvine Pharmaceutical Services and using three NIR instruments, namely (listed in alphabetical order), a Bruker spectrometer, a BĂźchi spectrometer and a Foss spectrometer. Based on the results and despite differences in instrument configuration [dispersive or Fourier Transform (FT)], number of spectral data points, principal components analysis (PCA) or factorisation algorithms, and valida- tion modelling approach, exact and accurate spectroscopic results can be achieved using NIR spectroscopy for discriminate analysis. More importantly, this study shows that the same NIR method spectral range and pre-treatment parameters can be used, and that nearly the same multivariate models can be obtained, despite instrumental and software differences, to accurately predict the identity of pharmaceutical dosage forms. Keywords: near infrared (NIR) spectroscopy, instrument variability, chemometric software algorithms, multivariate discriminant analysis, PCA A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) received: 9 June 2009 n revised: 30 September 2009 n accepted: 11 october 2009 n publication: 30 october 2009
  • 2. 234 An NIR Comparison of Method Development Approaches Using a Drug Product It has been successfully demonstrated that near infrared (nIr) spectroscopy can be used for the analysis of pharma- ceutical products using a single instrument and software configuration.1–6 the focus of this study was to determine if the variability observed across different nIr instruments and chemometric software packages can be an impediment for the deployment of this technique if different instrument and software types are to be used for the development of nIr models. the united States food and drug administration (uS fda), the united States pharmacopeia (uSp) and Irvine pharmaceutical Services, have undertaken a feasibility study using nIr spectroscopy to identify, by the chemometric method of multivariate discriminate analysis using principal component analysis (pca) and factorisation, four formula- tions of Ibuprofen (200 mg): two branded (advil and Motrin) and two store-branded (cVS and rite aid) Ibuprofen (200mg) immediate-release tablets, using three different instrument/ software combinations. the instruments used included (listed in alphabetical order), a Bruker Vector 22n ft-nIr spectrometer, Bruker optik GmbH, ettlingen, Germany, a BĂźchi nIrflex Solids, flawil, Switzerland and a foss XdS rapid content analyser, laurel, uSa. the software used from each instrument, respectively, were opuS 5.5, nIrcal 5.2 and Vision 3.4. the unscrambler 9.7, a stand-alone multivariate analysis and experimental design software package, was used as referee software to assist in developing a common model. the Bruker Vector 22n ftnIr spectrometer and the foss XdS rapid content analyser used for this study are both located at the uSp, and the BĂźchi nIrflex Solids spec- trometer was located at the Irvine pharmaceutical Services. a study protocol was designed, using a common data set consisting of these four formulations of Ibuprofen, involving the three investigating parties, using the above-mentioned different nIr instrument/software combinations. to our knowledge, this is the first study of its kind investigating the variability of nIr instrument/software combinations and their impact on chemometric models for discriminate drug analysis. there were four specific objectives of the study: (1) demonstrate that by the use of the computerised algo- rithms of pca and factorisation, three identical, or nearly identical, spectral libraries can be obtained on three different instruments. these spectral libraries will be comprised of the same samples drawn from the same manufactured lots, scanned the same way and modelled using the same software settings. (2) compare four formulations of Ibuprofen: (200 mg) two branded and two store-branded Ibuprofen (200 mg). Immediate-release tablets will result in four distinct clusters in multi-dimensional space. (3) demonstrate that the calibration models are able to analyse unknown tablets (validation set) and determine if they are ibuprofen (200mg) immediate-release tablets from one of the four known sources of origin that constitute the libraries or are not from any of these known sources. (4) the results from each analysis of unknown samples should give the same predicted results, indifferent of the model used, and despite instrument, spectra, algorithm, or number of cali- bration samples, demonstrating that a protocol can be used in each laboratory, independent of location, device, analyst and software used to perform the identification. Experimental Instrumentation and software the instruments that were used included a Bruker Vector 22n ft-nIr spectrometer with a spectral range of 4000–12000cm–1 and spectral resolution set at 8 cm–1 (Bruker optik GmbH, ettlingen, Germany), BĂźchi nIrflex Solids with a spectral range 1000–2500nm (10,000–4000cm–1 ) and spectral resolu- tion set at 8cm–1 (BĂźchi, flawil, Switzerland) and foss XdS rapid content analyser with a spectral range of 400–2500nm (25,000–4000cm–1 ) and a spectral resolution set by the speci- fied bandpass of 9nm (foss nIrSystems, Inc., laurel, Md, uSa). the software used for each instrument were, respectively, opuS 5.5, nIrcal 5.2, and Vision 3.4. the unscrambler 9.7, a stand- alone multivariate analysis and experimental design software package (caMo technologies Inc., trondheim norway), was used as referee software to assist in developing a common model. Validation of instrument operation tests for wavelength accuracy, photometric precision, accu- racy and noise were performed on all of the instruments prior to tablet measurements as recommended by the uSp general information chapter <1119> near-Infrared Spectroscopy (http://www.usp.org/uSpnf/revisions/usp31nf26secondSup- plement01.html.7 Vendor recommended performance tests for each instrument were run as well. the tests are routine quality control tests of the instrument performance and as such were performed at the prescribed intervals to verify correct instru- ment performance. Sample sets table 1 lists the sources for the samples used for the study. ten lots of four formulations of Ibuprofen (200mg) were obtained: two branded (advil, Motrin) and two store-branded (cVS, rite aid) Ibuprofen (200mg) immediate release tablets. the formu- lations for each are shown in table 2. three sets of samples were prepared for each instrument. these sets comprise the calibration set, test set and validation set. for this study, manu- facturer lots 1–8 were used for the calibration set and test set and manufacturer lots 9 and 10 were set aside and used for the validation sets. While one of the purposes of the protocol was to control the preparation of the calibration, test, and validation sample sets, it was found later that the calibration set prepared Introduction
  • 3. A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 235 for the foss instrument was missing one tablet (see table 6 footnote) and the validation set for the foss instrument and the Bruker instrument were different from the validation set prepared for the BĂźchi instrument, containing only 28 tablets from the Motrin validation set instead of 40 tablets originally intended, all due to packaging errors. this error had no effect on the outcome of the study (model prediction accuracy of vali- dation samples) as will be shown later in this paper. the sample sets were prepared as shown in table 3. Sample collection and analyses the study protocol was modelled on that described by Scafi and pasquini.8 the multivariate chemometric approach to calibrate, validate and maintain the spectral libraries was based on work proposed by Workman and Brown.9,10 log 1/R (sometimes referred to as absorbance) spectra from the Bruker and foss instrument, and reflectance spectra from the BĂźchi instrument, were collected from each tablet for the calibration and test sets by averaging 32 scans in diffuse reflectance mode against a 99% reflectance background reference. the spectra were collected across the following ranges from each instrument: Bruker samples were scanned from 11,999cm–1 to 3999cm–1 (833nm to 2500nm), BĂźchi samples were scanned from 10,000cm–1 to 4000cm–1 (1000 nm to 2500 nm) and using the foss instrument, each tablet was scanned from 400nm to 2500nm, photographs of the tablets are shown in figure 1. the unscrambler 9.7 was used to investigate the foss calibration set spectral proper- ties that could be used to distinguish each formulation. tables 4 and 5 show the series of spectral pre-treatments that were tested for use in order to achieve a pca that could be used to determine the identity of the samples from the validation sets. the test sets were used to optimise the prediction of the final models before being used to predict the validation sets. the Bruker software, opuS 5.5, uses the factorisa- tion method to model and so the calibration set from this instrument, while subjected to pca in the unscrambler 9.7, required a different approach when constructed in the opuS 5.5 software. table 6 lists the model values for the calibration modelling approach. Development of NIR calibrations Based on observations from the unscrambler 9.7 exploration, a Savitzky–Golay, first derivative, 21 point smoothing, 3rd order polynomial, from 1000nm to 2500nm and a pca model, using Mahalanobis distance in the wavelength range from 1400– 1500nm, was postulated as the optimal model to apply to the Bruker, BĂźchi and foss calibration data sets. figures 2(a), 2(b) and 2(c) show that four clusters can easily be separated from each other in all three data sets. the following discussions Lot Brand—Advil Brand—Motrin Generic—CVS Generic—Rite Aid 1 B946681 pca189 6ee0102 p45032 2×200=400 tablets 3×100=300 tablets 2×250=500 tablets 1×500=500 tablets 2 B87154 lla103 6He0515 p44389 2×200=400 tablets 3×100=300 tablets 4×100=400 tablets 1×500=500 tablets 3 B94669 pca112 7Be0119 p424477 3×100=300 tablets 3×100=300 tablets 1×750=750 tablets 1×250=250 tablets 4 B27624 pca226 7ae0039 p44686 4×75=300 tablets 10×24=240 tablets 1×750=750 tablets 1×250=250 tablets 5 B91364 pBa123 7ce0268 p14360 2×150=300 3×100=300 tablets 1×750=750 tablets 3×100=300 tablets 6 B73322 pBa194 7ae0699 p42476 2×150=300 tablets 3×100=300 tablets 1×500=500 tablets 3×100=300 tablets 7 B33863 paa016 6le0478 p42498 3×100=300 tablets 5×50=250 tablets 1×500=500 tablets 2×120=240 tablets 8 B91414 pBa186 7ae0270 p44151 2×200=400 tablets 5×50=250 tablets 3×100=300 tablets 5×50=250 tablets 9 B98483 pea106 6Ge0118 p44688 9×24=216 tablets 2×100=200 tablets 1×500=500 tablets 5×50=250 tablets 10 B91386 lla329 7Be0606 p42058 2×200=400 tablets 2×100=200 tablets 1×500=500 tablets 2×100=200 tablets Table 1. Sources of Ibuprofen (200mg) samples used for this study.
  • 4. 236 An NIR Comparison of Method Development Approaches Using a Drug Product track the model development in the opuS 5.5, nIrcal 5.2 and Vision 3.4 software packages. Pre-treatment I: baseline correction Based upon the exploration of data pre-treatments shown in tables 4 and 5, a baseline correction using the function: f(x)=x–min(X), where x is a variable and X denotes all selected variables for this sample, was first applied in the unscrambler 9.7. a base- line correction was shown to be the simplest pre-treatment applied to the calibration data sets to effect separation of all four formulations, although barely. furthermore, it was deter- mined that while it is possible to apply a baseline transform function to the foss calibration set using the Vision 3.4 soft- ware, the same mathematical transformation was not found to be available in the Bruker or the BĂźchi software. using the Bruker opuS 5.5 software, the closest approxima- tion to the baseline offset function used in the unscrambler 9.7 would be a normalisation function. this would then be followed by the Savitzky–Golay, first derivative, 21 point smoothing, 3rd order polynomial, from 1000nm to 2500nm, followed by factorisation (which is a Bruker pca algorithm using euclidian distance within the opus Ident set-up module where the cali- bration set resides. It was further noted that the default first derivative transformation for the opuS 5.5 software is a cubic polynomial equation. the BĂźchi nIrcal 5.2 software required the use of several spectra manipulations; to reverse the x-axis, orient the spectra in the same direction as they are for the foss and Bruker data, prior to performing the pca to the data [figures 3(a), 3(b) and 3(c)]. the Savitzky–Golay first derivative transformation for the nIrcal 5.2 software is also a cubic polynomial equation. In addition to other derivative functions, the nIrcal software allows the use of customisable linear filters, which allowed matching to the same derivative treatment used in the other software packages. the resulting filters for the BĂźchi calibra- tion set are listed in table 6. Advil Motrin CVS-Ibuprofen Rite Aid-Ibuprofen acelytated monoglycerides Beeswax propylene glycol cellulose lactose Sodium starch glycolate ethoxyethanol lecithin pharmaceutical glaze povidone Semithicone Sodium benzoate Sodium lauryl sulphate Sucrose fd&c yellow #6 Magnesium stearate polydextrose polyethylene glycol fd&c yellow #6 Magnesium stearate polydextrose polyethylene glycol croscarmellose sodium Microcrystalline cellulose pharmaceutical shellac pregelatinised starch carnauba wax Shellac ppregelatinised starch carnauba wax Hypromellose carnauba wax Hypromellose croscarmellose sodium Microcrystalline cellulose Hypromellose Iron oxides Silicon dioxide corn starch Ibuprofen Stearic acid titanium dioxide Iron oxides colloidal silicon dioxide corn starch Ibuprofen Stearic acid titanium dioxide fumed silica gel corn starch Ibuprofen Stearic acid titanium dioxide Iron oxides colloidal silicon dioxide corn starch Ibuprofen Stearic acid titanium dioxide Table 2. Formulations of Ibuprofen (200mg) samples.
  • 5. A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 237 Pre-treatment II: Savitzky–Golay first derivative, 21 point smoothing, cubic polynomial the smoothing and differentiation of paired data by the proce- dure of simplified least squares, now called the Savitzky– Golay filter,11 was developed for the removal of the random noise from paired data sets, such as that obtained for nIr, log 1/R versus wavelength. for the data sets from this study, it can be shown that the calibration by pca or factorisation is most affected by sampling variation manifested as baseline offsets. the offsets result in significant overlap of spectra with closely resembling spectral features, which cannot be completely resolved in spectral space. tables 4 and 5 show a systematic approach of pre-treating the calibration sample spectra. the pre-treatments were applied and investigated for their ability to remove sampling variation from the spectra. the plus sign indicates that the samples from these formula- tions can be separated into distinct clusters from the other samples of a different formulation using the associated pre- treatment, whereas the minus sign indicates that samples from these formulations cannot be separated into distinct clusters from the other samples of a different formulation using the associated pre-treatment. the contribution of the variability from the samples can be seen in figures 3(a), 3(b) and 3(c). the baseline offset is evident in the exploded plot of the spectra in figure 4 and the effect of smoothing and deriv- ative pre-treatment is also evident as well (note: the number of data points applied for smoothing the spectra during deri- vatisation were investigated and 21 data points was found to be optimal for discerning spectral features of the four formulations in spectral space). particularly noticeable at the wavelength 1440nm, one can observe a sharp band in the untreated spectra, which is attributed to Innovator a samples (advil) [note: this is due to the sucrose in the formulation as indicated in table 2. the origin of this unique band is discussed by davies and Miller.12 the impact of band reso- lution from dispersive and ft is most noticeable in figure 4, which is the expanded view of the first derivative and smoothed spectra. the result of applying the Savitzky–Golay first derivative, 21 point smoothing filter is that it changes the directions of the of the Generic a and Generic B spectra relative to both the Innovator a and Innovator B spectra in the region between 1400nm and 1500nm. Principal component calculations the pca models [either by Mahalanobis distance (Md) for the foss and BĂźchi instruments or euclidian distance (ed) for the Bruker instrument) resulted in correct and accurate predictions of all three validation sets. Mahalanobis distance is the statistical distance taking into account the variance of each variable and the correlation coefficients. In the case of a single variable, it is the square of the distance (between two objects, or between an object and the centroid) divided by the variancea euclidian distance is simply the distance between Laboratory/instrument Experimental samples Innovator manufacturer Advil Innovator manufacturer Motrin Generic manufacturer CVS Generic manufacturer Rite Aid Total united States pharmacopeia/ foss calibration Set 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 192 test Set 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 64 Validation set 20 tabs×2 lots=40 14 tabs×2 lots=28 20 tabs×2 lots=40 20 tabs×2 lots=40 148 united States pharmacopeia/ Bruker calibration Set 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 192 test Set 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 64 Validation set 20 tabs×2 lots=40 14 tabs×2 lots=28 20 tabs×2 lots=40 20 tabs×2 lots=40 148 fda/Irvine pharmaceutical Services, Inc./ BĂźchi calibration Set 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 192 test Set 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 64 Validation set 20 tabs×2 lots=40 20 tabs×2 lots=40 20 tabs×2 lots=40 20 tabs×2 lots=40 160 total 312 288 312 312 1224 Table 3. Study design. Figure 1. Ibuprofen (200mg) tablets. Upper left, Advil, upper right, Motrin, lower left CVS, and lower right, Rite Aid.
  • 6. 238 An NIR Comparison of Method Development Approaches Using a Drug Product two variables and is calculated as the arithmetic difference, i.e. variable 1–variable 2. for a bi-variate model, the squared distances between two vectors in multi-dimensional space are the sum of squared differences in their coordinates. as noted previously, the algorithms used by each software were different by the calculations employed for score distances. the signifi- cance of this difference is that, even though the distances were calculated in the principal component space (pc space) for the foss and BĂźchi calibration sets, and in wavelength space for the Bruker calibration set, on similarly pre-treated spectra, using the same wavelength range, and on different number of principal components, the models gave identical predictions for the validation sets (see table 7). the reason that the pca calculation is invariant in either wavelength space or pc space is explained by de Maesschalck et al.6 the supervised training by pca, a discriminant analysis technique, relies on the measurement of distances between objects in order to achieve classification. as noted by de Maesschalck et al., this can occur in wavelength space or in pc space, and on normalised or unnormalised spectra. another factor that de Maesschalck et al. point out, that may not be obvious to a casual user of nIr techniques, is that “the Md and the ed can also be calculated using a smaller number of latent variables (pcs) obtained after pca analysis instead of the original variables. In this case, the Md, however, does not need to correct for the covariance between the vari- ables, since pcs are by definition orthogonal (uncorrelated). However, the way each of the residual pcs is weighted in the computation of the distance must be taken into account.” the paper goes on to clarify the relationship between the ed and the Md, particularly how each is calculated in the original variable space and the pc space. It should be noted here that the Bruker software, opuS 5.5, utilises an approach referred to as factorisation, which is explained as a spectral distance calculation. table 7 gives the model values for each calibra- tion from each instrument and the camo software. figure 4 shows the expanded view of the derivative spectra, Spectra treatment (1000–2500nm) Cluster Innovator A Innovator B Generic A Generic B untreated spectra + + – – Baseline corrected + + + + first derivative + – + + Second derivative + + – – Baseline corrected first derivative + + – – Baseline corrected Second derivative + – + + * pre-treatments applied to the raw spectra that resulted in a separate and distinct cluster for any member of the four sample sets following principle com- ponent analysis are denoted by a plus sign, and pre-treatments that did not result in separate and distinct clusters are denoted by a minus sign. plus signs, in bold, indicates that all four sample sets were successfully separated Table 4. Data pretreatments from 1000nm to 2500nm. Spectra treatment (1400–1500nm) Cluster Innovator A Innovator B Generic A Generic B untreated spectra + – – – Baseline corrected + + – – first derivative + + – – Second derivative + – – – Baseline corrected first derivative + + + + Baseline corrected Second derivative + – – – ** See footnote from table 4 Table 5. Data pre-treatments from 1400nm to 1500nm. a from a glossary provided with the permission of Bryan prazen of the Synovec Group, department of chemistry, university of Washington. found at http://www.spectroscopynow.com/coi/cda/detail.cda?id=101 18&type=educationfeature&chId=9&page=1
  • 7. A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 239 Unscrambler 9.7 Vision 3.4 OPUS 5.5 NIRCal 5.2 calibration set 191 191* 192 192 test set 64 64 64 64 Validation set 148 148 148 160 raw spectra range 400–2499.50nm 400–2499.50nm 11999.86–3999.952cm–1 4000–10000cm–1 number of raw data points 4200 4200 2075 1501 Baseline transform** Baseline offset f(x)=x–min(X) (1000–2499.50nm) Math treatments baseline correction (1000nm) Manipulate offset normalisation† after transforming spectra from wavenumber to wavelength (1000–2500nm) add constant, • constant=–1000 Shift neg to • 0, 2500–1000 (total 1501/1501) n/a n/a change from wavenumber to wave- length change from wavenumber to wavelength n/a n/a n/a absorbance log10(1/×) Savitzky–Golay first derivative (SG 1st deriv.) (1000–25000nm), using a 3rd order polynomial, 21 point smoothing Modify /transform/ derivatives/S. Golay/ variables (1000nm–2500nm)/ 1st derivative/ 21 points smoothing/ 3rd order polynomial Math treatments Savitzky–Golay first derivative/ region minimum/1000nm/ region maxi- mum/ 2500nm/21 point, cubic spline polynomial evaluate/Setup Identity test method/ load method/ parameters/ preprocessing/first derivative 21 points/ regions/7695.18cm–1 – 6248.72cm–1 , cubic spline polynomial linear filter (84075, 10032, –43284, –78176, –96947, –101900, –95338, –79564, –56881, –29592, 0, 29592, 56881, 79564, 95338, 101900, 96947, 78176, 43284, –10032, –84075, 3634092) pca task Mahalanobis distance in principal component Space/1400–1500nm Mahalanobis distance in prin- cipal component space/spectral filtering/wavelength min.1400nm/ wavelength max. 1500nm pcs: 5 threshold: probability level: 0.999 factorisation: six factors (in wavelength space) threshold: 0.25/1400– 1500nm Mahalanobis distance in principal component Space/1400–1500nm pcs:3 * only seven tablets were packaged for sample 6ufGB (lot 6 uSp/foss/Generic B–rite aid), all other lots from ufGB: 1,2,3,4,5,7 and 8 were packaged to contain eight tablets, six for the calibration set and two for the test set. ** Since the baseline algorithms were all significantly different and do not give sufficiently similar spectral pre-treatments, it was decided to drop this step and proceed directly to the derivative/smoothing step on all calibration data. † the attempt to use normalisation to match the baseline correction from the Vision 3.4 and unscrambler 9.7 software was met with another unexpected obstacle which forced the investigators to not use a baseline correction as the first step of pre-treating the calibration spectra. the normalisation pre- treatment algorithm resides in a different module than the Ident module where the calibration model resides. In order to carryout the pre-treatment on the calibration and subsequent unknown samples, a macro that would pre-process the spectra and automatically load those spectra into opuS Ident as well as being able to analyse an unknown sample using offset normalisation spectral pre-processing and automatically output a result, would be required. Since this is only a requirement of the opuS 5.5 software, it was decided to drop this step and proceed directly to the derivative/smoothing step on all calibration data. Table 6. Model values for each calibration.
  • 8. 240 An NIR Comparison of Method Development Approaches Using a Drug Product 2b Innovator B Innovator A Generic B Generic A 2a Innovator A Innovator B Generic B Generic A 2c Innovator A Innovator B Generic B Generic A Figure 2. (a) BĂźchi PCA Score Plot (Unscrambler 9.7) using, (b) Foss PCA Score Plot (Unscrambler 9.7) and (c) Bruker PCA Score Plot (Unscrambler 9.7). (a) (b) (c)
  • 9. A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 241 3 log (1/R) Innovator A (b) Log (1/R) Wavelength Log (1/R) (c) Figure 3. (a) BĂźchi raw spectra plot (NIRCal 5.2), (b) Foss raw spectra plot (Vision 3.4) and (c) Bruker raw spectra plot (OPUS 5.5). (a) (b) (c)
  • 10. 242 An NIR Comparison of Method Development Approaches Using a Drug Product Figure 4. Expanded view of BĂźchi, Foss, and Bruker Derivative Spectra of Calibration and Test Set BĂźchi (a) (c) (d) (b) BĂźchi 18 Figure 4. Expanded view of BĂźchi, Foss, and Bruker Derivative Spectra of Calibration and Test Set BĂźchi Foss (a) (b) (c) (d) Foss .0010 .0005 0 .0005 .0010 .0015 .0020 1400.61nm1411.29nm1422.12nm1433.13nm1444.31nm1455.66nm1467.19nm1478.91nm1490.82nm Variables Line Plot Bruker (a) (b) (c) (d) Bruker Figure 4. Expanded view of BĂźchi, Foss and Bruker derivative spectra of calibration and test set: (a) Innovator A, (b) Generic A, (c) Generic B and (d) Innovator B.
  • 11. A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 243 Validation results and discussion table 7 lists a summary of the results of the validations sets from the, Bruker, BĂźchi and foss instruments, respectively. the results demonstrate that all four objectives of the study were met. four distinct computerised algorithms of pca and factorisation were used to construct three separate spectral libraries from a common calibration and test set, each residing on different instruments in different laborato- ries and one residing on stand-alone software, the referee model. the use of the “referee” model helped establish the baseline values for the model parameters such as spectral range, spectra pre-treatment, calibration algorithms, etc. that could be used across the various software platforms despite the variations in instrument bandwidth, spectral data points, algorithms for smoothing, derivative and other calcu- lations. Specifically, while it was found that a pca model based on calculating Mahalanobis distance in pc space, second derivative Gap 20, second order polynomial was suffi- cient to model the calibration and test set on two of the three instruments, it became evident early on in the experiment that by using exactly the same model consisting of the same algorithm calculation and same pre-treatment routine was not possible. However, by the use of the referee software which, when loaded with the calibration set from all three instruments, one could easily determine how to optimise the spectral range, and smoothing and derivative pre-treatment parameters in order to achieve similar calibration and test set parameters. the limiting calibration setting was found to be the polynomial function, which effects how many points are used to calculate the smoothing and derivative function. as was shown in figure 4, there is a very narrow spectral range (100 nm or 1446.46 cm–1 on which separation can be made on the four data sets in spectra space. Having met the first objective with some deviations, the second, third and fourth objectives were easily met, as the three models were successful at predicting the vali- dation sets for each instrument, resulting in four distinct clusters in multidimensional space, each cluster repre- senting the innovator and generic brand Ibuprofen formula- tions. additionally, samples from Generic a lots 6Ge0118 (cVS Brand) and p42058 (rite aid) were correctly identified as not belonging to Generic a lots comprising the cali- bration sets for all three data sets. It was observed that these samples have a distinct banding pattern in the region from 1400 nm–1500 nm from all other sample spectra. the most likely cause of this is assuredly due to the presence of a component that absorbs in the nIr region not found in those samples comprising the calibration set. as a result of the new band in the critical region between 1400–1500 nm, these samples also produce a fifth separate and distinct cluster in pc space. It must be mentioned here that this experiment, being performed in different laboratories within different organisa- tional cultures, was totally driven by a protocol that was jointly crafted and agreed upon prior to execution. the authors feel that this is a key point since this experiment was designed to meet specific objectives, despite the fact that the instruments, software and personnel were at different locations. this, of course, was not the major factor. the major factor was trying to coordinate all of the steps within the protocol from within different organisations. While nIr experiments are generally described as non-destructive, fast and cost effective, when done on a large scale they require planning, discussion and coordination. this is rarely mentioned. It is hoped that this example may serve as a model for future applications that involve large sample sets and multi-organisations using multiple instrument–software combinations. the current global pharmaceutical counterfeiting problem is one area that should benefit from examples like the one demonstrated in this paper. Conclusions the specific objective of the study was to obtain log 1/R spectra of four formulations of Ibuprofen (200mg) from two branded Instrument Lot Innovator A tablets Innovator B tablets Generic A Generic B foss 9 20 20 0 20 10 20 8 0 20 Bruker 9 20 20 0 20 10 20 8 0 20 BĂźchi 9 20 20 0 20 10 20 20 0 20 twenty tablets were tested for each combination of instrument and lot, except for lot 10 of Innovator B where only eight were tested on the foss and Bruker instruments Table 7. Number of tablets identified.
  • 12. 244 An NIR Comparison of Method Development Approaches Using a Drug Product and two store-branded Ibuprofen (200mg) immediate-release tablets and use them to design, develop, validate and deploy a calibration model that can subsequently be used to correctly classify by discriminant analysis using pca, log 1/R spectra from unknown samples (validation set) on nIr instruments of varying types and software configurations. this experiment was designed to study the impact that nIr instrument hardware and software configurations have on nIr method development. Several variables were detected and assessed. Spectrometer types, sample holders, spec- tral acquisition settings, data pre-treatments and pca algo- rithms were studied. nIr method development was attempted by three different analysts on three different instruments located in two different laboratories. It was determined that even though identical samples were used for model- ling and prediction, and the same calibration approach was tried on the accompanying software, spectra differences were observed due to the number of data points, and that these impact the ability to perform the same or similar data pre- treatments in different software. furthermore, the algorithms employed in each software platform may limit the ability to deploy a method developed on any single software platform to be deployed across different software platforms. However, despite the differences observed, it was possible to find a common method using each software that enabled accurate predictions of the validation samples when each model was used independent of instrument and software configuration. Knowing the sources of variability that impact the log 1/R nIr spectrum will minimise the overall prediction varia- bility and increase the likelihood of correctly classifying by discriminant analysis, the log 1/R spectra from unknown samples subsequently measured and compared to the spec- tral library and classified by the calibration model, when model parameters are used on different instrument and software combinations. It was found that using Savitzky–Golay, first derivative, 21 point smoothing, third order polynomial, pre-treated spectra and either pca or factorisation model (either by Md in prin- cipal component space for the foss and BĂźchi or ed in wave- length space for the Bruker factorisation method) resulted in different models but possessing the same accuracy capa- bilities (100%) for predicting samples comprising similar validation sets. one validation sample set, a store-branded Ibuprofen (200 mg) immediate-release tablet, was correctly identified as not belonging to the samples represented in the calibration set by all three models. Based on these results, and despite differences in instrument configuration [disper- sive or fourier transform (ft)], number of spectral data points, pca or factorisation algorithms and validation model- ling approach, exact and accurate spectroscopic results can be achieved using nIr spectroscopy for discriminate analysis. More importantly, this study shows that the same nIr method spectral range and pre-treatment parameters can be used and that nearly the same multivariate models can be obtained, despite instrumental and software differences, to accurately predict the identity of pharmaceutical dosage forms. Acknowledgements the authors are grateful for the contributions to this project from each of the following scientists: darrell abernathy, rebecca allen, todd cecil, Walter Hauck, andrea Iwanik, Steven lane, Samir Wahab and patricia White from uSp, rudy flach, charles petersen and Heather coffin from Irvine, William Martin from the fda, Michael Surgeary from BĂźchi, cynthia Kradjel from Integrated technical Solutions and Verne Hebard from Bruker. References 1. p. de peinder, M.J. Vredenbregt, t. Visser and d. de Kaste, “detection of lipitor counterfeits: a comparison of nIr and raman spectroscopy in combination with che- mometrics”, J. Pharmaceut. Biomed. Anal. 47, 688 (2008). doi: 10.1016/j.jpba.2008.02.016 2. J. luypaerta, d.l. Massart and y. Vander Heyden, “near-infrared spectroscopy applications in pharma- ceutical analysis”, Talanta 72, 865 (2007). doi: 10.1016/j. talanta.2006.12.023 3. y. roggo, p. chalusa, l. Maurera, c. lema-Martineza, a. edmonda and n. Jenta, “a review of near infrared spec- troscopy and chemometrics in pharmaceutical technolo- gies”, J. Pharmaceut. Biomed. Anal. 44, 683 (2007). doi: 10.1016/j.jpba.2007.03.023 4. a.K. deisingh, “pharmaceutical counterfeiting”, Analyst 130, 271 (2005). doi: 10.1039/b407759h 5. c.a. anderson, J.K. drennen and e.W. ciurczak, “pharmaceutical applications of near infrared spec- troscopy”, in Handbook of Near-Infrared Analysis”, 3rd edn, (practical Spectroscopy Series Volume 35) ed by d.a. Burns and e. W. ciurczak. crc press, Boca raton, florida, uSa, p. 585 (2008). 6. r. de Maesschalck, d. Jouan-rimbaud and d.l. Massart, “tutorial—the Mahalanobis distance”, Chemometr. Intell. Lab Syst. 50, 1 (2000). doi: 10.1016/ S0169-7439(99)00047-7 7. USP general information chapter <1119> Near-Infrared Spectroscopy (http://www.usp.org/uSpnf/revisions/ usp31nf26secondSupplement01.html. 8. S.H.f. Scafi and c. pasquini, “Identification of counterfeit drugs using near-infrared spectroscopy”, Analyst 126, 2218 (2001). doi: 10.1039/b106744n 9. J. Workman Jr and J. Brown, “a new standard practice for multivariate, quantitative infrared analysis-part I”, Spectroscopy 11(2), 48 (1996). 10. J. Workman Jr and J. Brown, “a new standard practice for multivariate, quantitative infrared analysis-part II”, Spectroscopy 11(9), 24 (1996). 11. a. Savitzky and M.J.e. Golay, “Smoothing and differentiation of data by simplified least squares pro- cedures,” Anal. Chem. 36, 1627 (1964). doi: 10.1021/ ac60214a047