A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms

Journal
of
near
Infrared
SpectroScopy
233
ISSn: 0967-0335 © IM publications llp 2009
doi: 10.1255/jnirs.854 all rights reserved
A comparison of near infrared method
development approaches using a drug
product on different spectrophotometers
and chemometric software algorithms
Assad Kazeminy,a
Saeed Hashemi,a
Roger L. Williams,b
Gary E. Ritchie,c
Ronald Rubinovitzd
and Sumit Sene,*
a
Irvine pharmaceutical Services, Inc., 10 Vanderbilt, Irvine, ca 92618, uSa
b
united States pharmacopeial convention, 12601 twinbrook parkway, rockville, Maryland 20852, uSa
c
former united States pharmacopeial convention, 12601 twinbrook parkway, rockville, Maryland 20852, uSa
d
Büchi corporation, 19 lukens drive, new castle, de 19720, uSa
e
united States food and drug administration, 19701 fairchild, Irvine, ca 92612, uSa. e-mail: sumit_sen@hotmail.com
It is well known that spectral variability in near infrared (NIR) spectroscopy can be attributed to the analyst, sample, sample positioning,
instrument configuration and software (in both algorithm formats and structures used as well as in the execution of data pre-treatment
and analysis). It is often acknowledged that the single largest factor impacting NIR results is sample presentation. However, what is
obvious but not often acknowledged is that there are instrumental and software differences as well. These differences, evident in the
quality of the spectra, may impact the chemometrics that are subsequently performed and, possibly, the results obtained from the
multivariate statistical models. In order to investigate just what are these sources of variability, and just how much these variations
may impact the results of the multivariate models for predicting the identification of pharmaceutical dosage forms, a study has been
conducted. To the authors’ knowledge, no other systematic study of this kind has been published. In this study, we are interested in
learning what variability, if any, the choices for instrument and software have on the development of a NIR method for the identification
of pharmaceutical dosage forms. Furthermore, we would like to learn what and how do the choices made early on in the experimental
design impact the final quality of the spectra and the resulting multivariate models obtained from these spectra. A study protocol was
designed, using a common data set consisting of four formulations of Ibuprofen, involving three investigating parties, namely, US FDA,
USP and Irvine Pharmaceutical Services and using three NIR instruments, namely (listed in alphabetical order), a Bruker spectrometer,
a Büchi spectrometer and a Foss spectrometer. Based on the results and despite differences in instrument configuration [dispersive or
Fourier Transform (FT)], number of spectral data points, principal components analysis (PCA) or factorisation algorithms, and valida-
tion modelling approach, exact and accurate spectroscopic results can be achieved using NIR spectroscopy for discriminate analysis.
More importantly, this study shows that the same NIR method spectral range and pre-treatment parameters can be used, and that
nearly the same multivariate models can be obtained, despite instrumental and software differences, to accurately predict the identity
of pharmaceutical dosage forms.
Keywords: near infrared (NIR) spectroscopy, instrument variability, chemometric software algorithms, multivariate discriminant analysis, PCA
A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009)
received: 9 June 2009 n revised: 30 September 2009 n accepted: 11 october 2009 n publication: 30 october 2009

234 An NIR Comparison of Method Development Approaches Using a Drug Product
It has been successfully demonstrated that near infrared
(nIr) spectroscopy can be used for the analysis of pharma-
ceutical products using a single instrument and software
configuration.1–6
the focus of this study was to determine
if the variability observed across different nIr instruments
and chemometric software packages can be an impediment
for the deployment of this technique if different instrument
and software types are to be used for the development of nIr
models.
the united States food and drug administration (uS
fda), the united States pharmacopeia (uSp) and Irvine
pharmaceutical Services, have undertaken a feasibility
study using nIr spectroscopy to identify, by the chemometric
method of multivariate discriminate analysis using principal
component analysis (pca) and factorisation, four formula-
tions of Ibuprofen (200 mg): two branded (advil and Motrin)
and two store-branded (cVS and rite aid) Ibuprofen (200mg)
immediate-release tablets, using three different instrument/
software combinations. the instruments used included
(listed in alphabetical order), a Bruker Vector 22n ft-nIr
spectrometer, Bruker optik GmbH, ettlingen, Germany, a
Büchi nIrflex Solids, flawil, Switzerland and a foss XdS
rapid content analyser, laurel, uSa. the software used from
each instrument, respectively, were opuS 5.5, nIrcal 5.2 and
Vision 3.4. the unscrambler 9.7, a stand-alone multivariate
analysis and experimental design software package, was
used as referee software to assist in developing a common
model. the Bruker Vector 22n ftnIr spectrometer and the
foss XdS rapid content analyser used for this study are
both located at the uSp, and the Büchi nIrflex Solids spec-
trometer was located at the Irvine pharmaceutical Services.
a study protocol was designed, using a common data set
consisting of these four formulations of Ibuprofen, involving
the three investigating parties, using the above-mentioned
different nIr instrument/software combinations. to our
knowledge, this is the first study of its kind investigating
the variability of nIr instrument/software combinations and
their impact on chemometric models for discriminate drug
analysis.
there were four specific objectives of the study:
(1) demonstrate that by the use of the computerised algo-
rithms of pca and factorisation, three identical, or nearly
identical, spectral libraries can be obtained on three different
instruments. these spectral libraries will be comprised of
the same samples drawn from the same manufactured lots,
scanned the same way and modelled using the same software
settings.
(2) compare four formulations of Ibuprofen: (200 mg)
two branded and two store-branded Ibuprofen (200 mg).
Immediate-release tablets will result in four distinct clusters
in multi-dimensional space.
(3) demonstrate that the calibration models are able to analyse
unknown tablets (validation set) and determine if they are
ibuprofen (200mg) immediate-release tablets from one of the
four known sources of origin that constitute the libraries or
are not from any of these known sources.
(4) the results from each analysis of unknown samples should
give the same predicted results, indifferent of the model used,
and despite instrument, spectra, algorithm, or number of cali-
bration samples, demonstrating that a protocol can be used in
each laboratory, independent of location, device, analyst and
software used to perform the identification.
Experimental
Instrumentation and software
the instruments that were used included a Bruker Vector 22n
ft-nIr spectrometer with a spectral range of 4000–12000cm–1
and spectral resolution set at 8 cm–1
(Bruker optik GmbH,
ettlingen, Germany), Büchi nIrflex Solids with a spectral
range 1000–2500nm (10,000–4000cm–1
) and spectral resolu-
tion set at 8cm–1
(Büchi, flawil, Switzerland) and foss XdS
rapid content analyser with a spectral range of 400–2500nm
(25,000–4000cm–1
) and a spectral resolution set by the speci-
fied bandpass of 9nm (foss nIrSystems, Inc., laurel, Md, uSa).
the software used for each instrument were, respectively, opuS
5.5, nIrcal 5.2, and Vision 3.4. the unscrambler 9.7, a stand-
alone multivariate analysis and experimental design software
package (caMo technologies Inc., trondheim norway), was
used as referee software to assist in developing a common
model.
Validation of instrument operation
tests for wavelength accuracy, photometric precision, accu-
racy and noise were performed on all of the instruments prior
to tablet measurements as recommended by the uSp general
information chapter <1119> near-Infrared Spectroscopy
(http://www.usp.org/uSpnf/revisions/usp31nf26secondSup-
plement01.html.7
Vendor recommended performance tests for
each instrument were run as well. the tests are routine quality
control tests of the instrument performance and as such were
performed at the prescribed intervals to verify correct instru-
ment performance.
Sample sets
table 1 lists the sources for the samples used for the study. ten
lots of four formulations of Ibuprofen (200mg) were obtained:
two branded (advil, Motrin) and two store-branded (cVS, rite
aid) Ibuprofen (200mg) immediate release tablets. the formu-
lations for each are shown in table 2. three sets of samples
were prepared for each instrument. these sets comprise the
calibration set, test set and validation set. for this study, manu-
facturer lots 1–8 were used for the calibration set and test set
and manufacturer lots 9 and 10 were set aside and used for the
validation sets. While one of the purposes of the protocol was
to control the preparation of the calibration, test, and validation
sample sets, it was found later that the calibration set prepared
Introduction

A. Kazeminy et al., J. Near Infrared Spectrosc. 17, 233–245 (2009) 235
for the foss instrument was missing one tablet (see table 6
footnote) and the validation set for the foss instrument and
the Bruker instrument were different from the validation set
prepared for the Büchi instrument, containing only 28 tablets
from the Motrin validation set instead of 40 tablets originally
intended, all due to packaging errors. this error had no effect
on the outcome of the study (model prediction accuracy of vali-
dation samples) as will be shown later in this paper. the sample
sets were prepared as shown in table 3.
Sample collection and analyses
the study protocol was modelled on that described by Scafi
and pasquini.8
the multivariate chemometric approach to
calibrate, validate and maintain the spectral libraries was
based on work proposed by Workman and Brown.9,10
log
1/R (sometimes referred to as absorbance) spectra from the
Bruker and foss instrument, and reflectance spectra from
the Büchi instrument, were collected from each tablet for
the calibration and test sets by averaging 32 scans in diffuse
reflectance mode against a 99% reflectance background
reference. the spectra were collected across the following
ranges from each instrument: Bruker samples were
scanned from 11,999cm–1
to 3999cm–1
(833nm to 2500nm),
Büchi samples were scanned from 10,000cm–1
to 4000cm–1
(1000 nm to 2500 nm) and using the foss instrument, each
tablet was scanned from 400nm to 2500nm, photographs of
the tablets are shown in figure 1. the unscrambler 9.7 was
used to investigate the foss calibration set spectral proper-
ties that could be used to distinguish each formulation. tables
4 and 5 show the series of spectral pre-treatments that were
tested for use in order to achieve a pca that could be used
to determine the identity of the samples from the validation
sets. the test sets were used to optimise the prediction of
the final models before being used to predict the validation
sets. the Bruker software, opuS 5.5, uses the factorisa-
tion method to model and so the calibration set from this
instrument, while subjected to pca in the unscrambler 9.7,
required a different approach when constructed in the opuS
5.5 software. table 6 lists the model values for the calibration
modelling approach.
Development of NIR
calibrations
Based on observations from the unscrambler 9.7 exploration,
a Savitzky–Golay, first derivative, 21 point smoothing, 3rd order
polynomial, from 1000nm to 2500nm and a pca model, using
Mahalanobis distance in the wavelength range from 1400–
1500nm, was postulated as the optimal model to apply to the
Bruker, Büchi and foss calibration data sets. figures 2(a), 2(b)
and 2(c) show that four clusters can easily be separated from
each other in all three data sets. the following discussions
Lot Brand—Advil Brand—Motrin Generic—CVS Generic—Rite Aid
1 B946681 pca189 6ee0102 p45032
2×200=400 tablets 3×100=300 tablets 2×250=500 tablets 1×500=500 tablets
2 B87154 lla103 6He0515 p44389
3 B94669 pca112 7Be0119 p424477
4 B27624 pca226 7ae0039 p44686
5 B91364 pBa123 7ce0268 p14360
2×150=300 3×100=300 tablets 1×750=750 tablets 3×100=300 tablets
6 B73322 pBa194 7ae0699 p42476
7 B33863 paa016 6le0478 p42498
8 B91414 pBa186 7ae0270 p44151
9 B98483 pea106 6Ge0118 p44688
10 B91386 lla329 7Be0606 p42058
Table 1. Sources of Ibuprofen (200mg) samples used for this study.

track the model development in the opuS 5.5, nIrcal 5.2 and
Vision 3.4 software packages.
Pre-treatment I: baseline correction
Based upon the exploration of data pre-treatments shown in
tables 4 and 5, a baseline correction using the function:
f(x)=x–min(X),
where x is a variable and X denotes all selected variables for
this sample, was first applied in the unscrambler 9.7. a base-
line correction was shown to be the simplest pre-treatment
applied to the calibration data sets to effect separation of all
four formulations, although barely. furthermore, it was deter-
mined that while it is possible to apply a baseline transform
function to the foss calibration set using the Vision 3.4 soft-
ware, the same mathematical transformation was not found
to be available in the Bruker or the Büchi software.
using the Bruker opuS 5.5 software, the closest approxima-
tion to the baseline offset function used in the unscrambler 9.7
would be a normalisation function. this would then be followed
by the Savitzky–Golay, first derivative, 21 point smoothing,
3rd order polynomial, from 1000nm to 2500nm, followed by
factorisation (which is a Bruker pca algorithm using euclidian
distance within the opus Ident set-up module where the cali-
bration set resides. It was further noted that the default first
derivative transformation for the opuS 5.5 software is a cubic
polynomial equation.
the Büchi nIrcal 5.2 software required the use of several
spectra manipulations; to reverse the x-axis, orient the spectra
in the same direction as they are for the foss and Bruker data,
prior to performing the pca to the data [figures 3(a), 3(b) and
3(c)]. the Savitzky–Golay first derivative transformation for
the nIrcal 5.2 software is also a cubic polynomial equation.
In addition to other derivative functions, the nIrcal software
allows the use of customisable linear filters, which allowed
matching to the same derivative treatment used in the other
software packages. the resulting filters for the Büchi calibra-
tion set are listed in table 6.
Advil Motrin CVS-Ibuprofen Rite Aid-Ibuprofen
acelytated monoglycerides
Beeswax
propylene glycol
cellulose
lactose
Sodium starch glycolate
ethoxyethanol
lecithin
pharmaceutical glaze
povidone
Semithicone
Sodium benzoate
Sodium lauryl sulphate
Sucrose
fd&c yellow #6
Magnesium stearate
polydextrose
polyethylene glycol
fd&c yellow #6
Magnesium stearate
polydextrose
polyethylene glycol
croscarmellose sodium
Microcrystalline cellulose
pharmaceutical shellac
pregelatinised starch
carnauba wax
Shellac
ppregelatinised starch
carnauba wax
Hypromellose
carnauba wax
Hypromellose
croscarmellose sodium
Microcrystalline cellulose
Hypromellose
Iron oxides
Silicon dioxide
corn starch
Ibuprofen
Stearic acid
titanium dioxide
Iron oxides
colloidal silicon dioxide
corn starch
Ibuprofen
Stearic acid
titanium dioxide
fumed silica gel
corn starch
Ibuprofen
Stearic acid
titanium dioxide
Iron oxides
colloidal silicon dioxide
corn starch
Ibuprofen
Stearic acid
titanium dioxide
Table 2. Formulations of Ibuprofen (200mg) samples.

Pre-treatment II: Savitzky–Golay first
derivative, 21 point smoothing, cubic
polynomial
the smoothing and differentiation of paired data by the proce-
dure of simplified least squares, now called the Savitzky–
Golay filter,11
was developed for the removal of the random
noise from paired data sets, such as that obtained for nIr,
log 1/R versus wavelength. for the data sets from this study,
it can be shown that the calibration by pca or factorisation is
most affected by sampling variation manifested as baseline
offsets. the offsets result in significant overlap of spectra
with closely resembling spectral features, which cannot be
completely resolved in spectral space. tables 4 and 5 show
a systematic approach of pre-treating the calibration sample
spectra. the pre-treatments were applied and investigated
for their ability to remove sampling variation from the spectra.
the plus sign indicates that the samples from these formula-
tions can be separated into distinct clusters from the other
samples of a different formulation using the associated pre-
treatment, whereas the minus sign indicates that samples
from these formulations cannot be separated into distinct
clusters from the other samples of a different formulation
using the associated pre-treatment. the contribution of the
variability from the samples can be seen in figures 3(a), 3(b)
and 3(c). the baseline offset is evident in the exploded plot of
the spectra in figure 4 and the effect of smoothing and deriv-
ative pre-treatment is also evident as well (note: the number
of data points applied for smoothing the spectra during deri-
vatisation were investigated and 21 data points was found
to be optimal for discerning spectral features of the four
formulations in spectral space). particularly noticeable at
the wavelength 1440nm, one can observe a sharp band in the
untreated spectra, which is attributed to Innovator a samples
(advil) [note: this is due to the sucrose in the formulation
as indicated in table 2. the origin of this unique band is
discussed by davies and Miller.12
the impact of band reso-
lution from dispersive and ft is most noticeable in figure
4, which is the expanded view of the first derivative and
smoothed spectra. the result of applying the Savitzky–Golay
first derivative, 21 point smoothing filter is that it changes
the directions of the of the Generic a and Generic B spectra
relative to both the Innovator a and Innovator B spectra in the
region between 1400nm and 1500nm.
Principal component calculations
the pca models [either by Mahalanobis distance (Md) for
the foss and Büchi instruments or euclidian distance (ed)
for the Bruker instrument) resulted in correct and accurate
predictions of all three validation sets. Mahalanobis distance
is the statistical distance taking into account the variance of
each variable and the correlation coefficients. In the case of
a single variable, it is the square of the distance (between two
objects, or between an object and the centroid) divided by the
variancea
euclidian distance is simply the distance between
Laboratory/instrument Experimental
samples
Innovator
manufacturer
Advil
Innovator
manufacturer
Motrin
Generic
manufacturer
CVS
Generic
manufacturer
Rite Aid
Total
united States
pharmacopeia/
foss
calibration Set 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 6 tabs×8 lots=48 192
test Set 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 2 tabs×8 lots=16 64
Validation set 20 tabs×2 lots=40 14 tabs×2 lots=28 20 tabs×2 lots=40 20 tabs×2 lots=40 148
united States
pharmacopeia/
Bruker
fda/Irvine pharmaceutical
Services, Inc./
Büchi
total 312 288 312 312 1224
Table 3. Study design.
Figure 1. Ibuprofen (200mg) tablets. Upper left, Advil, upper
right, Motrin, lower left CVS, and lower right, Rite Aid.

two variables and is calculated as the arithmetic difference,
i.e. variable 1–variable 2. for a bi-variate model, the squared
distances between two vectors in multi-dimensional space are
the sum of squared differences in their coordinates. as noted
previously, the algorithms used by each software were different
by the calculations employed for score distances. the signifi-
cance of this difference is that, even though the distances were
calculated in the principal component space (pc space) for the
foss and Büchi calibration sets, and in wavelength space for
the Bruker calibration set, on similarly pre-treated spectra,
using the same wavelength range, and on different number of
principal components, the models gave identical predictions
for the validation sets (see table 7). the reason that the pca
calculation is invariant in either wavelength space or pc space
is explained by de Maesschalck et al.6
the supervised training by pca, a discriminant analysis
technique, relies on the measurement of distances between
objects in order to achieve classification. as noted by de
Maesschalck et al., this can occur in wavelength space or
in pc space, and on normalised or unnormalised spectra.
another factor that de Maesschalck et al. point out, that may
not be obvious to a casual user of nIr techniques, is that
“the Md and the ed can also be calculated using a smaller
number of latent variables (pcs) obtained after pca analysis
instead of the original variables. In this case, the Md, however,
does not need to correct for the covariance between the vari-
ables, since pcs are by definition orthogonal (uncorrelated).
However, the way each of the residual pcs is weighted in the
computation of the distance must be taken into account.” the
paper goes on to clarify the relationship between the ed and
the Md, particularly how each is calculated in the original
variable space and the pc space. It should be noted here that
the Bruker software, opuS 5.5, utilises an approach referred
to as factorisation, which is explained as a spectral distance
calculation. table 7 gives the model values for each calibra-
tion from each instrument and the camo software. figure 4
shows the expanded view of the derivative spectra,
Spectra treatment
(1000–2500nm)
Cluster
Innovator A Innovator B Generic A Generic B
untreated spectra + + – –
Baseline corrected + + + +
first derivative + – + +
Second derivative + + – –
Baseline corrected
first derivative
+ + – –
Baseline corrected
Second derivative
+ – + +
*
pre-treatments applied to the raw spectra that resulted in a separate and distinct cluster for any member of the four sample sets following principle com-
ponent analysis are denoted by a plus sign, and pre-treatments that did not result in separate and distinct clusters are denoted by a minus sign. plus signs,
in bold, indicates that all four sample sets were successfully separated
Table 4. Data pretreatments from 1000nm to 2500nm.
Spectra treatment
(1400–1500nm)
Cluster
Innovator A Innovator B Generic A Generic B
untreated spectra + – – –
Baseline corrected + + – –
first derivative + + – –
Second derivative + – – –
Baseline corrected
first derivative
+ + + +
Baseline corrected
Second derivative
+ – – –
**
See footnote from table 4
Table 5. Data pre-treatments from 1400nm to 1500nm.
a
from a glossary provided with the permission of Bryan prazen of the
Synovec Group, department of chemistry, university of Washington.
found at http://www.spectroscopynow.com/coi/cda/detail.cda?id=101
18&type=educationfeature&chId=9&page=1

Unscrambler 9.7 Vision 3.4 OPUS 5.5 NIRCal 5.2
calibration set 191 191* 192 192
test set 64 64 64 64
Validation set 148 148 148 160
raw spectra range 400–2499.50nm 400–2499.50nm 11999.86–3999.952cm–1
4000–10000cm–1
number of raw
data points
4200 4200 2075 1501
Baseline
transform**
Baseline offset
f(x)=x–min(X)
(1000–2499.50nm)
Math treatments
baseline correction
(1000nm)
Manipulate
offset normalisation†
after transforming
spectra from
wavenumber to
wavelength
(1000–2500nm)
add constant,
•
constant=–1000
Shift neg to
•
0, 2500–1000
(total 1501/1501)
n/a n/a change from
wavenumber to wave-
length
change from
wavenumber to
wavelength
n/a n/a n/a absorbance log10(1/×)
Savitzky–Golay
first derivative
(SG 1st deriv.)
(1000–25000nm),
using a 3rd order
polynomial,
21 point smoothing
Modify /transform/
derivatives/S. Golay/
variables
(1000nm–2500nm)/
1st
derivative/
21 points smoothing/
3rd
order polynomial
Math treatments
Savitzky–Golay
first derivative/
region
minimum/1000nm/
region maxi-
mum/ 2500nm/21
point, cubic spline
polynomial
evaluate/Setup
Identity test method/
load method/
parameters/
preprocessing/first
derivative 21 points/
regions/7695.18cm–1
–
6248.72cm–1
, cubic
spline polynomial
linear filter (84075,
10032, –43284,
–78176, –96947,
–101900, –95338,
–79564, –56881,
–29592, 0, 29592,
56881, 79564, 95338,
101900, 96947, 78176,
43284, –10032,
–84075, 3634092)
pca task Mahalanobis
distance in principal
component
Space/1400–1500nm
Mahalanobis
distance in prin-
cipal component
space/spectral
filtering/wavelength
min.1400nm/
wavelength max.
1500nm
pcs: 5
threshold: probability
level: 0.999
factorisation: six
factors (in wavelength
space)
threshold: 0.25/1400–
1500nm
Mahalanobis
distance in principal
component
Space/1400–1500nm
pcs:3
*
only seven tablets were packaged for sample 6ufGB (lot 6 uSp/foss/Generic B–rite aid), all other lots from ufGB: 1,2,3,4,5,7 and 8 were packaged to
contain eight tablets, six for the calibration set and two for the test set.
**
Since the baseline algorithms were all significantly different and do not give sufficiently similar spectral pre-treatments, it was decided to drop this step
and proceed directly to the derivative/smoothing step on all calibration data.
†
the attempt to use normalisation to match the baseline correction from the Vision 3.4 and unscrambler 9.7 software was met with another unexpected
obstacle which forced the investigators to not use a baseline correction as the first step of pre-treating the calibration spectra. the normalisation pre-
treatment algorithm resides in a different module than the Ident module where the calibration model resides. In order to carryout the pre-treatment on the
calibration and subsequent unknown samples, a macro that would pre-process the spectra and automatically load those spectra into opuS Ident as well
as being able to analyse an unknown sample using offset normalisation spectral pre-processing and automatically output a result, would be required. Since
this is only a requirement of the opuS 5.5 software, it was decided to drop this step and proceed directly to the derivative/smoothing step on all calibration
data.
Table 6. Model values for each calibration.

2b
Innovator B
Innovator A
Generic B
Generic A
2a
Innovator A
Innovator B
Generic B
Generic A
2c
Innovator A
Innovator B
Generic B
Generic A
Figure 2. (a) Büchi PCA Score Plot (Unscrambler 9.7) using, (b) Foss PCA Score Plot (Unscrambler 9.7) and (c) Bruker PCA Score Plot
(Unscrambler 9.7).
(a)
(b)
(c)

3
log
(1/R)
Innovator A
(b)
Log
(1/R)
Wavelength
Log
(1/R)
(c)
Figure 3. (a) Büchi raw spectra plot (NIRCal 5.2), (b) Foss raw spectra plot (Vision 3.4) and (c) Bruker raw spectra plot (OPUS 5.5).
(a)
(b)
(c)

Figure 4. Expanded view of Büchi, Foss, and Bruker
Derivative Spectra of Calibration and Test Set
Büchi
(a)
(c)
(d)
(b)
Büchi
18
Figure 4. Expanded view of Büchi, Foss, and Bruker
Derivative Spectra of Calibration and Test Set
Büchi
Foss
(a)
(b)
(c)
(d)
Foss
.0010
.0005
0
.0005
.0010
.0015
.0020
1400.61nm1411.29nm1422.12nm1433.13nm1444.31nm1455.66nm1467.19nm1478.91nm1490.82nm
Variables
Line Plot
Bruker
(a)
(b)
(c)
(d)
Bruker
Figure 4. Expanded view of Büchi, Foss and Bruker derivative spectra of calibration and test set: (a) Innovator A, (b) Generic A, (c)
Generic B and (d) Innovator B.

Validation results and
discussion
table 7 lists a summary of the results of the validations sets
from the, Bruker, Büchi and foss instruments, respectively.
the results demonstrate that all four objectives of the study
were met. four distinct computerised algorithms of pca
and factorisation were used to construct three separate
spectral libraries from a common calibration and test set,
each residing on different instruments in different laborato-
ries and one residing on stand-alone software, the referee
model. the use of the “referee” model helped establish the
baseline values for the model parameters such as spectral
range, spectra pre-treatment, calibration algorithms, etc.
that could be used across the various software platforms
despite the variations in instrument bandwidth, spectral data
points, algorithms for smoothing, derivative and other calcu-
lations. Specifically, while it was found that a pca model
based on calculating Mahalanobis distance in pc space,
second derivative Gap 20, second order polynomial was suffi-
cient to model the calibration and test set on two of the three
instruments, it became evident early on in the experiment
that by using exactly the same model consisting of the same
algorithm calculation and same pre-treatment routine was
not possible. However, by the use of the referee software
which, when loaded with the calibration set from all three
instruments, one could easily determine how to optimise the
spectral range, and smoothing and derivative pre-treatment
parameters in order to achieve similar calibration and test
set parameters. the limiting calibration setting was found to
be the polynomial function, which effects how many points
are used to calculate the smoothing and derivative function.
as was shown in figure 4, there is a very narrow spectral
range (100 nm or 1446.46 cm–1
on which separation can be
made on the four data sets in spectra space.
Having met the first objective with some deviations, the
second, third and fourth objectives were easily met, as
the three models were successful at predicting the vali-
dation sets for each instrument, resulting in four distinct
clusters in multidimensional space, each cluster repre-
senting the innovator and generic brand Ibuprofen formula-
tions. additionally, samples from Generic a lots 6Ge0118
(cVS Brand) and p42058 (rite aid) were correctly identified
as not belonging to Generic a lots comprising the cali-
bration sets for all three data sets. It was observed that
these samples have a distinct banding pattern in the region
from 1400 nm–1500 nm from all other sample spectra. the
most likely cause of this is assuredly due to the presence
of a component that absorbs in the nIr region not found in
those samples comprising the calibration set. as a result of
the new band in the critical region between 1400–1500 nm,
these samples also produce a fifth separate and distinct
cluster in pc space.
It must be mentioned here that this experiment, being
performed in different laboratories within different organisa-
tional cultures, was totally driven by a protocol that was jointly
crafted and agreed upon prior to execution. the authors feel
that this is a key point since this experiment was designed to
meet specific objectives, despite the fact that the instruments,
software and personnel were at different locations. this, of
course, was not the major factor. the major factor was trying
to coordinate all of the steps within the protocol from within
different organisations. While nIr experiments are generally
described as non-destructive, fast and cost effective, when
done on a large scale they require planning, discussion and
coordination. this is rarely mentioned. It is hoped that this
example may serve as a model for future applications that
involve large sample sets and multi-organisations using
multiple instrument–software combinations. the current
global pharmaceutical counterfeiting problem is one area that
should benefit from examples like the one demonstrated in
this paper.
Conclusions
the specific objective of the study was to obtain log 1/R spectra
of four formulations of Ibuprofen (200mg) from two branded
Instrument Lot Innovator A
tablets
Innovator B
tablets
Generic A Generic B
foss 9 20 20 0 20
10 20 8 0 20
Bruker 9 20 20 0 20
10 20 8 0 20
Büchi 9 20 20 0 20
10 20 20 0 20
twenty tablets were tested for each combination of instrument and lot, except for lot 10 of Innovator B where only eight were tested on the foss and Bruker
instruments
Table 7. Number of tablets identified.

and two store-branded Ibuprofen (200mg) immediate-release
tablets and use them to design, develop, validate and deploy a
calibration model that can subsequently be used to correctly
classify by discriminant analysis using pca, log 1/R spectra
from unknown samples (validation set) on nIr instruments of
varying types and software configurations.
this experiment was designed to study the impact that nIr
instrument hardware and software configurations have on
nIr method development. Several variables were detected
and assessed. Spectrometer types, sample holders, spec-
tral acquisition settings, data pre-treatments and pca algo-
rithms were studied. nIr method development was attempted
by three different analysts on three different instruments
located in two different laboratories. It was determined
that even though identical samples were used for model-
ling and prediction, and the same calibration approach was
tried on the accompanying software, spectra differences were
observed due to the number of data points, and that these
impact the ability to perform the same or similar data pre-
treatments in different software. furthermore, the algorithms
employed in each software platform may limit the ability to
deploy a method developed on any single software platform
to be deployed across different software platforms. However,
despite the differences observed, it was possible to find a
common method using each software that enabled accurate
predictions of the validation samples when each model was
used independent of instrument and software configuration.
Knowing the sources of variability that impact the log 1/R
nIr spectrum will minimise the overall prediction varia-
bility and increase the likelihood of correctly classifying by
discriminant analysis, the log 1/R spectra from unknown
samples subsequently measured and compared to the spec-
tral library and classified by the calibration model, when
model parameters are used on different instrument and
software combinations.
It was found that using Savitzky–Golay, first derivative, 21
point smoothing, third order polynomial, pre-treated spectra
and either pca or factorisation model (either by Md in prin-
cipal component space for the foss and Büchi or ed in wave-
length space for the Bruker factorisation method) resulted
in different models but possessing the same accuracy capa-
bilities (100%) for predicting samples comprising similar
validation sets. one validation sample set, a store-branded
Ibuprofen (200 mg) immediate-release tablet, was correctly
identified as not belonging to the samples represented in the
calibration set by all three models. Based on these results,
and despite differences in instrument configuration [disper-
sive or fourier transform (ft)], number of spectral data
points, pca or factorisation algorithms and validation model-
ling approach, exact and accurate spectroscopic results can
be achieved using nIr spectroscopy for discriminate analysis.
More importantly, this study shows that the same nIr method
spectral range and pre-treatment parameters can be used
and that nearly the same multivariate models can be obtained,
despite instrumental and software differences, to accurately
predict the identity of pharmaceutical dosage forms.
Acknowledgements
the authors are grateful for the contributions to this project
from each of the following scientists: darrell abernathy,
rebecca allen, todd cecil, Walter Hauck, andrea Iwanik,
Steven lane, Samir Wahab and patricia White from uSp,
rudy flach, charles petersen and Heather coffin from Irvine,
William Martin from the fda, Michael Surgeary from Büchi,
cynthia Kradjel from Integrated technical Solutions and Verne
Hebard from Bruker.
References
1. p. de peinder, M.J. Vredenbregt, t. Visser and d. de
Kaste, “detection of lipitor counterfeits: a comparison of
nIr and raman spectroscopy in combination with che-
mometrics”, J. Pharmaceut. Biomed. Anal. 47, 688 (2008).
doi: 10.1016/j.jpba.2008.02.016
2. J. luypaerta, d.l. Massart and y. Vander Heyden,
“near-infrared spectroscopy applications in pharma-
ceutical analysis”, Talanta 72, 865 (2007). doi: 10.1016/j.
talanta.2006.12.023
3. y. roggo, p. chalusa, l. Maurera, c. lema-Martineza, a.
edmonda and n. Jenta, “a review of near infrared spec-
troscopy and chemometrics in pharmaceutical technolo-
gies”, J. Pharmaceut. Biomed. Anal. 44, 683 (2007). doi:
10.1016/j.jpba.2007.03.023
4. a.K. deisingh, “pharmaceutical counterfeiting”, Analyst
130, 271 (2005). doi: 10.1039/b407759h
5. c.a. anderson, J.K. drennen and e.W. ciurczak,
“pharmaceutical applications of near infrared spec-
troscopy”, in Handbook of Near-Infrared Analysis”, 3rd
edn, (practical Spectroscopy Series Volume 35) ed by
d.a. Burns and e. W. ciurczak. crc press, Boca raton,
florida, uSa, p. 585 (2008).
6. r. de Maesschalck, d. Jouan-rimbaud and d.l.
Massart, “tutorial—the Mahalanobis distance”,
Chemometr. Intell. Lab Syst. 50, 1 (2000). doi: 10.1016/
S0169-7439(99)00047-7
7. USP general information chapter <1119> Near-Infrared
Spectroscopy (http://www.usp.org/uSpnf/revisions/
usp31nf26secondSupplement01.html.
8. S.H.f. Scafi and c. pasquini, “Identification of counterfeit
drugs using near-infrared spectroscopy”, Analyst 126,
2218 (2001). doi: 10.1039/b106744n
9. J. Workman Jr and J. Brown, “a new standard practice
for multivariate, quantitative infrared analysis-part I”,
Spectroscopy 11(2), 48 (1996).
10. J. Workman Jr and J. Brown, “a new standard practice
for multivariate, quantitative infrared analysis-part II”,
Spectroscopy 11(9), 24 (1996).
11. a. Savitzky and M.J.e. Golay, “Smoothing and
differentiation of data by simplified least squares pro-
cedures,” Anal. Chem. 36, 1627 (1964). doi: 10.1021/
ac60214a047

A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms

Recommended

Recommended

More Related Content

Similar to A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms

Similar to A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms (20)

More from Simar Neasy

More from Simar Neasy (20)

Recently uploaded

Recently uploaded (20)

A Comparison Of Near Infrared Method Development Approaches Using A Drug Product On Different Spectrophotometers And Chemometric Software Algorithms