SlideShare a Scribd company logo
1 of 35
CHM 235 – Dr. Skrabal

Statistics for Quantitative Analysis
• Statistics: Set of mathematical tools used to describe
and make judgments about data
• Type of statistics we will talk about in this class has
important assumption associated with it:
Experimental variation in the population from which samples
are drawn has a normal (Gaussian, bell-shaped) distribution.
- Parametric vs. non-parametric statistics
Normal distribution
• Infinite members of group:
population

• Characterize population by taking
samples
• The larger the number of samples,
the closer the distribution becomes to
normal
• Equation of normal distribution:

1
− ( x − µ ) 2 / 2σ 2
y=
e
σ 2π
Normal distribution
• Estimate of mean value
of population = µ
• Estimate of mean value
of samples = x

Mean =

x

=

∑x

i

i

n
Normal distribution
•

Degree of scatter (measure of
central tendency) of population is
quantified by calculating the
standard deviation

•

Std. dev. of population = σ

•

Std. dev. of sample = s

s=

∑ ( xi − x ) 2
i

n −1

x±s
•

Characterize sample by calculating
Standard deviation and the
normal distribution
• Standard deviation
defines the shape of the
normal distribution
(particularly width)
• Larger std. dev. means
more scatter about the
mean, worse precision.
• Smaller std. dev. means
less scatter about the
mean, better precision.
Standard deviation and the
normal distribution
• There is a well-defined relationship
between the std. dev. of a population
and the normal distribution of the
population:
∀ µ ± 1σ encompasses 68.3 % of
measurements
∀ µ ± 2σ encompasses 95.5% of
measurements
∀ µ ± 3σ encompasses 99.7% of
measurements
• (May also consider these
percentages of area under the curve)
Example of mean and standard
deviation calculation
Consider Cu data: 5.23, 5.79, 6.21, 5.88, 6.02 nM

x

= 5.826 nM  5.82 nM

s = 0.368 nM  0.36 nM
Answer: 5.8 2 ± 0.3 6 nM or 5.8 ± 0.4 nM
Learn how to use the statistical functions on your
calculator. Do this example by longhand calculation
once, and also by calculator to verify that you’ll get
exactly the same answer. Then use your calculator
for all future calculations.
Relative standard deviation (rsd)
or coefficient of variation (CV)
s
rsd or CV =  100
x

From previous example,
rsd = (0.36 nM/5.82 nM) 100 = 6. 1 % or 6%
Standard error
•

•

•

Tells us that standard deviation of set of samples should
decrease if we take more measurements
Standard error =

sx

=

s
n

Take twice as many measurements, s decreases by

2 ≈ 1.4

4 =2

•

Take 4x as many measurements, s decreases by

•

There are several quantitative ways to determine the sample
size required to achieve a desired precision for various statistical
applications. Can consult statistics textbooks for further
information; e.g. J.H. Zar, Biostatistical Analysis
Variance
Used in many other statistical calculations and tests
Variance = s2
From previous example, s = 0.36
s2 = (0.36)2 = 0. 129 (not rounded because it is usually
used in further calculations)
Average deviation
•

Another way to express
degree of scatter or
uncertainty in data. Not as
statistically meaningful as
standard deviation, but
useful for small samples.

d=

∑( x − x )
i

i

n

Using previous data:

d=

5.23 − 5.82 + 5.79 − 5.82 + 6.21 − 5.82 + 5.88 − 5.82 + 6.02 − 5.82
5

d = 0.25 → 0.25 or 0.2 nM

Answer : 5.82 ± 0.25 nM or 5.8 ± 0.2 nM
Relative average deviation (RAD)
d 
RAD =  100 (as percentage)
x
 
d 
RAD =  1000 (as parts per thousand , ppt )
x
 
Using previous data,
RAD = (0. 25/5.82) 100 = 4. 2 or 4%
RAD = (0. 25/5.82) 1000 = 42 ppt
 4. 2 x 10 1 or 4 x 10 1 ppt ( 0 / 00 )
Some useful statistical tests
• To characterize or make judgments about data
• Tests that use the Student’s t distribution
– Confidence intervals
– Comparing a measured result with a “known”
value
– Comparing replicate measurements (comparison
of means of two sets of data)
From D.C. Harris (2003) Quantitative Chemical Analysis, 6th Ed.
Confidence intervals
• Quantifies how far the true mean (µ) lies from the
measured mean, x. Uses the mean and standard
deviation of the sample.

µ=x±

ts
n

where t is from the t-table and n = number of
measurements.
Degrees of freedom (df) = n - 1 for the CI.
Example of calculating a
confidence interval
Consider measurement of dissolved Ti
in a standard seawater (NASS-3):
Data: 1.34, 1.15, 1.28, 1.18, 1.33, 1.65,
1.48 nM
DF = n – 1 = 7 – 1 = 6

x = 1.34 nM or 1.3 nM

s = 0.17 or 0.2 nM
95% confidence interval

t(df=6,95%) = 2.447
CI 95 = 1.3 ± 0.16 or 1.3 ± 0.2 nM
50% confidence interval

t(df=6,50%) = 0.718
CI 50 = 1.3 ± 0.05 nM

µ=x±

ts
n
Interpreting the confidence interval
• For a 95% CI, there is a 95% probability that the true
mean (µ) lies between the range 1.3 ± 0.2 nM, or
between 1.1 and 1.5 nM
• For a 50% CI, there is a 50% probability that the true
mean lies between the range 1.3 ± 0.05 nM, or
between 1.25 and 1.35 nM
• Note that CI will decrease as n is increased
• Useful for characterizing data that are regularly
obtained; e.g., quality assurance, quality control
Comparing a measured result
with a “known” value
• “Known” value would typically be a certified value
from a standard reference material (SRM)
• Another application of the t statistic

t calc =

known value − x
s

n

Will compare tcalc to tabulated value of t at appropriate df
and CL.
df = n -1 for this test
Comparing a measured result
with a “known” value--example
Dissolved Fe analysis verified using NASS-3 seawater SRM
Certified value = 5.85 nM
Experimental results: 5.76 ± 0.17 nM (n = 10)
tcalc =

known value − x
s

n

=

5.85 − 5.7 6
0.17

10

= 1.674

(Keep 3 decimal places for comparison to table.)
Compare to ttable; df = 10 - 1 = 9, 95% CL

ttable(df=9,95% CL) = 2.262
If |tcalc| < ttable, results are not significantly different at the 95% CL.
If |tcalc| ≥ ttable, results are significantly different at the 95% CL.
For this example, tcalc < ttest, so experimental results are not significantly
different at the 95% CL
Comparing replicate measurements or
comparing means of two sets of data
• Yet another application of the t statistic
• Example: Given the same sample analyzed by two
different methods, do the two methods give the “same”
result?

t calc =
s pooled =

x1 − x 2

n1 n2

s pooled

n1 + n2

2
s12 (n1 −1) + s 2 (n2 −1)
n1 + n2 − 2

Will compare tcalc to tabulated value of t at appropriate df
and CL.
df = n1 + n2 – 2 for this test
Comparing replicate measurements
or comparing means of two sets of
data—example
Determination of nickel in sewage
sludge
using two
Method 1: Atomic absorption different methods
Method 2: Spectrophotometry
spectroscopy
Data: 3.91, 4.02, 3.86, 3.99 mg/g

Data: 3.52, 3.77, 3.49, 3.59 mg/g
= 3.59 mg/g

x1 = 3.945 mg/g

x2

s1 = 0.073 mg/g

s2 = 0.12 mg/g

n1 = 4

n2 = 4
Comparing replicate measurements or
comparing means of two sets of data—
example
s pooled =

tcalc =

2
s12 (n1 −1) + s2 (n2 −1)
n1 + n2 − 2

x1 − x2

n1 n2

s pooled

n1 + n2

=

=

(0.07 3 ) 2 (4 −1) + (0.12 ) 2 (4 −1)
4+4−2

3.945 − 3.59
0.0993

(4)(4)
4+4

= 0.0993

= 5.056

Note: Keep 3 decimal places to compare to ttable.
Compare to ttable at df = 4 + 4 – 2 = 6 and 95% CL.
ttable(df=6,95% CL) = 2.447
If |tcalc| < ttable, results are not significantly different at the 95%. CL.
If |tcalc| ≥ ttable, results are significantly different at the 95% CL.
Since |tcalc| (5.056) ≥ ttable (2.447), results from the two methods are
significantly different at the 95% CL.
Comparing replicate measurements or
comparing means of two sets of data
Wait a minute! There is an important assumption
associated with this t-test:
It is assumed that the standard deviations (i.e., the
precision) of the two sets of data being compared are not
significantly different.
• How do you test to see if the two std. devs. are different?
• How do you compare two sets of data whose std. devs.
are significantly different?
F-test to compare standard
deviations
• Used to determine if std. devs. are significantly
different before application of t-test to compare
replicate measurements or compare means of two
sets of data
• Also used as a simple general test to compare the
precision (as measured by the std. devs.) of two sets
of data
• Uses F distribution
F-test to compare standard
deviations
Will compute Fcalc and compare to Ftable.

Fcalc

=

s12
2
s2

where s1 > s2

DF = n1 - 1 and n2 - 1 for this test.
Choose confidence level (95% is a typical CL).
From D.C. Harris (2003) Quantitative Chemical Analysis, 6th Ed

.
F-test to compare standard deviations
From previous example:
Let s1 = 0.12 and s2 = 0.073

Fcalc

=

s12
2
s2

=

(0.12 ) 2
(0.07 3 ) 2

= 2.70

Note: Keep 2 or 3 decimal places to compare with F table.
Compare Fcalc to Ftable at df = (n1 -1, n2 -1) = 3,3 and 95% CL.
If Fcalc < Ftable, std. devs. are not significantly different at 95% CL.
If Fcalc ≥ Ftable, std. devs. are significantly different at 95% CL.
Ftable(df=3,3;95% CL) = 9.28
Since Fcalc (2.70) < Ftable (9.28), std. devs. of the two sets of data are
not significantly different at the 95% CL. (Precisions are
similar.)
Comparing replicate measurements or
comparing means of two sets of data-revisited
The use of the t-test for comparing means was justified
for the previous example because we showed that
standard deviations of the two sets of data were not
significantly different.
If the F-test shows that std. devs. of two sets of data
are significantly different and you need to compare
the means, use a different version of the t-test 
Comparing replicate measurements or
comparing means from two sets of data
when std. devs. are significantly
different

x1 − x2

tcalc

=

DF





2
2
2
( s1 / n1 + s2 / n2 )


=  2
−2
2
 ( s1 / n1 ) 2
( s2 / n2 ) 2  


+
 n +1
 1
n2 + 1  



2
s12 / n1 + s2 / n2
Flowchart for comparing means of two
sets of data or replicate
measurements
Use F-test to see if std.
devs. of the 2 sets of
data are significantly
different or not

Std. devs. are
significantly different
Use the 2nd version
of the t-test (the
beastly version)

Std. devs. are not
significantly different
Use the 1st version of the
t-test (see previous, fully
worked-out example)
One last comment on the F-test
Note that the F-test can be used to simply test whether
or not two sets of data have statistically similar
precisions or not.
Can use to answer a question such as: Do method one
and method two provide similar precisions for the
analysis of the same analyte?
Evaluating questionable data
points using the Q-test
• Need a way to test questionable data points (outliers) in an
unbiased way.
• Q-test is a common method to do this.
• Requires 4 or more data points to apply.
Calculate Qcalc and compare to Qtable
Qcalc = gap/range
Gap = (difference between questionable data pt. and its
nearest neighbor)
Range = (largest data point – smallest data point)
Evaluating questionable data
points using the Q-test--example
Consider set of data; Cu values in sewage sample:
9.52, 10.7, 13.1, 9.71, 10.3, 9.99 mg/L
Arrange data in increasing or decreasing order:
9.52, 9.71, 9.99, 10.3, 10.7, 13.1
The questionable data point (outlier) is 13.1
Calculate

Qcalc

=

gap
range

=

(13.1 − 10.7)
(13.1 − 9.52)

= 0.670

Compare Qcalc to Qtable for n observations and desired CL (90% or
95% is typical). It is desirable to keep 2-3 decimal places in Q calc
so judgment from table can be made.
Qtable (n=6,90% CL) = 0.56
From G.D. Christian (1994) Analytical Chemistry, 5th Ed.
Evaluating questionable data points
using the Q-test--example
If Qcalc < Qtable, do not reject questionable data point at stated CL.
If Qcalc ≥ Qtable, reject questionable data point at stated CL.
From previous example,
Qcalc (0.670) > Qtable (0.56), so reject data point at 90% CL.
Subsequent calculations (e.g., mean and standard deviation)
should then exclude the rejected point.
Mean and std. dev. of remaining data: 10.04 ± 0.47 mg/L

More Related Content

What's hot

Student's T-test, Paired T-Test, ANOVA & Proportionate Test
Student's T-test, Paired T-Test, ANOVA & Proportionate TestStudent's T-test, Paired T-Test, ANOVA & Proportionate Test
Student's T-test, Paired T-Test, ANOVA & Proportionate TestAzmi Mohd Tamil
 
Statistics-Non parametric test
Statistics-Non parametric testStatistics-Non parametric test
Statistics-Non parametric testRabin BK
 
Common evaluation measures in NLP and IR
Common evaluation measures in NLP and IRCommon evaluation measures in NLP and IR
Common evaluation measures in NLP and IRRushdi Shams
 
Kruskal Wallis test, Friedman test, Spearman Correlation
Kruskal Wallis test, Friedman test, Spearman CorrelationKruskal Wallis test, Friedman test, Spearman Correlation
Kruskal Wallis test, Friedman test, Spearman CorrelationRizwan S A
 
T Test Presentation.pptx
T Test Presentation.pptxT Test Presentation.pptx
T Test Presentation.pptxVishal Doke
 
Statistics and probability
Statistics and probabilityStatistics and probability
Statistics and probabilityShahwarKhan16
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft
 
Chi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarChi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarAzmi Mohd Tamil
 
The chi – square test
The chi – square testThe chi – square test
The chi – square testMajesty Ortiz
 
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anovaSolution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anovaLong Beach City College
 
Reporting an independent sample t test
Reporting an independent sample t testReporting an independent sample t test
Reporting an independent sample t testKen Plummer
 
Chapter 5 experimental design for sbh
Chapter 5 experimental design for sbhChapter 5 experimental design for sbh
Chapter 5 experimental design for sbhRione Drevale
 

What's hot (20)

Mc Nemar
Mc NemarMc Nemar
Mc Nemar
 
Student's T-test, Paired T-Test, ANOVA & Proportionate Test
Student's T-test, Paired T-Test, ANOVA & Proportionate TestStudent's T-test, Paired T-Test, ANOVA & Proportionate Test
Student's T-test, Paired T-Test, ANOVA & Proportionate Test
 
Inferences about Two Proportions
 Inferences about Two Proportions Inferences about Two Proportions
Inferences about Two Proportions
 
Statistics-Non parametric test
Statistics-Non parametric testStatistics-Non parametric test
Statistics-Non parametric test
 
Common evaluation measures in NLP and IR
Common evaluation measures in NLP and IRCommon evaluation measures in NLP and IR
Common evaluation measures in NLP and IR
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Kruskal Wallis test, Friedman test, Spearman Correlation
Kruskal Wallis test, Friedman test, Spearman CorrelationKruskal Wallis test, Friedman test, Spearman Correlation
Kruskal Wallis test, Friedman test, Spearman Correlation
 
T Test Presentation.pptx
T Test Presentation.pptxT Test Presentation.pptx
T Test Presentation.pptx
 
Statistics and probability
Statistics and probabilityStatistics and probability
Statistics and probability
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas help
 
Makalah ukuran penyebaran
Makalah ukuran penyebaranMakalah ukuran penyebaran
Makalah ukuran penyebaran
 
Goodness of Fit Notation
Goodness of Fit NotationGoodness of Fit Notation
Goodness of Fit Notation
 
Chi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarChi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemar
 
Testing a Claim About a Proportion
Testing a Claim About a ProportionTesting a Claim About a Proportion
Testing a Claim About a Proportion
 
The chi – square test
The chi – square testThe chi – square test
The chi – square test
 
Chi square test
Chi square testChi square test
Chi square test
 
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anovaSolution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
 
All non parametric test
All non parametric testAll non parametric test
All non parametric test
 
Reporting an independent sample t test
Reporting an independent sample t testReporting an independent sample t test
Reporting an independent sample t test
 
Chapter 5 experimental design for sbh
Chapter 5 experimental design for sbhChapter 5 experimental design for sbh
Chapter 5 experimental design for sbh
 

Viewers also liked (18)

Mars
MarsMars
Mars
 
Conditionals(1)
Conditionals(1)Conditionals(1)
Conditionals(1)
 
Kxu stat-anderson-ch02
Kxu stat-anderson-ch02Kxu stat-anderson-ch02
Kxu stat-anderson-ch02
 
Rock
RockRock
Rock
 
Transitive and intertransitive verbs
Transitive and intertransitive verbsTransitive and intertransitive verbs
Transitive and intertransitive verbs
 
Matter
MatterMatter
Matter
 
028 unit 4 (19)
028 unit 4 (19)028 unit 4 (19)
028 unit 4 (19)
 
Marmite: You either love it or hate it
Marmite: You either love it or hate itMarmite: You either love it or hate it
Marmite: You either love it or hate it
 
My poems!
My poems!My poems!
My poems!
 
Doc1
Doc1Doc1
Doc1
 
Electromagnetic spectrum
Electromagnetic spectrum Electromagnetic spectrum
Electromagnetic spectrum
 
5.5 triangle inequality theorem
5.5 triangle inequality theorem5.5 triangle inequality theorem
5.5 triangle inequality theorem
 
Investigatoryprojectparts 111111075710-phpapp01
Investigatoryprojectparts 111111075710-phpapp01Investigatoryprojectparts 111111075710-phpapp01
Investigatoryprojectparts 111111075710-phpapp01
 
Sound and hearing
Sound and hearingSound and hearing
Sound and hearing
 
40 Most Asked Interview Questions (with answers)
40 Most Asked Interview Questions (with answers)40 Most Asked Interview Questions (with answers)
40 Most Asked Interview Questions (with answers)
 
Mobile Games Drive Engagement
Mobile Games Drive EngagementMobile Games Drive Engagement
Mobile Games Drive Engagement
 
Elementsofpoetry 100426133835-phpapp02
Elementsofpoetry 100426133835-phpapp02Elementsofpoetry 100426133835-phpapp02
Elementsofpoetry 100426133835-phpapp02
 
8051 Timers / Counters
8051 Timers / Counters8051 Timers / Counters
8051 Timers / Counters
 

Similar to Statistics chm 235

Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceLong Beach City College
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Long Beach City College
 
Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of DispersionKainatIqbal7
 
3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplotsLong Beach City College
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceLong Beach City College
 
Dr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdf
Dr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdfDr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdf
Dr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdfHassanMohyUdDin2
 
Lecture 11 Paired t test.pptx
Lecture 11 Paired t test.pptxLecture 11 Paired t test.pptx
Lecture 11 Paired t test.pptxshakirRahman10
 
GC-S005-DataAnalysis
GC-S005-DataAnalysisGC-S005-DataAnalysis
GC-S005-DataAnalysishenry kang
 
Role of estimation in statistics
Role of estimation in statisticsRole of estimation in statistics
Role of estimation in statisticsNadeem Uddin
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsBabasab Patil
 

Similar to Statistics chm 235 (20)

Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of Dispersion
 
T test statistics
T test statisticsT test statistics
T test statistics
 
3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots
 
9618821.ppt
9618821.ppt9618821.ppt
9618821.ppt
 
9618821.pdf
9618821.pdf9618821.pdf
9618821.pdf
 
Statistical test
Statistical testStatistical test
Statistical test
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Test of significance
Test of significanceTest of significance
Test of significance
 
Dr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdf
Dr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdfDr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdf
Dr.Dinesh-BIOSTAT-Tests-of-significance-1-min.pdf
 
Lecture 11 Paired t test.pptx
Lecture 11 Paired t test.pptxLecture 11 Paired t test.pptx
Lecture 11 Paired t test.pptx
 
GC-S005-DataAnalysis
GC-S005-DataAnalysisGC-S005-DataAnalysis
GC-S005-DataAnalysis
 
Measures of Variation
Measures of Variation Measures of Variation
Measures of Variation
 
Role of estimation in statistics
Role of estimation in statisticsRole of estimation in statistics
Role of estimation in statistics
 
Stat2013
Stat2013Stat2013
Stat2013
 
Estimating a Population Mean
Estimating a Population MeanEstimating a Population Mean
Estimating a Population Mean
 
Estimating a Population Mean
Estimating a Population MeanEstimating a Population Mean
Estimating a Population Mean
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec doms
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 

More from Alex Robianes Hernandez (19)

Grade8aralingpanlipunanmodyul3 130818183043-phpapp01
Grade8aralingpanlipunanmodyul3 130818183043-phpapp01Grade8aralingpanlipunanmodyul3 130818183043-phpapp01
Grade8aralingpanlipunanmodyul3 130818183043-phpapp01
 
Indesign
IndesignIndesign
Indesign
 
Klinefelter syndrome
Klinefelter syndromeKlinefelter syndrome
Klinefelter syndrome
 
Presentation 9
Presentation 9Presentation 9
Presentation 9
 
Conditionals
ConditionalsConditionals
Conditionals
 
Earthquakes2
Earthquakes2Earthquakes2
Earthquakes2
 
Presentationofdata 120111034007-phpapp02
Presentationofdata 120111034007-phpapp02Presentationofdata 120111034007-phpapp02
Presentationofdata 120111034007-phpapp02
 
Basic sentence patterns_with_e
Basic sentence patterns_with_eBasic sentence patterns_with_e
Basic sentence patterns_with_e
 
Work and energy
Work and energyWork and energy
Work and energy
 
Adjectives and adverbs final
Adjectives and adverbs finalAdjectives and adverbs final
Adjectives and adverbs final
 
Mollusks and annelids
Mollusks and annelidsMollusks and annelids
Mollusks and annelids
 
Science investigatory project...
Science investigatory project...Science investigatory project...
Science investigatory project...
 
Dinosaurs
DinosaursDinosaurs
Dinosaurs
 
Heat transfer
Heat transferHeat transfer
Heat transfer
 
Pangkat-etniko
Pangkat-etnikoPangkat-etniko
Pangkat-etniko
 
Pamahalaang pilipino
Pamahalaang pilipinoPamahalaang pilipino
Pamahalaang pilipino
 
Light...
Light...Light...
Light...
 
Years juers
Years juersYears juers
Years juers
 
Fdj
FdjFdj
Fdj
 

Recently uploaded

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Statistics chm 235

  • 1. CHM 235 – Dr. Skrabal Statistics for Quantitative Analysis • Statistics: Set of mathematical tools used to describe and make judgments about data • Type of statistics we will talk about in this class has important assumption associated with it: Experimental variation in the population from which samples are drawn has a normal (Gaussian, bell-shaped) distribution. - Parametric vs. non-parametric statistics
  • 2. Normal distribution • Infinite members of group: population • Characterize population by taking samples • The larger the number of samples, the closer the distribution becomes to normal • Equation of normal distribution: 1 − ( x − µ ) 2 / 2σ 2 y= e σ 2π
  • 3. Normal distribution • Estimate of mean value of population = µ • Estimate of mean value of samples = x Mean = x = ∑x i i n
  • 4. Normal distribution • Degree of scatter (measure of central tendency) of population is quantified by calculating the standard deviation • Std. dev. of population = σ • Std. dev. of sample = s s= ∑ ( xi − x ) 2 i n −1 x±s • Characterize sample by calculating
  • 5. Standard deviation and the normal distribution • Standard deviation defines the shape of the normal distribution (particularly width) • Larger std. dev. means more scatter about the mean, worse precision. • Smaller std. dev. means less scatter about the mean, better precision.
  • 6. Standard deviation and the normal distribution • There is a well-defined relationship between the std. dev. of a population and the normal distribution of the population: ∀ µ ± 1σ encompasses 68.3 % of measurements ∀ µ ± 2σ encompasses 95.5% of measurements ∀ µ ± 3σ encompasses 99.7% of measurements • (May also consider these percentages of area under the curve)
  • 7. Example of mean and standard deviation calculation Consider Cu data: 5.23, 5.79, 6.21, 5.88, 6.02 nM x = 5.826 nM  5.82 nM s = 0.368 nM  0.36 nM Answer: 5.8 2 ± 0.3 6 nM or 5.8 ± 0.4 nM Learn how to use the statistical functions on your calculator. Do this example by longhand calculation once, and also by calculator to verify that you’ll get exactly the same answer. Then use your calculator for all future calculations.
  • 8. Relative standard deviation (rsd) or coefficient of variation (CV) s rsd or CV =  100 x From previous example, rsd = (0.36 nM/5.82 nM) 100 = 6. 1 % or 6%
  • 9. Standard error • • • Tells us that standard deviation of set of samples should decrease if we take more measurements Standard error = sx = s n Take twice as many measurements, s decreases by 2 ≈ 1.4 4 =2 • Take 4x as many measurements, s decreases by • There are several quantitative ways to determine the sample size required to achieve a desired precision for various statistical applications. Can consult statistics textbooks for further information; e.g. J.H. Zar, Biostatistical Analysis
  • 10. Variance Used in many other statistical calculations and tests Variance = s2 From previous example, s = 0.36 s2 = (0.36)2 = 0. 129 (not rounded because it is usually used in further calculations)
  • 11. Average deviation • Another way to express degree of scatter or uncertainty in data. Not as statistically meaningful as standard deviation, but useful for small samples. d= ∑( x − x ) i i n Using previous data: d= 5.23 − 5.82 + 5.79 − 5.82 + 6.21 − 5.82 + 5.88 − 5.82 + 6.02 − 5.82 5 d = 0.25 → 0.25 or 0.2 nM Answer : 5.82 ± 0.25 nM or 5.8 ± 0.2 nM
  • 12. Relative average deviation (RAD) d  RAD =  100 (as percentage) x   d  RAD =  1000 (as parts per thousand , ppt ) x   Using previous data, RAD = (0. 25/5.82) 100 = 4. 2 or 4% RAD = (0. 25/5.82) 1000 = 42 ppt  4. 2 x 10 1 or 4 x 10 1 ppt ( 0 / 00 )
  • 13. Some useful statistical tests • To characterize or make judgments about data • Tests that use the Student’s t distribution – Confidence intervals – Comparing a measured result with a “known” value – Comparing replicate measurements (comparison of means of two sets of data)
  • 14. From D.C. Harris (2003) Quantitative Chemical Analysis, 6th Ed.
  • 15. Confidence intervals • Quantifies how far the true mean (µ) lies from the measured mean, x. Uses the mean and standard deviation of the sample. µ=x± ts n where t is from the t-table and n = number of measurements. Degrees of freedom (df) = n - 1 for the CI.
  • 16. Example of calculating a confidence interval Consider measurement of dissolved Ti in a standard seawater (NASS-3): Data: 1.34, 1.15, 1.28, 1.18, 1.33, 1.65, 1.48 nM DF = n – 1 = 7 – 1 = 6 x = 1.34 nM or 1.3 nM s = 0.17 or 0.2 nM 95% confidence interval t(df=6,95%) = 2.447 CI 95 = 1.3 ± 0.16 or 1.3 ± 0.2 nM 50% confidence interval t(df=6,50%) = 0.718 CI 50 = 1.3 ± 0.05 nM µ=x± ts n
  • 17. Interpreting the confidence interval • For a 95% CI, there is a 95% probability that the true mean (µ) lies between the range 1.3 ± 0.2 nM, or between 1.1 and 1.5 nM • For a 50% CI, there is a 50% probability that the true mean lies between the range 1.3 ± 0.05 nM, or between 1.25 and 1.35 nM • Note that CI will decrease as n is increased • Useful for characterizing data that are regularly obtained; e.g., quality assurance, quality control
  • 18. Comparing a measured result with a “known” value • “Known” value would typically be a certified value from a standard reference material (SRM) • Another application of the t statistic t calc = known value − x s n Will compare tcalc to tabulated value of t at appropriate df and CL. df = n -1 for this test
  • 19. Comparing a measured result with a “known” value--example Dissolved Fe analysis verified using NASS-3 seawater SRM Certified value = 5.85 nM Experimental results: 5.76 ± 0.17 nM (n = 10) tcalc = known value − x s n = 5.85 − 5.7 6 0.17 10 = 1.674 (Keep 3 decimal places for comparison to table.) Compare to ttable; df = 10 - 1 = 9, 95% CL ttable(df=9,95% CL) = 2.262 If |tcalc| < ttable, results are not significantly different at the 95% CL. If |tcalc| ≥ ttable, results are significantly different at the 95% CL. For this example, tcalc < ttest, so experimental results are not significantly different at the 95% CL
  • 20. Comparing replicate measurements or comparing means of two sets of data • Yet another application of the t statistic • Example: Given the same sample analyzed by two different methods, do the two methods give the “same” result? t calc = s pooled = x1 − x 2 n1 n2 s pooled n1 + n2 2 s12 (n1 −1) + s 2 (n2 −1) n1 + n2 − 2 Will compare tcalc to tabulated value of t at appropriate df and CL. df = n1 + n2 – 2 for this test
  • 21. Comparing replicate measurements or comparing means of two sets of data—example Determination of nickel in sewage sludge using two Method 1: Atomic absorption different methods Method 2: Spectrophotometry spectroscopy Data: 3.91, 4.02, 3.86, 3.99 mg/g Data: 3.52, 3.77, 3.49, 3.59 mg/g = 3.59 mg/g x1 = 3.945 mg/g x2 s1 = 0.073 mg/g s2 = 0.12 mg/g n1 = 4 n2 = 4
  • 22. Comparing replicate measurements or comparing means of two sets of data— example s pooled = tcalc = 2 s12 (n1 −1) + s2 (n2 −1) n1 + n2 − 2 x1 − x2 n1 n2 s pooled n1 + n2 = = (0.07 3 ) 2 (4 −1) + (0.12 ) 2 (4 −1) 4+4−2 3.945 − 3.59 0.0993 (4)(4) 4+4 = 0.0993 = 5.056 Note: Keep 3 decimal places to compare to ttable. Compare to ttable at df = 4 + 4 – 2 = 6 and 95% CL. ttable(df=6,95% CL) = 2.447 If |tcalc| < ttable, results are not significantly different at the 95%. CL. If |tcalc| ≥ ttable, results are significantly different at the 95% CL. Since |tcalc| (5.056) ≥ ttable (2.447), results from the two methods are significantly different at the 95% CL.
  • 23. Comparing replicate measurements or comparing means of two sets of data Wait a minute! There is an important assumption associated with this t-test: It is assumed that the standard deviations (i.e., the precision) of the two sets of data being compared are not significantly different. • How do you test to see if the two std. devs. are different? • How do you compare two sets of data whose std. devs. are significantly different?
  • 24. F-test to compare standard deviations • Used to determine if std. devs. are significantly different before application of t-test to compare replicate measurements or compare means of two sets of data • Also used as a simple general test to compare the precision (as measured by the std. devs.) of two sets of data • Uses F distribution
  • 25. F-test to compare standard deviations Will compute Fcalc and compare to Ftable. Fcalc = s12 2 s2 where s1 > s2 DF = n1 - 1 and n2 - 1 for this test. Choose confidence level (95% is a typical CL).
  • 26. From D.C. Harris (2003) Quantitative Chemical Analysis, 6th Ed .
  • 27. F-test to compare standard deviations From previous example: Let s1 = 0.12 and s2 = 0.073 Fcalc = s12 2 s2 = (0.12 ) 2 (0.07 3 ) 2 = 2.70 Note: Keep 2 or 3 decimal places to compare with F table. Compare Fcalc to Ftable at df = (n1 -1, n2 -1) = 3,3 and 95% CL. If Fcalc < Ftable, std. devs. are not significantly different at 95% CL. If Fcalc ≥ Ftable, std. devs. are significantly different at 95% CL. Ftable(df=3,3;95% CL) = 9.28 Since Fcalc (2.70) < Ftable (9.28), std. devs. of the two sets of data are not significantly different at the 95% CL. (Precisions are similar.)
  • 28. Comparing replicate measurements or comparing means of two sets of data-revisited The use of the t-test for comparing means was justified for the previous example because we showed that standard deviations of the two sets of data were not significantly different. If the F-test shows that std. devs. of two sets of data are significantly different and you need to compare the means, use a different version of the t-test 
  • 29. Comparing replicate measurements or comparing means from two sets of data when std. devs. are significantly different x1 − x2 tcalc = DF     2 2 2 ( s1 / n1 + s2 / n2 )   =  2 −2 2  ( s1 / n1 ) 2 ( s2 / n2 ) 2     +  n +1  1 n2 + 1     2 s12 / n1 + s2 / n2
  • 30. Flowchart for comparing means of two sets of data or replicate measurements Use F-test to see if std. devs. of the 2 sets of data are significantly different or not Std. devs. are significantly different Use the 2nd version of the t-test (the beastly version) Std. devs. are not significantly different Use the 1st version of the t-test (see previous, fully worked-out example)
  • 31. One last comment on the F-test Note that the F-test can be used to simply test whether or not two sets of data have statistically similar precisions or not. Can use to answer a question such as: Do method one and method two provide similar precisions for the analysis of the same analyte?
  • 32. Evaluating questionable data points using the Q-test • Need a way to test questionable data points (outliers) in an unbiased way. • Q-test is a common method to do this. • Requires 4 or more data points to apply. Calculate Qcalc and compare to Qtable Qcalc = gap/range Gap = (difference between questionable data pt. and its nearest neighbor) Range = (largest data point – smallest data point)
  • 33. Evaluating questionable data points using the Q-test--example Consider set of data; Cu values in sewage sample: 9.52, 10.7, 13.1, 9.71, 10.3, 9.99 mg/L Arrange data in increasing or decreasing order: 9.52, 9.71, 9.99, 10.3, 10.7, 13.1 The questionable data point (outlier) is 13.1 Calculate Qcalc = gap range = (13.1 − 10.7) (13.1 − 9.52) = 0.670 Compare Qcalc to Qtable for n observations and desired CL (90% or 95% is typical). It is desirable to keep 2-3 decimal places in Q calc so judgment from table can be made. Qtable (n=6,90% CL) = 0.56
  • 34. From G.D. Christian (1994) Analytical Chemistry, 5th Ed.
  • 35. Evaluating questionable data points using the Q-test--example If Qcalc < Qtable, do not reject questionable data point at stated CL. If Qcalc ≥ Qtable, reject questionable data point at stated CL. From previous example, Qcalc (0.670) > Qtable (0.56), so reject data point at 90% CL. Subsequent calculations (e.g., mean and standard deviation) should then exclude the rejected point. Mean and std. dev. of remaining data: 10.04 ± 0.47 mg/L