SlideShare a Scribd company logo
Standard Deviation
Dr.A.Antonyraj
Variance: a measure of how data
points differ from the mean
• Data Set 1: 3, 5, 7, 10, 10
Data Set 2: 7, 7, 7, 7, 7
What is the mean and median of the above data set?
Data Set 1: mean = 7, median = 7
Data Set 2: mean = 7, median = 7
But we know that the two data sets are not identical! The
variance shows how they are different.
We want to find a way to represent these two data set
numerically.
How to Calculate?
• If we conceptualize the spread of a distribution
as the extent to which the values in the
distribution differ from the mean and from each
other, then a reasonable measure of spread
might be the average deviation, or difference, of
the values from the mean.
( )x X
N
 
• Although this might seem reasonable, this expression
always equals 0, because the negative deviations about the
mean always cancel out the positive deviations about the
mean.
• We could just drop the negative signs, which is the same
mathematically as taking the absolute value, which is known
as the mean deviations.
• The concept of absolute value does not lend itself to the kind
of advanced mathematical manipulation necessary for the
development of inferential statistical formulas.
• The average of the squared deviations about the mean is
called the variance.
 
2
2
x X
N

 

 
2
2
1
x X
s
n
 


For population variance
For sample variance
X XX XScore
X
( )2
1
3
2
5
3
7
4
10
5
10
Totals
35
The mean is 35/5=7.
X XX XScore
X
( )2
1
3 3-7=-4
2
5 5-7=-2
3
7 7-7=0
4
10 10-7=3
5
10 10-7=3
Totals
35
X XX XScore
X
( )2
1
3 3-7=-4 16
2
5 5-7=-2 4
3
7 7-7=0 0
4
10 10-7=3 9
5
10 10-7=3 9
Totals
35 38
X XX XScore
X
( )2
1
3 3-7=-4 16
2
5 5-7=-2 4
3
7 7-7=0 0
4
10 10-7=3 9
5
10 10-7=3 9
Totals
35 38
 
2
2 38
7.6
5
x X
s
n
 
  
Example 2
Dive Mark Myrna
1 28 27
2 22 27
3 21 28
4 26 6
5 18 27
Find the mean, median, mode, range?
mean 23 23
median 22 27
range 10 22
What can be said about this data?
Due to the outlier, the median is more typical of overall performance.
Which diver was more consistent?
X X X XDive Mark's Score
X
( )2
1 28 5 25
2 22 -1 1
3 21 -2 4
4 26 3 9
5 18 -5 25
Totals 115 0 64
Mark’s Variance = 64 / 5 = 12.8
Myrna’s Variance = 362 / 5 = 72.4
Conclusion: Mark has a lower variance therefore he is more consistent.
standard deviation - a measure of
variation of scores about the mean
• Can think of standard deviation as the average
distance to the mean, although that's not
numerically accurate, it's conceptually helpful.
All ways of saying the same thing: higher
standard deviation indicates higher spread, less
consistency, and less clustering.
• sample standard deviation:
• population standard deviation:
 
2
1
x X
s
n
 


 
2
x
N


 

Another formula
• Definitional formula for variance for data in a
frequency distribution
• Definitional formula for standard deviation for
data in a frequency distribution
2
2
( )X X f
S
f




2
( )X X f
S
f




X X X X X XMyrna’s Score X f ( )2 ( )2 x f
28 1
27 3
6 1
115 5
The mean is 23
X X X X X XMyrna’s Score X f ( )2 ( )2 x f
28 1 5
27 3 4
6 1 -17
115 5
X X X X X XMyrna’s Score X f ( )2 ( )2 x f
28 1 5 25
27 3 4 16
6 1 -17 289
115 5
X X X X X XMyrna’s Score X f ( )2 ( )2 x f
28 1 5 25 25
27 3 4 16 48
6 1 -17 289 289
115 5 362
Variance = S2 = 362 / 5 = 72.4
Standard Deviation = 72.4 = 8.5
round-off rule – carry
one more decimal
place than was
present in the
original data
Bell shaped curve
• empirical rule for data (68-95-99) - only applies
to a set of data having a distribution that is
approximately bell-shaped: (figure pg 220)
•  68% of all scores fall with 1 standard deviation
of the mean
•  95% of all scores fall with 2 standard deviation
of the mean
•  99.7% of all scores fall with 3 standard
deviation of the mean

More Related Content

What's hot

Final presentation
Final presentationFinal presentation
Final presentation
paezp
 
Variance
VarianceVariance
Variance
Boris Valeroso
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
Ryan Herzog
 
2 5 standard deviation
2 5 standard deviation2 5 standard deviation
2 5 standard deviation
Ken Kretsch
 
Measures of-variation
Measures of-variationMeasures of-variation
Measures of-variation
Jhonna Barrosa
 
Topic 1 part 2
Topic 1 part 2Topic 1 part 2
Topic 1 part 2
Ryan Herzog
 
Geometric Distribution
Geometric DistributionGeometric Distribution
Geometric Distribution
Ratul Basak
 
Approach to anova questions
Approach to anova questionsApproach to anova questions
Approach to anova questions
GeorgeGidudu
 
Help on frequency distributions
Help on frequency distributionsHelp on frequency distributions
Help on frequency distributions
Brent Heard
 
Standard deviation (3)
Standard deviation (3)Standard deviation (3)
Standard deviation (3)
Sonali Prasad
 
February 13, 2015
February 13, 2015February 13, 2015
February 13, 2015
khyps13
 
A1 Test 3 study guide with answers
A1 Test 3 study guide with answersA1 Test 3 study guide with answers
A1 Test 3 study guide with answers
vhiggins1
 
Mean deviation (2018)
Mean deviation (2018)Mean deviation (2018)
Mean deviation (2018)
sumanmathews
 
Lecture determinants good one
Lecture determinants good oneLecture determinants good one
Lecture determinants good one
Hazel Joy Chong
 
Interval Notation
Interval NotationInterval Notation
Interval Notation
MarkBredin
 
3.2 Measures of variation
3.2 Measures of variation3.2 Measures of variation
3.2 Measures of variation
Long Beach City College
 
Student t t est
Student t t estStudent t t est
Student t t est
Ashok Reddy
 
Solution of system of linear equations by elimination
Solution of system of linear equations by eliminationSolution of system of linear equations by elimination
Solution of system of linear equations by elimination
Regie Panganiban
 
Elements of a sequence
Elements of a sequenceElements of a sequence
Elements of a sequence
MartinGeraldine
 
Math presentation
Math presentationMath presentation
Math presentation
MdAlAmin187
 

What's hot (20)

Final presentation
Final presentationFinal presentation
Final presentation
 
Variance
VarianceVariance
Variance
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
2 5 standard deviation
2 5 standard deviation2 5 standard deviation
2 5 standard deviation
 
Measures of-variation
Measures of-variationMeasures of-variation
Measures of-variation
 
Topic 1 part 2
Topic 1 part 2Topic 1 part 2
Topic 1 part 2
 
Geometric Distribution
Geometric DistributionGeometric Distribution
Geometric Distribution
 
Approach to anova questions
Approach to anova questionsApproach to anova questions
Approach to anova questions
 
Help on frequency distributions
Help on frequency distributionsHelp on frequency distributions
Help on frequency distributions
 
Standard deviation (3)
Standard deviation (3)Standard deviation (3)
Standard deviation (3)
 
February 13, 2015
February 13, 2015February 13, 2015
February 13, 2015
 
A1 Test 3 study guide with answers
A1 Test 3 study guide with answersA1 Test 3 study guide with answers
A1 Test 3 study guide with answers
 
Mean deviation (2018)
Mean deviation (2018)Mean deviation (2018)
Mean deviation (2018)
 
Lecture determinants good one
Lecture determinants good oneLecture determinants good one
Lecture determinants good one
 
Interval Notation
Interval NotationInterval Notation
Interval Notation
 
3.2 Measures of variation
3.2 Measures of variation3.2 Measures of variation
3.2 Measures of variation
 
Student t t est
Student t t estStudent t t est
Student t t est
 
Solution of system of linear equations by elimination
Solution of system of linear equations by eliminationSolution of system of linear equations by elimination
Solution of system of linear equations by elimination
 
Elements of a sequence
Elements of a sequenceElements of a sequence
Elements of a sequence
 
Math presentation
Math presentationMath presentation
Math presentation
 

Similar to Sd

Variability
VariabilityVariability
Mean, median, and mode ug
Mean, median, and mode ugMean, median, and mode ug
Mean, median, and mode ug
AbhishekDas15
 
CENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITY
CENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITYCENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITY
CENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITY
SharmaineTuliao1
 
Statistical methods
Statistical methods Statistical methods
Statistical methods
rcm business
 
Ch 6 DISPERSION.doc
Ch 6 DISPERSION.docCh 6 DISPERSION.doc
Ch 6 DISPERSION.doc
AbedurRahman5
 
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersions
Capricorn
 
Chapter one on sampling distributions.ppt
Chapter one on sampling distributions.pptChapter one on sampling distributions.ppt
Chapter one on sampling distributions.ppt
FekaduAman
 
Statistics 3, 4
Statistics 3, 4Statistics 3, 4
Statistics 3, 4
Diana Diana
 
Chapter 7 2022.pdf
Chapter 7 2022.pdfChapter 7 2022.pdf
Chapter 7 2022.pdf
Mohamed Ali
 
Statistical computing2
Statistical computing2Statistical computing2
Statistical computing2
Padma Metta
 
Variance & standard deviation
Variance & standard deviationVariance & standard deviation
Variance & standard deviation
Faisal Hussain
 
Discrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec domsDiscrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec doms
Babasab Patil
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
Shubham Mehta
 
Measures of dispersion by Prof Najeeb Memon BMC lumhs jamshoro
Measures of dispersion by Prof Najeeb Memon BMC lumhs jamshoroMeasures of dispersion by Prof Najeeb Memon BMC lumhs jamshoro
Measures of dispersion by Prof Najeeb Memon BMC lumhs jamshoro
muhammed najeeb
 
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdfUnit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
Ravinandan A P
 
Basic stat review
Basic stat reviewBasic stat review
Basic stat review
julienne Nicole
 
Statistics and Data Mining with Perl Data Language
Statistics and Data Mining with Perl Data LanguageStatistics and Data Mining with Perl Data Language
Statistics and Data Mining with Perl Data Language
maggiexyz
 
DESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdff
DESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdffDESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdff
DESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdff
menaguado
 
Statistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normalStatistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normal
Selvin Hadi
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
Long Beach City College
 

Similar to Sd (20)

Variability
VariabilityVariability
Variability
 
Mean, median, and mode ug
Mean, median, and mode ugMean, median, and mode ug
Mean, median, and mode ug
 
CENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITY
CENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITYCENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITY
CENTRAL LIMIT THEOREM- STATISTICS AND PROBABILITY
 
Statistical methods
Statistical methods Statistical methods
Statistical methods
 
Ch 6 DISPERSION.doc
Ch 6 DISPERSION.docCh 6 DISPERSION.doc
Ch 6 DISPERSION.doc
 
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersions
 
Chapter one on sampling distributions.ppt
Chapter one on sampling distributions.pptChapter one on sampling distributions.ppt
Chapter one on sampling distributions.ppt
 
Statistics 3, 4
Statistics 3, 4Statistics 3, 4
Statistics 3, 4
 
Chapter 7 2022.pdf
Chapter 7 2022.pdfChapter 7 2022.pdf
Chapter 7 2022.pdf
 
Statistical computing2
Statistical computing2Statistical computing2
Statistical computing2
 
Variance & standard deviation
Variance & standard deviationVariance & standard deviation
Variance & standard deviation
 
Discrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec domsDiscrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec doms
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Measures of dispersion by Prof Najeeb Memon BMC lumhs jamshoro
Measures of dispersion by Prof Najeeb Memon BMC lumhs jamshoroMeasures of dispersion by Prof Najeeb Memon BMC lumhs jamshoro
Measures of dispersion by Prof Najeeb Memon BMC lumhs jamshoro
 
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdfUnit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdf
 
Basic stat review
Basic stat reviewBasic stat review
Basic stat review
 
Statistics and Data Mining with Perl Data Language
Statistics and Data Mining with Perl Data LanguageStatistics and Data Mining with Perl Data Language
Statistics and Data Mining with Perl Data Language
 
DESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdff
DESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdffDESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdff
DESCRIPTIVE-STATISTICS.pptxxxxxxcxxxcxdff
 
Statistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normalStatistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normal
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 

More from Antony Raj

Wto
WtoWto
Qualitycontrol
QualitycontrolQualitycontrol
Qualitycontrol
Antony Raj
 
Production management
Production managementProduction management
Production management
Antony Raj
 
Ibe
IbeIbe
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
Antony Raj
 
Banker and customer
Banker and customerBanker and customer
Banker and customer
Antony Raj
 
6sigma
6sigma6sigma
6sigma
Antony Raj
 
Ibe
IbeIbe
Banker and customer
Banker and customerBanker and customer
Banker and customer
Antony Raj
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
Antony Raj
 
Qualitycontrol
QualitycontrolQualitycontrol
Qualitycontrol
Antony Raj
 
Production management
Production managementProduction management
Production management
Antony Raj
 

More from Antony Raj (12)

Wto
WtoWto
Wto
 
Qualitycontrol
QualitycontrolQualitycontrol
Qualitycontrol
 
Production management
Production managementProduction management
Production management
 
Ibe
IbeIbe
Ibe
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Banker and customer
Banker and customerBanker and customer
Banker and customer
 
6sigma
6sigma6sigma
6sigma
 
Ibe
IbeIbe
Ibe
 
Banker and customer
Banker and customerBanker and customer
Banker and customer
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Qualitycontrol
QualitycontrolQualitycontrol
Qualitycontrol
 
Production management
Production managementProduction management
Production management
 

Recently uploaded

一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 

Recently uploaded (20)

一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 

Sd

  • 2. Variance: a measure of how data points differ from the mean • Data Set 1: 3, 5, 7, 10, 10 Data Set 2: 7, 7, 7, 7, 7 What is the mean and median of the above data set? Data Set 1: mean = 7, median = 7 Data Set 2: mean = 7, median = 7 But we know that the two data sets are not identical! The variance shows how they are different. We want to find a way to represent these two data set numerically.
  • 3. How to Calculate? • If we conceptualize the spread of a distribution as the extent to which the values in the distribution differ from the mean and from each other, then a reasonable measure of spread might be the average deviation, or difference, of the values from the mean. ( )x X N  
  • 4. • Although this might seem reasonable, this expression always equals 0, because the negative deviations about the mean always cancel out the positive deviations about the mean. • We could just drop the negative signs, which is the same mathematically as taking the absolute value, which is known as the mean deviations. • The concept of absolute value does not lend itself to the kind of advanced mathematical manipulation necessary for the development of inferential statistical formulas. • The average of the squared deviations about the mean is called the variance.   2 2 x X N       2 2 1 x X s n     For population variance For sample variance
  • 5. X XX XScore X ( )2 1 3 2 5 3 7 4 10 5 10 Totals 35 The mean is 35/5=7.
  • 6. X XX XScore X ( )2 1 3 3-7=-4 2 5 5-7=-2 3 7 7-7=0 4 10 10-7=3 5 10 10-7=3 Totals 35
  • 7. X XX XScore X ( )2 1 3 3-7=-4 16 2 5 5-7=-2 4 3 7 7-7=0 0 4 10 10-7=3 9 5 10 10-7=3 9 Totals 35 38
  • 8. X XX XScore X ( )2 1 3 3-7=-4 16 2 5 5-7=-2 4 3 7 7-7=0 0 4 10 10-7=3 9 5 10 10-7=3 9 Totals 35 38   2 2 38 7.6 5 x X s n     
  • 9. Example 2 Dive Mark Myrna 1 28 27 2 22 27 3 21 28 4 26 6 5 18 27 Find the mean, median, mode, range? mean 23 23 median 22 27 range 10 22 What can be said about this data? Due to the outlier, the median is more typical of overall performance. Which diver was more consistent?
  • 10. X X X XDive Mark's Score X ( )2 1 28 5 25 2 22 -1 1 3 21 -2 4 4 26 3 9 5 18 -5 25 Totals 115 0 64 Mark’s Variance = 64 / 5 = 12.8 Myrna’s Variance = 362 / 5 = 72.4 Conclusion: Mark has a lower variance therefore he is more consistent.
  • 11. standard deviation - a measure of variation of scores about the mean • Can think of standard deviation as the average distance to the mean, although that's not numerically accurate, it's conceptually helpful. All ways of saying the same thing: higher standard deviation indicates higher spread, less consistency, and less clustering. • sample standard deviation: • population standard deviation:   2 1 x X s n       2 x N     
  • 12. Another formula • Definitional formula for variance for data in a frequency distribution • Definitional formula for standard deviation for data in a frequency distribution 2 2 ( )X X f S f     2 ( )X X f S f    
  • 13. X X X X X XMyrna’s Score X f ( )2 ( )2 x f 28 1 27 3 6 1 115 5 The mean is 23
  • 14. X X X X X XMyrna’s Score X f ( )2 ( )2 x f 28 1 5 27 3 4 6 1 -17 115 5
  • 15. X X X X X XMyrna’s Score X f ( )2 ( )2 x f 28 1 5 25 27 3 4 16 6 1 -17 289 115 5
  • 16. X X X X X XMyrna’s Score X f ( )2 ( )2 x f 28 1 5 25 25 27 3 4 16 48 6 1 -17 289 289 115 5 362 Variance = S2 = 362 / 5 = 72.4 Standard Deviation = 72.4 = 8.5 round-off rule – carry one more decimal place than was present in the original data
  • 17. Bell shaped curve • empirical rule for data (68-95-99) - only applies to a set of data having a distribution that is approximately bell-shaped: (figure pg 220) •  68% of all scores fall with 1 standard deviation of the mean •  95% of all scores fall with 2 standard deviation of the mean •  99.7% of all scores fall with 3 standard deviation of the mean