Central Tendency & Dispersion
• Types of Distributions: Normal, Skewed
• Central Tendency: Mean, Median, Mode
• Dispersion: Variance, Standard Deviation
This PowerPoint has been ripped off from I don’t know where,
and improved upon by yours truly… Mrs. T :) Enjoy!
DESCRIPTIVE STATISTICS
are concerned with describing the
characteristics of frequency distributions:
• Where is the center?
• What is the range?
• What is the shape [of the distribution]?
Frequency Table: Test Scores
Observation (score)    Frequency (# occurrences)
65                     1
70                     2
75                     3
80                     4
85                     3
90                     2
95                     1
What is the range of test scores?
A: 30 (95 minus 65)
When calculating mean, one
must divide by what number?
A: 16 (total # occurrences)
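The two answers above can be checked with a short Python sketch. The dictionary `freq` is just a hypothetical way of holding the slide's frequency table; the mean it prints is not stated on the slide and is computed here only as an illustration of dividing by the total number of occurrences.

```python
# Frequency table from the slide: score -> number of occurrences
freq = {65: 1, 70: 2, 75: 3, 80: 4, 85: 3, 90: 2, 95: 1}

n = sum(freq.values())                 # total number of observations
score_range = max(freq) - min(freq)    # highest score minus lowest score
# Each score counts once per occurrence, so weight it by its frequency
mean = sum(score * count for score, count in freq.items()) / n

print(n, score_range, mean)
```

Note that the divisor is 16 (the number of observations), not 7 (the number of distinct scores).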
Frequency Distributions
[Figure: bar chart of the test scores above. x-axis: Test Score (65 to 95); y-axis: Frequency (# occurrences, 1 to 4)]
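The chart on this slide can be roughly redrawn as text from the frequency table. This is only a sketch: `text_histogram` is a hypothetical helper, and the bars run sideways rather than upward as in the original figure.

```python
freq = {65: 1, 70: 2, 75: 3, 80: 4, 85: 3, 90: 2, 95: 1}

def text_histogram(freq):
    """Return one line per score, with one '#' per occurrence."""
    return [f"{score:>3} | {'#' * count}" for score, count in sorted(freq.items())]

for line in text_histogram(freq):
    print(line)
```

The bars rise to a single peak at 80 and fall off evenly on both sides, which is the symmetric shape the next slides call normal.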
Normally Distributed Curve
Voter Turnout in 50 States - 1980
Skewed Distributions
We say the distribution is skewed
to the left ← (when the “tail” is
to the left)
We say the distribution is skewed
to the right → (when the “tail” is
to the right)
Voter Turnout in 50 States - 1940
Q: Is this distribution positively or negatively skewed?
A: Negatively
Q: Would we say this distribution is skewed to the left or right?
A: Left (skewed in direction of tail)
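The link between "negative" and "left" can be seen numerically. The sketch below uses one common measure, the moment coefficient of skewness, which is not named in the slides; the two data sets are made up for illustration and are not the voter turnout data.

```python
def skewness(data):
    """Moment coefficient of skewness: m3 / m2**1.5, using population moments."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n   # average squared deviation
    m3 = sum((x - mean) ** 3 for x in data) / n   # average cubed deviation
    return m3 / m2 ** 1.5

left_tailed = [1, 7, 8, 8, 9, 9, 9, 10]        # hypothetical: one very low value
right_tailed = [11 - x for x in left_tailed]   # mirror image of the same data

print(skewness(left_tailed))    # negative: tail to the left
print(skewness(right_tailed))   # positive: tail to the right
```

Cubing keeps the sign of each deviation, so a long tail below the mean drags the result negative.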
Characteristics - Normal Distribution
• It is symmetrical: half the values are to one side of the center (mean), and half the values are on the other side.
• The distribution is single-peaked, not bimodal or multimodal.
• Most of the data values are “bunched” near the center portion of the curve. As values become more extreme they become less frequent; the few “outliers” are found at the “tails” of the distribution.
• The Mean, Median, and Mode are the same in a perfectly symmetrical normal distribution.
• The percentage of values that fall within one, two, or three standard deviations of the mean can be estimated using the Empirical Rule.
Empirical Rule
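The Empirical Rule is usually stated as: in a normal distribution, about 68% of values lie within one standard deviation of the mean, about 95% within two, and about 99.7% within three. The diagram for this slide is not reproduced here, so the sketch below checks those percentages from the normal curve itself. The helper name `fraction_within` is hypothetical.

```python
import math

def fraction_within(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))

for k in (1, 2, 3):
    print(k, round(100 * fraction_within(k), 1))
```

The rule applies only when the data is approximately normal; it says nothing reliable about a skewed distribution.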
Summarizing Distributions
Two key characteristics of a frequency distribution
are especially important when summarizing data
or when making a prediction:
• CENTRAL TENDENCY
  • What is in the “middle”?
  • What is most common?
  • What would we use to predict?
• DISPERSION
  • How spread out is the distribution?
  • What shape is it?
The MEASURES of Central Tendency
• 3 measures of central tendency are commonly used in statistical analysis: MEAN, MEDIAN, and MODE.
• Each measure is designed to represent a “typical” value in the distribution.
• The choice of which measure to use depends on the shape of the distribution (whether normal or skewed).
Mean - Average
• Most common measure of central tendency.
• Is sensitive to the influence of a few extreme values (outliers), thus it is not always the most appropriate measure of central tendency.
• Best used for making predictions when a distribution is more or less normal (or symmetrical).
• Symbolized as:
  • x̄ for the mean of a sample
  • μ for the mean of a population
Finding the Mean
• Formula for the mean: $\bar{X} = \frac{\sum X}{N}$
• Given the data set {3, 5, 10, 4, 3}:
  $\bar{X} = \frac{3 + 5 + 10 + 4 + 3}{5} = \frac{25}{5} = 5$
Find the Mean
Q: 85, 87, 89, 91, 98, 100
A: 91.67
Median: 90
Q: 5, 87, 89, 91, 98, 100
A: 78.3 (Extremely low score lowered the Mean)
Median: 90 (The median remained unchanged.)
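These two answers can be reproduced with Python's standard `statistics` module. The variable names are illustrative only.

```python
from statistics import mean, median

scores = [85, 87, 89, 91, 98, 100]
with_outlier = [5, 87, 89, 91, 98, 100]   # same scores, but the 85 became a 5

# The mean moves a lot when one score is extreme; the median does not move at all
print(round(mean(scores), 2), median(scores))
print(round(mean(with_outlier), 2), median(with_outlier))
```

One extreme score pulls the mean down by more than 13 points, while the median stays at 90 because the two middle values (89 and 91) are unchanged.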
Median
• Used to find the middle value (center) of a distribution.
• Used when one must determine whether the data values fall into the upper 50% or the lower 50% of a distribution.
• Used when one needs to report the typical value of a data set, ignoring the outliers (few extreme values in a data set).
  • Example: median salary, median home prices in a market
• Is a better indicator of central tendency than the mean when one has a skewed distribution.
To compute the median
• First, order the values of X from low to high:
  85, 90, 94, 94, 95, 97, 97, 97, 97, 98
• Then count the number of observations: N = 10.
• When the number of observations is even, average the two middle values to calculate the median. (When it is odd, the median is the single middle value.)
• In this example, the two middle values are 95 and 97, so the median (middle) score is (95 + 97) / 2 = 96.
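The procedure above can be written out step by step. This is a minimal sketch with a hand-written `median` function (Python's `statistics.median` does the same job); the scores are deliberately given out of order to show that sorting comes first.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if N is even."""
    ordered = sorted(values)          # step 1: order from low to high
    n = len(ordered)                  # step 2: count the observations
    mid = n // 2
    if n % 2 == 1:                    # odd N: a single middle value
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2   # even N: average the middle pair

scores = [97, 85, 98, 94, 97, 90, 95, 97, 94, 97]
print(median(scores))
```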
Median
• Find the Median: 4 5 6 6 7 8 9 10 12
• Find the Median: 5 6 6 7 8 9 10 12
• Find the Median: 5 6 6 7 8 9 10 100,000
Mode
• Used when the most typical (common) value is desired.
• Often used with categorical data.
• The mode is not always unique. A distribution can have no mode, one mode, or more than one mode. When there are two modes, we say the distribution is bimodal.
EXAMPLES:
a) {1,0,5,9,12,8} - no mode
b) {4,5,5,5,9,20,30} - mode = 5
c) {2,2,5,9,9,15} - bimodal, modes 2 and 9
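The three examples can be checked with a small counting sketch. The function `modes` is hypothetical and follows the slide's convention that a data set in which every value occurs exactly once has no mode (other references and libraries may instead report every value as a mode in that case).

```python
from collections import Counter

def modes(data):
    """Return the most frequent values, or [] if every value occurs only once."""
    counts = Counter(data)
    top = max(counts.values())        # highest frequency in the data
    if top == 1:                      # nothing repeats: no mode (slide's convention)
        return []
    return sorted(value for value, count in counts.items() if count == top)

print(modes([1, 0, 5, 9, 12, 8]))
print(modes([4, 5, 5, 5, 9, 20, 30]))
print(modes([2, 2, 5, 9, 9, 15]))
```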
Measures of Variability
• Central Tendency doesn’t tell us everything. Dispersion/Deviation/Spread tells us a lot about how the data values are distributed.
• We are most interested in: Standard Deviation (σ) and Variance (σ²)
Why can’t the mean tell us everything?
• Mean describes the average outcome.
• The question becomes: how good a representation of the distribution is the mean? How good is the mean as a description of central tendency -- or how accurate is the mean as a predictor?
• ANSWER -- it depends on the shape of the distribution. Is the distribution normal or skewed?
Dispersion
• Once you determine that the data of interest is normally distributed, ideally by producing a histogram of the values, the next question to ask is: How spread out are the values about the mean?
• Dispersion is a key concept in statistical thinking.
• The basic question being asked is: how much do the values deviate from the Mean? The more “bunched up” the values are around the mean, the better your ability to make accurate predictions.
Means
• Consider these two data sets of hours worked each day:

X = {7, 8, 6, 7, 7, 6, 8, 7}
X̄ = (7+8+6+7+7+6+8+7)/8
X̄ = 7
Notice that all the data values are bunched near the mean. Thus, 7 would be a pretty good prediction of the hours worked each day.

X = {12, 2, 0, 14, 10, 9, 5, 4}
X̄ = (12+2+0+14+10+9+5+4)/8
X̄ = 7
The mean is the same for this data set, but the data values are more spread out. So, 7 is not a good prediction of the hours worked on any given day.
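A quick sketch makes the contrast concrete. The names `steady` and `erratic` are made up for the two data sets above, and the two spread figures used here (the range and the largest distance from the mean) are just simple stand-ins until proper measures of variability are defined.

```python
steady = [7, 8, 6, 7, 7, 6, 8, 7]
erratic = [12, 2, 0, 14, 10, 9, 5, 4]

def summary(hours):
    """Return (mean, range, largest distance of any value from the mean)."""
    mean = sum(hours) / len(hours)
    spread = max(hours) - min(hours)
    worst_miss = max(abs(h - mean) for h in hours)
    return mean, spread, worst_miss

print(summary(steady))
print(summary(erratic))
```

Both sets have a mean of 7, but predicting "7 hours" is never off by more than 1 hour for the first set and can be off by as much as 7 hours for the second.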
Data is more spread out, meaning it has greater variability.
Below, the data is grouped closer to the center, less spread out,
or smaller variability.
• How well does the mean represent the values in a distribution?
• The logic here is to determine how much spread is in the values. How much do the values "deviate" from the mean? Think of the mean as the true value, or as your best guess. If every X were very close to the Mean, the Mean would be a very good predictor.
• If the distribution is very sharply peaked then the mean is a good measure of central tendency, and if you were to use the Mean to make predictions you would be correct or very close much of the time.
What if scores are widely
distributed?
The mean is still your best measure and your
best predictor, but your predictive power
would be less.
How do we describe this?
• Measures of variability
  • Mean Absolute Deviation (you used this in Math 1)
  • Variance (we use this in Math 2)
  • Standard Deviation (we use this in Math 2)
Mean Absolute Deviation
The key concept for describing normal distributions
and making predictions from them is called
deviation from the mean.
We could just calculate the average distance between
each observation and the mean.
• We must take the absolute value of each distance, otherwise the deviations would just cancel out to zero!
Formula: $\frac{\sum |X_i - \bar{X}|}{n}$
Mean Absolute Deviation:
An Example
Data: X = {6, 10, 5, 4, 9, 8}
1. Compute X̄ (the average): X̄ = 42 / 6 = 7
2. Compute X̄ - Xi for each value and take the absolute value to get the absolute deviations
3. Sum the absolute deviations
4. Divide the sum of the absolute deviations by N

X̄ - Xi     Abs. Dev.
7 - 6       1
7 - 10      3
7 - 5       2
7 - 4       3
7 - 9       2
7 - 8       1
Total:      12

Mean absolute deviation: 12 / 6 = 2
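The same four steps can be followed in a few lines of Python. This is only a sketch of the worked example; it also prints the sum of the signed deviations to show why the absolute value is needed.

```python
data = [6, 10, 5, 4, 9, 8]
mean = sum(data) / len(data)            # step 1: 42 / 6

signed = [x - mean for x in data]       # deviations with their signs kept
absolute = [abs(d) for d in signed]     # step 2: absolute deviations

print(sum(signed))                      # signed deviations cancel out
print(sum(absolute))                    # step 3: sum of absolute deviations
print(sum(absolute) / len(data))        # step 4: mean absolute deviation
```

The code subtracts the mean from each value rather than the other way round, as the table does; the sign differs but the absolute deviations are identical.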
What Does it Mean?
• On average, each value is two units away from the mean.
Is it Really that Easy?
• No!
• Absolute values are difficult to manipulate algebraically.
• Absolute values cause problems for calculus (the absolute value function is not differentiable at zero).
• We need something else…
Variance and Standard Deviation
• Instead of taking the absolute value, we square the deviations from the mean. This yields a non-negative value for every deviation.
• This will result in measures we call the Variance and the Standard Deviation.

                      Sample    Population
Standard Deviation    s         σ
Variance              s²        σ²
Calculating the Variance and/or
Standard Deviation
Formulae:
Variance: $s^2 = \frac{\sum (X_i - \bar{X})^2}{N}$
Standard Deviation: $s = \sqrt{\frac{\sum (X_i - \bar{X})^2}{N}}$
Examples Follow . . .
Example:
Data: X = {6, 10, 5, 4, 9, 8}; N = 6

X         X - X̄     (X - X̄)²
6         -1         1
10        3          9
5         -2         4
4         -3         9
9         2          4
8         1          1
Total: 42            Total: 28

Mean: $\bar{X} = \frac{\sum X}{N} = \frac{42}{6} = 7$
Variance: $s^2 = \frac{\sum (X - \bar{X})^2}{N} = \frac{28}{6} = 4.67$
Standard Deviation: $s = \sqrt{s^2} = \sqrt{4.67} = 2.16$
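The example can be reproduced in Python as a check on the arithmetic. This sketch follows the slide's formula, which divides by N. As a side note not covered in the slides, Python's `statistics.variance` divides by N - 1 (the usual convention for a sample), while `statistics.pvariance` divides by N, so only the latter matches the slide.

```python
import math
from statistics import pvariance, variance

data = [6, 10, 5, 4, 9, 8]
n = len(data)
mean = sum(data) / n                              # 42 / 6
squared_devs = [(x - mean) ** 2 for x in data]    # 1, 9, 4, 9, 4, 1

var_n = sum(squared_devs) / n      # divide by N, as on the slide
sd_n = math.sqrt(var_n)            # standard deviation is the square root

print(round(var_n, 2), round(sd_n, 2))
print(pvariance(data))             # library version that also divides by N
print(variance(data))              # library version that divides by N - 1
```

Taking the square root brings the result back to the original units (here, the same units as the data), which is why the standard deviation is usually the figure reported.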