SlideShare a Scribd company logo
1 of 79
DESCRIPTIVE
STATISTICS
13/07/2018 Descriptive Statistics
Describing data with numbers:
measures of location
1
TYPES OF STATISTICS
Types of statistics:
 descriptive (which summarize some characteristic
of a sample)
• Measures of central tendency
• Measures of distribution
• Measures of skewness
 inferential (which test for significant differences
between groups and/or significant relationships
among variables within the sample
 t-ratio, chi-square, beta-value
13/07/2018 Descriptive Statistics 2
WHAT TO DESCRIBE?
 What is the “location” or “center” of the data?
(“measures of location”)
 How do the data vary? (“measures of variability”)
13/07/2018 Descriptive Statistics 3
DESCRIPTIVE STATISTICS
Class A--IQs of 13
Students
102 115
128 109
131 89
98 106
140 119
93 97
110
Class B--IQs of 13
Students
127 162
131 103
96 111
80 109
93 87
120 105
109
An Illustration : Which Group is Smarter?
Each individual may be different. If you try to understand a group by remembering
the qualities of each member, you become overwhelmed and fail to understand the
group.13/07/2018 Descriptive Statistics 4
DESCRIPTIVE STATISTICS
Which group is smarter now?
Class A--Average IQ Class B--Average IQ
110.54 110.23
They’re roughly the same!
With a summary descriptive statistic, it is
much easier to answer our question.
13/07/2018 Descriptive Statistics 5
TYPES OF DESCRIPTIVE
STATISTICS
Organize Data
Tables
Graphs
Summarize Data
Central Tendency
Variation
13/07/2018 Descriptive Statistics 6
DESCRIPTIVE STATISTICS
Types of descriptive statistics:
Organize Data
Tables
 Frequency Distributions
 Relative Frequency Distributions
Graphs
 Bar Chart or Histogram
 Stem and Leaf Plot
 Frequency Polygon
13/07/2018 Descriptive Statistics 7
DESCRIPTIVE STATISTICS
Summarizing Data:
Central Tendency (or Groups’ “Middle
Values”)
 Mean
 Median
 Mode
Variation (or Summary of Differences
Within Groups)
 Range
 Interquartile Range
 Variance
 Standard Deviation
13/07/2018 9Descriptive Statistics
CENTRAL TENDENCY
INDICATORS OF CENTRAL
TENDENCY
Mode
Most Frequently Occurring Score
Median
Middle Score
Mean
Arithmetic Average, etc.
13/07/2018 Descriptive Statistics 10
INDICATORS OF CENTRAL
TENDENCY
Mode
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
= 15 k.y-1
•Quick and easy to compute
•Unaffected by extreme scores
•Can be used at any level of measurement.
Advantages
13/07/2018 Descriptive Statistics 11
INDICATORS OF CENTRAL
TENDENCY
Mode
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
= 15 k.y-1
•Terminal Statistic
• A given sub-group could make
this measure unrepresentative.
Disadvantages
13/07/2018 Descriptive Statistics 12
INDICATORS OF CENTRAL
TENDENCY
Median
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
50th Percentile = n + 1
2
13/07/2018 Descriptive Statistics 13
INDICATORS OF CENTRAL
TENDENCY
Median
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
For an even number of scores take
the mean of the middle two:
(19 + 20)
2 = 19.5
= 19.5 k.y-1
13/07/2018 Descriptive Statistics 14
INDICATORS OF CENTRAL
TENDENCY
Median
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
•Unaffected by extreme scores
•Can be used at all levels above nominal.
Advantages
= 19.5 k.y-1
13/07/2018 Descriptive Statistics 15
INDICATORS OF CENTRAL
TENDENCY
Median
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
•Only considers order- value ignored.
Disadvantages
= 19.5 k.y-1
13/07/2018 Descriptive Statistics 16
INDICATORS OF CENTRAL
TENDENCY
Douglas Bag drove 100 miles from Bath to
London at 100 miles∙h-1 but was caught in
traffic on his return journey, so was limited to
50 miles∙h-1.
What was his average speed for the entire journey?
Mean
13/07/2018 Descriptive Statistics 17
INDICATORS OF CENTRAL
TENDENCY
-Arithmetic Mean
-Harmonic Mean
-Geometric Mean
also.. -f mean
-Truncated mean
-Power mean
-Weighted arithmetic mean
-Chisini mean
-Identric mean, etc, etc…
13/07/2018 Descriptive Statistics 18
INDICATORS OF CENTRAL
TENDENCY
Mean
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
X =
∑ X
n
13/07/2018 Descriptive Statistics 19
INDICATORS OF CENTRAL
TENDENCY
Mean
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
X =
(10+11+11+15+15+15+19+20+21+21+22+22+24+25)
14
251
= 17.9 k.y-1
13/07/2018 Descriptive Statistics 20
INDICATORS OF CENTRAL
TENDENCY
Mean
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
= 17.9 k.y-1
•Very sensitive measure
•Takes into account all the available information
•Can be combined with means of other groups to give the overall mean.
Advantages
13/07/2018 Descriptive Statistics 21
INDICATORS OF CENTRAL
TENDENCY
Mean
Annual
Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k
= 17.9 k.y-1
•Very sensitive measure
•Can only be used on interval or ratio data
Disadvantages
13/07/2018 Descriptive Statistics 22
13/07/2018 23Descriptive Statistics
FREQUENCY
DISTRIBUTIONS
AND THEIR GRAPHS
Upper Class
Limits
Class Frequency, f
1 – 4 4
5 – 8 5
9 – 12 3
13 – 16 4
17 – 20 2
FREQUENCY DISTRIBUTIONS
A frequency distribution is a table that shows classes or
intervals of data with a count of the number in each
class. The frequency f of a class is the number of data
points in the class.
Frequencies
Lower Class
Limits
CONSTRUCTING A FREQUENCY
DISTRIBUTION
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Example:
The following data represents the ages of 30 students in a statistics
class. Construct a frequency distribution that has five classes.
Continued.
Ages of Students
CONSTRUCTING A FREQUENCY
DISTRIBUTION
Example continued:
250 – 57
342 – 49
434 – 41
826 – 33
1318 – 25
Tally Frequency, fClass
30f 
Number of
studentsAges
Check that
the sum equals
the number in
the sample.
Ages of Students
RELATIVE FREQUENCY
Example:
Find the relative frequencies for the “Ages of Students” frequency distribution.
50 – 57 2
3
4
8
13
42 – 49
34 – 41
26 – 33
18 – 25
Frequency, fClass
30f 
Relative
Frequency
0.067
0.1
0.133
0.267
0.433
f
n
13
30

0.433
1
f
n
 
Portion of
students
CUMULATIVE FREQUENCY
The cumulative frequency of a class is the sum of the frequency for
that class and all the previous classes.
30
28
25
21
13
Total number
of students
+
+
+
+50 – 57 2
3
4
8
13
42 – 49
34 – 41
26 – 33
18 – 25
Frequency, fClass
30f 
Cumulative
Frequency
Ages of Students
FREQUENCY HISTOGRAM
Example:
Draw a frequency histogram for the “Ages of Students”
frequency distribution. Use the class boundaries.
2
3
4
8
13
Broken axis
Ages of Students
10
8
6
4
2
0
Age (in years)
f
12
14
17.5 25.5 33.5 41.5 49.5 57.5
FREQUENCY POLYGON
A frequency polygon is a line graph that emphasizes the
continuous change in frequencies.
Broken axis
Ages of Students
10
8
6
4
2
0
Age (in years)
f
12
14
13.5 21.5 29.5 37.5 45.5 53.5 61.5
Midpoints
Line is extended
to the x-axis.
RELATIVE FREQUENCY HISTOGRAM
A relative frequency histogram has the same
shape and the same horizontal scale as the
corresponding frequency histogram.
0.4
0.3
0.2
0.1
0.5
Ages of Students
0
Age (in years)
Relativefrequency
(portionofstudents)
17.5 25.5 33.5 41.5 49.5 57.5
0.433
0.267
0.133
0.1
0.067
CUMULATIVE FREQUENCY GRAPH
A cumulative frequency graph or ogive, is a line
graph that displays the cumulative frequency of
each class at its upper class boundary.
17.5
Age (in years)
Ages of Students
24
18
12
6
30
0
Cumulativefrequency
(portionofstudents)
25.5 33.5 41.5 49.5 57.5
The graph ends
at the upper
boundary of
the last class.
STEM-AND-LEAF PLOT
In a stem-and-leaf plot, each number is separated into a stem (usually
the entry’s leftmost digits) and a leaf (usually the rightmost digit).
This is an example of exploratory data analysis.
Example:
The following data represents the ages of 30 students in a statistics class.
Display the data in a stem-and-leaf plot.
Ages of Students
Continued.
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
STEM-AND-LEAF PLOT
Ages of Students
1
2
3
4
5
8 8 8 9 9 9
0 0 1 1 1 2 4 7 9 9
0 0 2 2 3 4 7 8 9
4 6 9
1 4
Key: 1|8 = 18
This graph allows us to
see the shape of the
data as well as the
actual values.
Most of the values lie between 20 and
39.
DOT PLOT
In a dot plot, each data entry is plotted, using a point, above a
horizontal axis.
Example:
Use a dot plot to display the ages of the 30 students in the statistics class.
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Ages of Students
Continued.
DOT PLOT
Ages of Students
15 18 2
4
4
5
4
8
21 513
0
5
4
3
9
4
2
3
3
3
6
2
7
5
7
From this graph, we can conclude that most of the values lie between
18 and 32.
PIE CHART
A pie chart is a circle that is divided into sectors that represent
categories. The area of each sector is proportional to the frequency of
each category.
Accidental Deaths in the USA in 2002
(Source: US Dept.
of Transportation) Continued.
Type Frequency
Motor Vehicle 43,500
Falls 12,200
Poison 6,400
Drowning 4,600
Fire 4,200
Ingestion of Food/Object 2,900
Firearms 1,400
PIE CHART
To create a pie chart for the data, find the relative frequency
(percent) of each category.
Continued.
Type Frequency
Relative
Frequency
Motor Vehicle 43,500 0.578
Falls 12,200 0.162
Poison 6,400 0.085
Drowning 4,600 0.061
Fire 4,200 0.056
Ingestion of Food/Object 2,900 0.039
Firearms 1,400 0.019
n = 75,200
PIE CHART
Firearms
1.9%
Motor
vehicles
57.8%
Poison
8.5%
Falls
16.2%
Drowning
6.1%
Fire
5.6%
Ingestion
3.9%
Pareto Chart
A Pareto chart is a vertical bar graph is which the height
of each bar represents the frequency. The bars are placed
in order of decreasing height, with the tallest bar to the
left. Accidental Deaths in the USA in 2002
(Source: US Dept.
of Transportation) Continued.
Type Frequency
Motor Vehicle 43,500
Falls 12,200
Poison 6,400
Drowning 4,600
Fire 4,200
Ingestion of Food/Object 2,900
Firearms 1,400
Pareto Chart
Accidental Deaths
5000
10000
35000
40000
45000
30000
25000
20000
15000
Poison
Poison DrowningFalls
Motor
Vehicles
Fire Firearms
Ingestion of
Food/Object
SCATTER PLOT
When each entry in one data set corresponds to an entry in
another data set, the sets are called paired data sets.
In a scatter plot, the ordered pairs are graphed as points
in a coordinate plane. The scatter plot is used to show
the relationship between two quantitative variables.
The following scatter plot represents the relationship
between the number of absences from a class during the
semester and the final grade.
Continued.
SCATTER PLOT
Absences Grade
x
8
2
5
12
15
9
6
y
78
92
90
58
43
74
81
Final
grade
(y)
0 2 4 6 8 10 12 14 16
40
50
60
70
80
90
Absences (x)
100
From the scatter plot, you can see that as the number of absences increases,
the final grade tends to decrease.
TIMES SERIES CHART
A data set that is composed of quantitative data entries taken at
regular intervals over a period of time is a time series. A time series
chart is used to graph a time series.
Example:
The following table lists
the number of minutes
Robert used on his cell
phone for the last six
months.
Continued.
Month Minutes
January 236
February 242
March 188
April 175
May 199
June 135
Construct a time series chart
for the number of minutes
used.
TIMES SERIES CHART
Robert’s Cell Phone Usage
200
150
100
50
250
0
Minutes
Month
Jan Feb Mar Apr May June
13/07/2018 46Descriptive Statistics
FREQUENCY &
TENDENCY
MEAN OF A FREQUENCY
DISTRIBUTION
The mean of a frequency distribution for a sample is approximated by
where x and f are the midpoints and frequencies of the classes.
No
( )
te that
x fx n f
n
 
  
Example:
The following frequency distribution represents the ages of 30 students in a
statistics class. Find the mean of the frequency distribution.
Continued.
MEAN OF A FREQUENCY
DISTRIBUTION
Class x f (x · f )
18 – 25 21.5 13 279.5
26 – 33 29.5 8 236.0
34 – 41 37.5 4 150.0
42 – 49 45.5 3 136.5
50 – 57 53.5 2 107.0
n = 30 Σ = 909.0
Class midpoint
( )x fx
n
 

The mean age of the students is 30.3 years.
909
30
 30.3
SHAPES OF DISTRIBUTIONS
A frequency distribution is symmetric when a vertical line
can be drawn through the middle of a graph of the
distribution and the resulting halves are approximately the
mirror images.
A frequency distribution is uniform (or rectangular) when all
entries, or classes, in the distribution have equal
frequencies. A uniform distribution is also symmetric.
A frequency distribution is skewed if the “tail” of the graph
elongates more to one side than to the other. A distribution
is skewed left (negatively skewed) if its tail extends to the
left. A distribution is skewed right (positively skewed) if its
tail extends to the right.
SYMMETRIC DISTRIBUTION
mean = median = mode
= $25,000
15,000
20,000
22,000
24,000
25,000
25,000
26,000
28,000
30,000
35,000
10 Annual Incomes
Income5
4
3
2
1
0
f
$25000
SKEWED LEFT DISTRIBUTION
mean = $23,500
median = mode = $25,000 Mean < Median
0
20,000
22,000
24,000
25,000
25,000
26,000
28,000
30,000
35,000
10 Annual Incomes
Income
5
4
3
2
1
0
f
$25000
SKEWED RIGHT DISTRIBUTION
mean = $121,500
median = mode = $25,000 Mean > Median
15,000
20,000
22,000
24,000
25,000
25,000
26,000
28,000
30,000
1,000,000
10 Annual Incomes
Income
5
4
3
2
1
0
f
$25000
UniformSymmetric
Skewed right Skewed left
Mean > Median Mean < Median
SUMMARY OF SHAPES OF
DISTRIBUTIONS
Mean = Median
The most appropriate
measure of location
depends on …
the shape of the data’s
distribution.
MOST APPROPRIATE
MEASURE OF LOCATION
Depends on whether or not data are
“symmetric” or “skewed”.
Depends on whether or not data have
one (“unimodal”) or more
(“multimodal”) modes.
MEDIAN
If the recorded values for a variable form a symmetric
distribution, the median and mean are identical.
In skewed data, the mean lies further toward the skew
than the median.
Mean
Median
Mean
Median
Symmetric Skewed
MODE
1. It may give you the most likely experience
rather than the “typical” or “central”
experience.
2. In symmetric distributions, the mean, median,
and mode are the same.
3. In skewed data, the mean and median lie
further toward the skew than the mode.
Median
Mean
Median MeanMode Mode
Symmetric Skewed
SKEWNESS OF DISTRIBUTIONS
 Measures look at how lopsided distributions are—
how far from the ideal of the normal curve they are
 When the median and the mean are different, the
distribution is skewed. The greater the
difference, the greater the skew.
 If the skewness is extreme, the researcher should
either transform the data to make them better
resemble a normal curve or else use a different set
of statistics—nonparametric statistics—to carry
out the analysis
13/07/2018 61Descriptive Statistics
VARIABILITY
MEASUREMENT
METHODS OF VARIABILITY
MEASUREMENT
Commonly used methods: range, variance, standard
deviation, interquartile range, coefficient of variation etc.
Variability (or dispersion) measures the amount of scatter
in a dataset.
RANGE
The range of a data set is the difference between the maximum and
minimum date entries in the set.
Range = (Maximum data entry) – (Minimum data entry)
Example:
The following data are the closing prices for a certain stock on ten successive
Fridays. Find the range.
Stock 56 56 57 58 61 63 63 67 67 67
The range is 67 – 56 = 11.
RANGE
Source: www.animatedsoftware.com/ statglos/sgrange.htm
DEVIATION
The deviation of an entry x in a population data set is the difference
between the entry and the mean μ of the data set.
Deviation of x = x – μ
Example:
The following data are the closing
prices for a certain stock on five
successive Fridays. Find the
deviation of each price.
The mean stock price is
μ = 305/5 = 61.
67 – 61 = 6
Σ(x – μ) = 0
63 – 61 = 2
61 – 61 = 0
58 – 61 = – 3
56 – 61 = – 5
Deviation
x – μ
Σx =
305
67
63
61
58
56
Stock
x
Variance (S2)
Average of squared distances of individual
points from the mean
sample variance
High variance means that most scores are
far away from the mean. Low variance
indicates that most scores cluster tightly
about the mean.
ESTIMATES OF
DISPERSION
VARIANCE AND STANDARD
DEVIATION
The population variance of a population data set of N entries is
Population variance =
2
2 ( )
.
x μ
N
  

“sigma
squared”
The population standard deviation of a population data set of N entries is the
square root of the population variance.
Population standard deviation =
2
2 ( )
.
x μ
N
   
 
“sigma”
FINDING THE POPULATION
STANDARD DEVIATION
Guidelines
In Words In Symbols
1. Find the mean of the population
data set.
2. Find the deviation of each entry.
3. Square each deviation.
4. Add to get the sum of squares.
5. Divide by N to get the population
variance.
6. Find the square root of the
variance to get the population
standard deviation.
xμ
N


x μ
 2
x μ
 2
xSS x μ  
 2
2 x μ
N

 

 2
x μ
N

 

FINDING THE SAMPLE STANDARD
DEVIATION
Guidelines
In Words In Symbols
1. Find the mean of the sample data
set.
2. Find the deviation of each entry.
3. Square each deviation.
4. Add to get the sum of squares.
5. Divide by n – 1 to get the sample
variance.
6. Find the square root of the
variance to get the sample
standard deviation.
xx
n


x x
 2
x x
 2
xSS x x  
 2
2
1
x x
s
n
 


 2
1
x x
s
n
 


INTERPRETING STANDARD
DEVIATION
When interpreting standard deviation, remember that is a
measure of the typical amount an entry deviates from the
mean. The more the entries are spread out, the greater
the standard deviation.
10
8
6
4
2
0
Data value
Frequency
12
14
2 4 6
= 4
s = 1.18
x
10
8
6
4
2
0
Data value
Frequency
12
14
2 4 6
= 4
s = 0
x
13/07/2018 71Descriptive Statistics
MEASURES OF
POSITION
QUARTILES
The three quartiles, Q1, Q2, and Q3, approximately divide an ordered data
set into four equal parts.
Median
0 5025 10075
Q3Q2Q1
Q1 is the median of the data
below Q2.
Q3 is the median of the
data above Q2.
FINDING QUARTILES
Example:
The quiz scores for 15 students is listed below. Find the first, second and third quartiles
of the scores.
28 43 48 51 43 30 55 44 48 33 45 37 37 42 38
Order the data.
28 30 33 37 37 38 42 43 43 44 45 48 48 51 55
Lower half Upper half
Q2Q1 Q3
About one fourth of the students scores 37 or less; about one
half score 43 or less; and about three fourths score 48 or
less.
INTERQUARTILE RANGE
The interquartile range (IQR) of a data set is the difference between the third and first
quartiles.
Interquartile range (IQR) = Q3 – Q1.
Example:
The quartiles for 15 quiz scores are listed below. Find the interquartile range.
(IQR) = Q3 – Q1
Q2 = 43 Q3 = 48Q1 = 37
= 48 – 37
= 11
The quiz scores in the middle portion of
the data set vary by at most 11 points.
BOX AND WHISKER PLOT
A box-and-whisker plot is an exploratory data analysis tool that
highlights the important features of a data set.
The five-number summary is used to draw the graph.
• The minimum entry
• Q1
• Q2 (median)
• Q3
• The maximum entry
Example:
Use the data from the 15 quiz scores to draw a box-and-whisker plot.
Continued.
28 30 33 37 37 38 42 43 43 44 45 48 48 51 55
BOX AND WHISKER PLOT
Five-number summary
• The minimum entry
• Q1
• Q2 (median)
• Q3
• The maximum entry
37
28
55
43
48
40 44 48 52363228 56
28 37 43 48 55
Quiz Scores
BOXPLOT
0
20
40
60
80
100
120
140
160
1
q1
min
median
max
q3
Distribution of Age in Month
0
20
40
60
80
100
120
140
160
1
q1
min
median
max
q3
PERCENTILES AND
DECILES
Fractiles are numbers that partition, or divide, an ordered data
set.
Percentiles divide an ordered data set into 100 parts. There are
99 percentiles: P1, P2, P3…P99.
Deciles divide an ordered data set into 10 parts. There are 9
deciles: D1, D2, D3…D9.
A test score at the 80th percentile (P80), indicates that
the test score is greater than 80% of all other test
scores and less than or equal to 20% of the scores.
DECILES AND PERCENTILES
Percentiles: If data is ordered and divided into 100
parts, then cut points are called Percentiles. 25th
percentile is the Q1, 50th percentile is the Median (Q2)
and the 75th percentile of the data is Q3.
Coefficient of Variation: The standard deviation of data
divided by it’s mean. It is usually expressed in percent.
100
x
Coefficient of
Variation =

More Related Content

What's hot

Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)YesAnalytics
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Dalia El-Shafei
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statisticsewhite00
 
Basics of statistics
Basics of statisticsBasics of statistics
Basics of statisticsdonthuraj
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAnand Thokal
 
Measure OF Central Tendency
Measure OF Central TendencyMeasure OF Central Tendency
Measure OF Central TendencyIqrabutt038
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central TendencyRejvi Ahmed
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variabilitysyedaumme
 
Sampling and statistical inference
Sampling and statistical inferenceSampling and statistical inference
Sampling and statistical inferenceBhavik A Shah
 
Simple random sampling
Simple random samplingSimple random sampling
Simple random samplingsuncil0071
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statisticsaan786
 
Normal distribution
Normal distributionNormal distribution
Normal distributionSonamWadhwa3
 
Basic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsBasic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsAhmed-Refat Refat
 
What is statistics
What is statisticsWhat is statistics
What is statisticsRaj Teotia
 

What's hot (20)

Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)
 
Basic Statistics
Basic  StatisticsBasic  Statistics
Basic Statistics
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Basics of statistics
Basics of statisticsBasics of statistics
Basics of statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Measure OF Central Tendency
Measure OF Central TendencyMeasure OF Central Tendency
Measure OF Central Tendency
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central Tendency
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Sampling and statistical inference
Sampling and statistical inferenceSampling and statistical inference
Sampling and statistical inference
 
Measures of Variation
Measures of VariationMeasures of Variation
Measures of Variation
 
Simple random sampling
Simple random samplingSimple random sampling
Simple random sampling
 
1.2 types of data
1.2 types of data1.2 types of data
1.2 types of data
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
Sampling techniques
Sampling techniques  Sampling techniques
Sampling techniques
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
Basic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsBasic Statistical Concepts and Methods
Basic Statistical Concepts and Methods
 
Measures of Variability
Measures of VariabilityMeasures of Variability
Measures of Variability
 
What is statistics
What is statisticsWhat is statistics
What is statistics
 

Similar to Descriptive statistics

Mod 4 data presentation graphs bar charts tables
Mod 4 data presentation graphs bar charts tablesMod 4 data presentation graphs bar charts tables
Mod 4 data presentation graphs bar charts tablesmpape
 
Lecture 10.1 10.2 bt
Lecture 10.1 10.2 btLecture 10.1 10.2 bt
Lecture 10.1 10.2 btbtmathematics
 
Lecture-2 Descriptive Statistics-Box Plot Descriptive Measures.pdf
Lecture-2 Descriptive Statistics-Box Plot  Descriptive Measures.pdfLecture-2 Descriptive Statistics-Box Plot  Descriptive Measures.pdf
Lecture-2 Descriptive Statistics-Box Plot Descriptive Measures.pdfAzanHaider1
 
2. Descriptive Statistics.pdf
2. Descriptive Statistics.pdf2. Descriptive Statistics.pdf
2. Descriptive Statistics.pdfYomifDeksisaHerpa
 
Penggambaran Data dengan Grafik
Penggambaran Data dengan GrafikPenggambaran Data dengan Grafik
Penggambaran Data dengan Grafikanom0164
 
Numerical and statistical methods new
Numerical and statistical methods newNumerical and statistical methods new
Numerical and statistical methods newAabha Tiwari
 
maft0a2_Statistics_lecture2_2021.pptx
maft0a2_Statistics_lecture2_2021.pptxmaft0a2_Statistics_lecture2_2021.pptx
maft0a2_Statistics_lecture2_2021.pptxTshegofatso Mphake
 
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)Pranjal Saxena
 
cewatrew namsneh hasukjak
cewatrew namsneh hasukjakcewatrew namsneh hasukjak
cewatrew namsneh hasukjakgarbimels
 
17545489052556.pdf
17545489052556.pdf17545489052556.pdf
17545489052556.pdfAshish Gupta
 
Graphical Representation of Statistical data
Graphical Representation of Statistical dataGraphical Representation of Statistical data
Graphical Representation of Statistical dataMD SAMSER
 
Data presentation Lecture
Data presentation Lecture Data presentation Lecture
Data presentation Lecture AB Rajar
 
Lecture 10.1 10.2 bt
Lecture 10.1 10.2 btLecture 10.1 10.2 bt
Lecture 10.1 10.2 btbtmathematics
 
Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...
Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...
Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...Alvaradoree
 
Statistical Methods: Graphical Representation of Data
Statistical Methods: Graphical Representation of DataStatistical Methods: Graphical Representation of Data
Statistical Methods: Graphical Representation of DataDr. Ramkrishna Singh Solanki
 
Basic understanding of Plots and diagrams used in data interpretation
 Basic understanding of Plots and diagrams used in data interpretation   Basic understanding of Plots and diagrams used in data interpretation
Basic understanding of Plots and diagrams used in data interpretation Subedi Suraj
 

Similar to Descriptive statistics (20)

Mod 4 data presentation graphs bar charts tables
Mod 4 data presentation graphs bar charts tablesMod 4 data presentation graphs bar charts tables
Mod 4 data presentation graphs bar charts tables
 
Lecture 10.1 10.2 bt
Lecture 10.1 10.2 btLecture 10.1 10.2 bt
Lecture 10.1 10.2 bt
 
Lecture-2 Descriptive Statistics-Box Plot Descriptive Measures.pdf
Lecture-2 Descriptive Statistics-Box Plot  Descriptive Measures.pdfLecture-2 Descriptive Statistics-Box Plot  Descriptive Measures.pdf
Lecture-2 Descriptive Statistics-Box Plot Descriptive Measures.pdf
 
Mc1020
Mc1020Mc1020
Mc1020
 
2. Descriptive Statistics.pdf
2. Descriptive Statistics.pdf2. Descriptive Statistics.pdf
2. Descriptive Statistics.pdf
 
Penggambaran Data dengan Grafik
Penggambaran Data dengan GrafikPenggambaran Data dengan Grafik
Penggambaran Data dengan Grafik
 
Numerical and statistical methods new
Numerical and statistical methods newNumerical and statistical methods new
Numerical and statistical methods new
 
maft0a2_Statistics_lecture2_2021.pptx
maft0a2_Statistics_lecture2_2021.pptxmaft0a2_Statistics_lecture2_2021.pptx
maft0a2_Statistics_lecture2_2021.pptx
 
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
 
Statistics
StatisticsStatistics
Statistics
 
cewatrew namsneh hasukjak
cewatrew namsneh hasukjakcewatrew namsneh hasukjak
cewatrew namsneh hasukjak
 
Statistics 220131103055
Statistics 220131103055Statistics 220131103055
Statistics 220131103055
 
17545489052556.pdf
17545489052556.pdf17545489052556.pdf
17545489052556.pdf
 
Graphical Representation of Statistical data
Graphical Representation of Statistical dataGraphical Representation of Statistical data
Graphical Representation of Statistical data
 
Data presentation Lecture
Data presentation Lecture Data presentation Lecture
Data presentation Lecture
 
Lecture 10.1 10.2 bt
Lecture 10.1 10.2 btLecture 10.1 10.2 bt
Lecture 10.1 10.2 bt
 
Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...
Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...
Mathematical Statistics with Applications in R 2nd Edition Ramachandran Solut...
 
Forcast2
Forcast2Forcast2
Forcast2
 
Statistical Methods: Graphical Representation of Data
Statistical Methods: Graphical Representation of DataStatistical Methods: Graphical Representation of Data
Statistical Methods: Graphical Representation of Data
 
Basic understanding of Plots and diagrams used in data interpretation
 Basic understanding of Plots and diagrams used in data interpretation   Basic understanding of Plots and diagrams used in data interpretation
Basic understanding of Plots and diagrams used in data interpretation
 

More from University of Jaffna (16)

Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Test of hypotheses
Test of hypothesesTest of hypotheses
Test of hypotheses
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Correlation
CorrelationCorrelation
Correlation
 
Comparing means
Comparing meansComparing means
Comparing means
 
Data presentation
Data presentationData presentation
Data presentation
 
Intro to stats
Intro to statsIntro to stats
Intro to stats
 
Capital stucture copy
Capital stucture   copyCapital stucture   copy
Capital stucture copy
 
Overview of cost & Management Accounting
Overview of cost & Management AccountingOverview of cost & Management Accounting
Overview of cost & Management Accounting
 
Service portfolio structure & firm performance
Service portfolio structure & firm performanceService portfolio structure & firm performance
Service portfolio structure & firm performance
 
Impact of leverage on shareholders’ return (1)
Impact of leverage on shareholders’ return (1)Impact of leverage on shareholders’ return (1)
Impact of leverage on shareholders’ return (1)
 
Comparison of strategic plan of cbsl & sbp
Comparison of strategic plan of cbsl & sbpComparison of strategic plan of cbsl & sbp
Comparison of strategic plan of cbsl & sbp
 
srilankan apparel industry
srilankan apparel industrysrilankan apparel industry
srilankan apparel industry
 
business failures in Jaffna
business failures in Jaffnabusiness failures in Jaffna
business failures in Jaffna
 
tax administration in srilanka
tax administration in srilankatax administration in srilanka
tax administration in srilanka
 
lkas 21
lkas 21lkas 21
lkas 21
 

Recently uploaded

04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 

Recently uploaded (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 

Descriptive statistics

  • 2. TYPES OF STATISTICS Types of statistics:  descriptive (which summarize some characteristic of a sample) • Measures of central tendency • Measures of distribution • Measures of skewness  inferential (which test for significant differences between groups and/or significant relationships among variables within the sample  t-ratio, chi-square, beta-value 13/07/2018 Descriptive Statistics 2
  • 3. WHAT TO DESCRIBE?  What is the “location” or “center” of the data? (“measures of location”)  How do the data vary? (“measures of variability”) 13/07/2018 Descriptive Statistics 3
  • 4. DESCRIPTIVE STATISTICS Class A--IQs of 13 Students 102 115 128 109 131 89 98 106 140 119 93 97 110 Class B--IQs of 13 Students 127 162 131 103 96 111 80 109 93 87 120 105 109 An Illustration : Which Group is Smarter? Each individual may be different. If you try to understand a group by remembering the qualities of each member, you become overwhelmed and fail to understand the group.13/07/2018 Descriptive Statistics 4
  • 5. DESCRIPTIVE STATISTICS Which group is smarter now? Class A--Average IQ Class B--Average IQ 110.54 110.23 They’re roughly the same! With a summary descriptive statistic, it is much easier to answer our question. 13/07/2018 Descriptive Statistics 5
  • 6. TYPES OF DESCRIPTIVE STATISTICS Organize Data Tables Graphs Summarize Data Central Tendency Variation 13/07/2018 Descriptive Statistics 6
  • 7. DESCRIPTIVE STATISTICS Types of descriptive statistics: Organize Data Tables  Frequency Distributions  Relative Frequency Distributions Graphs  Bar Chart or Histogram  Stem and Leaf Plot  Frequency Polygon 13/07/2018 Descriptive Statistics 7
  • 8. DESCRIPTIVE STATISTICS Summarizing Data: Central Tendency (or Groups’ “Middle Values”)  Mean  Median  Mode Variation (or Summary of Differences Within Groups)  Range  Interquartile Range  Variance  Standard Deviation
  • 10. INDICATORS OF CENTRAL TENDENCY Mode Most Frequently Occurring Score Median Middle Score Mean Arithmetic Average, etc. 13/07/2018 Descriptive Statistics 10
  • 11. INDICATORS OF CENTRAL TENDENCY Mode Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k = 15 k.y-1 •Quick and easy to compute •Unaffected by extreme scores •Can be used at any level of measurement. Advantages 13/07/2018 Descriptive Statistics 11
  • 12. INDICATORS OF CENTRAL TENDENCY Mode Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k = 15 k.y-1 •Terminal Statistic • A given sub-group could make this measure unrepresentative. Disadvantages 13/07/2018 Descriptive Statistics 12
  • 13. INDICATORS OF CENTRAL TENDENCY Median Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k 50th Percentile = n + 1 2 13/07/2018 Descriptive Statistics 13
  • 14. INDICATORS OF CENTRAL TENDENCY Median Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k For an even number of scores take the mean of the middle two: (19 + 20) 2 = 19.5 = 19.5 k.y-1 13/07/2018 Descriptive Statistics 14
  • 15. INDICATORS OF CENTRAL TENDENCY Median Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k •Unaffected by extreme scores •Can be used at all levels above nominal. Advantages = 19.5 k.y-1 13/07/2018 Descriptive Statistics 15
  • 16. INDICATORS OF CENTRAL TENDENCY Median Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k •Only considers order- value ignored. Disadvantages = 19.5 k.y-1 13/07/2018 Descriptive Statistics 16
  • 17. INDICATORS OF CENTRAL TENDENCY Douglas Bag drove 100 miles from Bath to London at 100 miles∙h-1 but was caught in traffic on his return journey, so was limited to 50 miles∙h-1. What was his average speed for the entire journey? Mean 13/07/2018 Descriptive Statistics 17
  • 18. INDICATORS OF CENTRAL TENDENCY -Arithmetic Mean -Harmonic Mean -Geometric Mean also.. -f mean -Truncated mean -Power mean -Weighted arithmetic mean -Chisini mean -Identric mean, etc, etc… 13/07/2018 Descriptive Statistics 18
  • 19. INDICATORS OF CENTRAL TENDENCY Mean Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k X = ∑ X n 13/07/2018 Descriptive Statistics 19
  • 20. INDICATORS OF CENTRAL TENDENCY Mean Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k X = (10+11+11+15+15+15+19+20+21+21+22+22+24+25) 14 251 = 17.9 k.y-1 13/07/2018 Descriptive Statistics 20
  • 21. INDICATORS OF CENTRAL TENDENCY Mean Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k = 17.9 k.y-1 •Very sensitive measure •Takes into account all the available information •Can be combined with means of other groups to give the overall mean. Advantages 13/07/2018 Descriptive Statistics 21
  • 22. INDICATORS OF CENTRAL TENDENCY Mean Annual Salary: 10k 11k 11k 15k 15k 15k 19k 20k 21k 21k 22k 22k 24k 25k = 17.9 k.y-1 •Very sensitive measure •Can only be used on interval or ratio data Disadvantages 13/07/2018 Descriptive Statistics 22
  • 24. Upper Class Limits Class Frequency, f 1 – 4 4 5 – 8 5 9 – 12 3 13 – 16 4 17 – 20 2 FREQUENCY DISTRIBUTIONS A frequency distribution is a table that shows classes or intervals of data with a count of the number in each class. The frequency f of a class is the number of data points in the class. Frequencies Lower Class Limits
  • 25. CONSTRUCTING A FREQUENCY DISTRIBUTION 18 20 21 27 29 20 19 30 32 19 34 19 24 29 18 37 38 22 30 39 32 44 33 46 54 49 18 51 21 21 Example: The following data represents the ages of 30 students in a statistics class. Construct a frequency distribution that has five classes. Continued. Ages of Students
  • 26. CONSTRUCTING A FREQUENCY DISTRIBUTION Example continued: 250 – 57 342 – 49 434 – 41 826 – 33 1318 – 25 Tally Frequency, fClass 30f  Number of studentsAges Check that the sum equals the number in the sample. Ages of Students
  • 27. RELATIVE FREQUENCY Example: Find the relative frequencies for the “Ages of Students” frequency distribution. 50 – 57 2 3 4 8 13 42 – 49 34 – 41 26 – 33 18 – 25 Frequency, fClass 30f  Relative Frequency 0.067 0.1 0.133 0.267 0.433 f n 13 30  0.433 1 f n   Portion of students
  • 28. CUMULATIVE FREQUENCY The cumulative frequency of a class is the sum of the frequency for that class and all the previous classes. 30 28 25 21 13 Total number of students + + + +50 – 57 2 3 4 8 13 42 – 49 34 – 41 26 – 33 18 – 25 Frequency, fClass 30f  Cumulative Frequency Ages of Students
  • 29. FREQUENCY HISTOGRAM Example: Draw a frequency histogram for the “Ages of Students” frequency distribution. Use the class boundaries. 2 3 4 8 13 Broken axis Ages of Students 10 8 6 4 2 0 Age (in years) f 12 14 17.5 25.5 33.5 41.5 49.5 57.5
  • 30. FREQUENCY POLYGON A frequency polygon is a line graph that emphasizes the continuous change in frequencies. Broken axis Ages of Students 10 8 6 4 2 0 Age (in years) f 12 14 13.5 21.5 29.5 37.5 45.5 53.5 61.5 Midpoints Line is extended to the x-axis.
  • 31. RELATIVE FREQUENCY HISTOGRAM A relative frequency histogram has the same shape and the same horizontal scale as the corresponding frequency histogram. 0.4 0.3 0.2 0.1 0.5 Ages of Students 0 Age (in years) Relativefrequency (portionofstudents) 17.5 25.5 33.5 41.5 49.5 57.5 0.433 0.267 0.133 0.1 0.067
  • 32. CUMULATIVE FREQUENCY GRAPH A cumulative frequency graph or ogive, is a line graph that displays the cumulative frequency of each class at its upper class boundary. 17.5 Age (in years) Ages of Students 24 18 12 6 30 0 Cumulativefrequency (portionofstudents) 25.5 33.5 41.5 49.5 57.5 The graph ends at the upper boundary of the last class.
  • 33. STEM-AND-LEAF PLOT In a stem-and-leaf plot, each number is separated into a stem (usually the entry’s leftmost digits) and a leaf (usually the rightmost digit). This is an example of exploratory data analysis. Example: The following data represents the ages of 30 students in a statistics class. Display the data in a stem-and-leaf plot. Ages of Students Continued. 18 20 21 27 29 20 19 30 32 19 34 19 24 29 18 37 38 22 30 39 32 44 33 46 54 49 18 51 21 21
  • 34. STEM-AND-LEAF PLOT Ages of Students 1 2 3 4 5 8 8 8 9 9 9 0 0 1 1 1 2 4 7 9 9 0 0 2 2 3 4 7 8 9 4 6 9 1 4 Key: 1|8 = 18 This graph allows us to see the shape of the data as well as the actual values. Most of the values lie between 20 and 39.
  • 35. DOT PLOT In a dot plot, each data entry is plotted, using a point, above a horizontal axis. Example: Use a dot plot to display the ages of the 30 students in the statistics class. 18 20 21 27 29 20 19 30 32 19 34 19 24 29 18 37 38 22 30 39 32 44 33 46 54 49 18 51 21 21 Ages of Students Continued.
  • 36. DOT PLOT Ages of Students 15 18 2 4 4 5 4 8 21 513 0 5 4 3 9 4 2 3 3 3 6 2 7 5 7 From this graph, we can conclude that most of the values lie between 18 and 32.
  • 37. PIE CHART A pie chart is a circle that is divided into sectors that represent categories. The area of each sector is proportional to the frequency of each category. Accidental Deaths in the USA in 2002 (Source: US Dept. of Transportation) Continued. Type Frequency Motor Vehicle 43,500 Falls 12,200 Poison 6,400 Drowning 4,600 Fire 4,200 Ingestion of Food/Object 2,900 Firearms 1,400
  • 38. PIE CHART To create a pie chart for the data, find the relative frequency (percent) of each category. Continued. Type Frequency Relative Frequency Motor Vehicle 43,500 0.578 Falls 12,200 0.162 Poison 6,400 0.085 Drowning 4,600 0.061 Fire 4,200 0.056 Ingestion of Food/Object 2,900 0.039 Firearms 1,400 0.019 n = 75,200
  • 40. Pareto Chart A Pareto chart is a vertical bar graph is which the height of each bar represents the frequency. The bars are placed in order of decreasing height, with the tallest bar to the left. Accidental Deaths in the USA in 2002 (Source: US Dept. of Transportation) Continued. Type Frequency Motor Vehicle 43,500 Falls 12,200 Poison 6,400 Drowning 4,600 Fire 4,200 Ingestion of Food/Object 2,900 Firearms 1,400
  • 41. Pareto Chart Accidental Deaths 5000 10000 35000 40000 45000 30000 25000 20000 15000 Poison Poison DrowningFalls Motor Vehicles Fire Firearms Ingestion of Food/Object
  • 42. SCATTER PLOT When each entry in one data set corresponds to an entry in another data set, the sets are called paired data sets. In a scatter plot, the ordered pairs are graphed as points in a coordinate plane. The scatter plot is used to show the relationship between two quantitative variables. The following scatter plot represents the relationship between the number of absences from a class during the semester and the final grade. Continued.
  • 43. SCATTER PLOT Absences Grade x 8 2 5 12 15 9 6 y 78 92 90 58 43 74 81 Final grade (y) 0 2 4 6 8 10 12 14 16 40 50 60 70 80 90 Absences (x) 100 From the scatter plot, you can see that as the number of absences increases, the final grade tends to decrease.
  • 44. TIMES SERIES CHART A data set that is composed of quantitative data entries taken at regular intervals over a period of time is a time series. A time series chart is used to graph a time series. Example: The following table lists the number of minutes Robert used on his cell phone for the last six months. Continued. Month Minutes January 236 February 242 March 188 April 175 May 199 June 135 Construct a time series chart for the number of minutes used.
  • 45. TIMES SERIES CHART Robert’s Cell Phone Usage 200 150 100 50 250 0 Minutes Month Jan Feb Mar Apr May June
  • 47. MEAN OF A FREQUENCY DISTRIBUTION The mean of a frequency distribution for a sample is approximated by where x and f are the midpoints and frequencies of the classes. No ( ) te that x fx n f n      Example: The following frequency distribution represents the ages of 30 students in a statistics class. Find the mean of the frequency distribution. Continued.
  • 48. MEAN OF A FREQUENCY DISTRIBUTION Class x f (x · f ) 18 – 25 21.5 13 279.5 26 – 33 29.5 8 236.0 34 – 41 37.5 4 150.0 42 – 49 45.5 3 136.5 50 – 57 53.5 2 107.0 n = 30 Σ = 909.0 Class midpoint ( )x fx n    The mean age of the students is 30.3 years. 909 30  30.3
  • 49. SHAPES OF DISTRIBUTIONS A frequency distribution is symmetric when a vertical line can be drawn through the middle of a graph of the distribution and the resulting halves are approximately the mirror images. A frequency distribution is uniform (or rectangular) when all entries, or classes, in the distribution have equal frequencies. A uniform distribution is also symmetric. A frequency distribution is skewed if the “tail” of the graph elongates more to one side than to the other. A distribution is skewed left (negatively skewed) if its tail extends to the left. A distribution is skewed right (positively skewed) if its tail extends to the right.
  • 50. SYMMETRIC DISTRIBUTION mean = median = mode = $25,000 15,000 20,000 22,000 24,000 25,000 25,000 26,000 28,000 30,000 35,000 10 Annual Incomes Income5 4 3 2 1 0 f $25000
  • 51. SKEWED LEFT DISTRIBUTION mean = $23,500 median = mode = $25,000 Mean < Median 0 20,000 22,000 24,000 25,000 25,000 26,000 28,000 30,000 35,000 10 Annual Incomes Income 5 4 3 2 1 0 f $25000
  • 52. SKEWED RIGHT DISTRIBUTION mean = $121,500 median = mode = $25,000 Mean > Median 15,000 20,000 22,000 24,000 25,000 25,000 26,000 28,000 30,000 1,000,000 10 Annual Incomes Income 5 4 3 2 1 0 f $25000
  • 53. UniformSymmetric Skewed right Skewed left Mean > Median Mean < Median SUMMARY OF SHAPES OF DISTRIBUTIONS Mean = Median
  • 54. The most appropriate measure of location depends on … the shape of the data’s distribution.
  • 55. MOST APPROPRIATE MEASURE OF LOCATION Depends on whether or not data are “symmetric” or “skewed”. Depends on whether or not data have one (“unimodal”) or more (“multimodal”) modes.
  • 56. MEDIAN If the recorded values for a variable form a symmetric distribution, the median and mean are identical. In skewed data, the mean lies further toward the skew than the median. Mean Median Mean Median Symmetric Skewed
  • 57. MODE 1. It may give you the most likely experience rather than the “typical” or “central” experience. 2. In symmetric distributions, the mean, median, and mode are the same. 3. In skewed data, the mean and median lie further toward the skew than the mode. Median Mean Median MeanMode Mode Symmetric Skewed
  • 58. SKEWNESS OF DISTRIBUTIONS  Measures look at how lopsided distributions are— how far from the ideal of the normal curve they are  When the median and the mean are different, the distribution is skewed. The greater the difference, the greater the skew.  If the skewness is extreme, the researcher should either transform the data to make them better resemble a normal curve or else use a different set of statistics—nonparametric statistics—to carry out the analysis
  • 59.
  • 60.
  • 62. METHODS OF VARIABILITY MEASUREMENT Commonly used methods: range, variance, standard deviation, interquartile range, coefficient of variation etc. Variability (or dispersion) measures the amount of scatter in a dataset.
  • 63. RANGE The range of a data set is the difference between the maximum and minimum date entries in the set. Range = (Maximum data entry) – (Minimum data entry) Example: The following data are the closing prices for a certain stock on ten successive Fridays. Find the range. Stock 56 56 57 58 61 63 63 67 67 67 The range is 67 – 56 = 11.
  • 65. DEVIATION The deviation of an entry x in a population data set is the difference between the entry and the mean μ of the data set. Deviation of x = x – μ Example: The following data are the closing prices for a certain stock on five successive Fridays. Find the deviation of each price. The mean stock price is μ = 305/5 = 61. 67 – 61 = 6 Σ(x – μ) = 0 63 – 61 = 2 61 – 61 = 0 58 – 61 = – 3 56 – 61 = – 5 Deviation x – μ Σx = 305 67 63 61 58 56 Stock x
  • 66. Variance (S2) Average of squared distances of individual points from the mean sample variance High variance means that most scores are far away from the mean. Low variance indicates that most scores cluster tightly about the mean. ESTIMATES OF DISPERSION
  • 67. VARIANCE AND STANDARD DEVIATION The population variance of a population data set of N entries is Population variance = 2 2 ( ) . x μ N     “sigma squared” The population standard deviation of a population data set of N entries is the square root of the population variance. Population standard deviation = 2 2 ( ) . x μ N       “sigma”
  • 68. FINDING THE POPULATION STANDARD DEVIATION Guidelines In Words In Symbols 1. Find the mean of the population data set. 2. Find the deviation of each entry. 3. Square each deviation. 4. Add to get the sum of squares. 5. Divide by N to get the population variance. 6. Find the square root of the variance to get the population standard deviation. xμ N   x μ  2 x μ  2 xSS x μ    2 2 x μ N      2 x μ N    
  • 69. FINDING THE SAMPLE STANDARD DEVIATION Guidelines In Words In Symbols 1. Find the mean of the sample data set. 2. Find the deviation of each entry. 3. Square each deviation. 4. Add to get the sum of squares. 5. Divide by n – 1 to get the sample variance. 6. Find the square root of the variance to get the sample standard deviation. xx n   x x  2 x x  2 xSS x x    2 2 1 x x s n      2 1 x x s n    
  • 70. INTERPRETING STANDARD DEVIATION When interpreting standard deviation, remember that is a measure of the typical amount an entry deviates from the mean. The more the entries are spread out, the greater the standard deviation. 10 8 6 4 2 0 Data value Frequency 12 14 2 4 6 = 4 s = 1.18 x 10 8 6 4 2 0 Data value Frequency 12 14 2 4 6 = 4 s = 0 x
  • 72. QUARTILES The three quartiles, Q1, Q2, and Q3, approximately divide an ordered data set into four equal parts. Median 0 5025 10075 Q3Q2Q1 Q1 is the median of the data below Q2. Q3 is the median of the data above Q2.
  • 73. FINDING QUARTILES Example: The quiz scores for 15 students is listed below. Find the first, second and third quartiles of the scores. 28 43 48 51 43 30 55 44 48 33 45 37 37 42 38 Order the data. 28 30 33 37 37 38 42 43 43 44 45 48 48 51 55 Lower half Upper half Q2Q1 Q3 About one fourth of the students scores 37 or less; about one half score 43 or less; and about three fourths score 48 or less.
  • 74. INTERQUARTILE RANGE The interquartile range (IQR) of a data set is the difference between the third and first quartiles. Interquartile range (IQR) = Q3 – Q1. Example: The quartiles for 15 quiz scores are listed below. Find the interquartile range. (IQR) = Q3 – Q1 Q2 = 43 Q3 = 48Q1 = 37 = 48 – 37 = 11 The quiz scores in the middle portion of the data set vary by at most 11 points.
  • 75. BOX AND WHISKER PLOT A box-and-whisker plot is an exploratory data analysis tool that highlights the important features of a data set. The five-number summary is used to draw the graph. • The minimum entry • Q1 • Q2 (median) • Q3 • The maximum entry Example: Use the data from the 15 quiz scores to draw a box-and-whisker plot. Continued. 28 30 33 37 37 38 42 43 43 44 45 48 48 51 55
  • 76. BOX AND WHISKER PLOT Five-number summary • The minimum entry • Q1 • Q2 (median) • Q3 • The maximum entry 37 28 55 43 48 40 44 48 52363228 56 28 37 43 48 55 Quiz Scores
  • 77. BOXPLOT 0 20 40 60 80 100 120 140 160 1 q1 min median max q3 Distribution of Age in Month 0 20 40 60 80 100 120 140 160 1 q1 min median max q3
  • 78. PERCENTILES AND DECILES Fractiles are numbers that partition, or divide, an ordered data set. Percentiles divide an ordered data set into 100 parts. There are 99 percentiles: P1, P2, P3…P99. Deciles divide an ordered data set into 10 parts. There are 9 deciles: D1, D2, D3…D9. A test score at the 80th percentile (P80), indicates that the test score is greater than 80% of all other test scores and less than or equal to 20% of the scores.
  • 79. DECILES AND PERCENTILES Percentiles: If data is ordered and divided into 100 parts, then cut points are called Percentiles. 25th percentile is the Q1, 50th percentile is the Median (Q2) and the 75th percentile of the data is Q3. Coefficient of Variation: The standard deviation of data divided by it’s mean. It is usually expressed in percent. 100 x Coefficient of Variation =