The document describes various statistical methods for describing and analyzing data, including measures of central tendency (mean, median), variability (range, standard deviation, interquartile range), and distribution (histograms, boxplots). It provides examples of calculating these statistics and interpreting them for real data sets. Comparisons are made between the sample mean and median, and between theoretical descriptions of data distributions (Chebyshev's Rule and the Empirical Rule) and actual data analyses.
In statistics, the standard score is the (signed) number of standard deviations an observation or datum is above the mean. Thus, a positive standard score represents a datum above the mean, while a negative standard score represents a datum below the mean. It is a dimensionless quantity obtained by subtracting the population mean from an individual raw score and then dividing the difference by the population standard deviation. This conversion process is called standardizing or normalizing (however, "normalizing" can refer to many types of ratios; see normalization (statistics) for more).Standard scores are also called z-values, z-scores, normal scores, and standardized variables; the use of "Z" is because the normal distribution is also known as the "Z distribution". They are most frequently used to compare a sample to a standard normal deviate (standard normal distribution, with μ = 0 and σ = 1), though they can be defined without assumptions of normality.
presentation of data
Tabulated data can be easily understand and interpreted.
Graphical forms makes it possible to easily draw visual impression of data.
It makes comparisons easily.
This kind of method create an imprint on mind for a long period of time.
This presentation covers statistics, its importance, its applications, branches of statistics, basic concepts used in statistics, data sampling, types of sampling,types of data and collection of data.
In statistics, the standard score is the (signed) number of standard deviations an observation or datum is above the mean. Thus, a positive standard score represents a datum above the mean, while a negative standard score represents a datum below the mean. It is a dimensionless quantity obtained by subtracting the population mean from an individual raw score and then dividing the difference by the population standard deviation. This conversion process is called standardizing or normalizing (however, "normalizing" can refer to many types of ratios; see normalization (statistics) for more).Standard scores are also called z-values, z-scores, normal scores, and standardized variables; the use of "Z" is because the normal distribution is also known as the "Z distribution". They are most frequently used to compare a sample to a standard normal deviate (standard normal distribution, with μ = 0 and σ = 1), though they can be defined without assumptions of normality.
presentation of data
Tabulated data can be easily understand and interpreted.
Graphical forms makes it possible to easily draw visual impression of data.
It makes comparisons easily.
This kind of method create an imprint on mind for a long period of time.
This presentation covers statistics, its importance, its applications, branches of statistics, basic concepts used in statistics, data sampling, types of sampling,types of data and collection of data.
TSTD 6251 Fall 2014SPSS Exercise and Assignment 120 PointsI.docxnanamonkton
TSTD 6251 Fall 2014
SPSS Exercise and Assignment 1
20 Points
In this class, we are going to study descriptive summary statistics and learn how to construct box plot. We are still working with univariate variable for this exercise.
Practice Example:
Admission receipts (in million of dollars) for a recent season are given below for the
n =
30 major league baseball teams:
19.4 26.6 22.9 44.5 24.4 19.0 27.5 19.9 22.8 19.0 16.9 15.2 25.7 19.0 15.5 17.1 15.6 10.6 16.2 15.6 15.4 18.2 15.5 14.2 9.5 9.9
10.7 11.9 26.7 17.5
Require:
a. Compute the mean, variance and standard deviation.
b. Find the sample median, first quartile, and third quartile.
c. Construct a boxplot and interpret the distribution of the data.
d. Discuss the distribution of this set of data by examining kurtosis and skewness
statistics, such as if the distribution is skewed to one side of the distribution, and if the
distribution shows a peaked/skinny curve or a spread out/flat curve.
SPSS Procedures for Computing Summary Statistics
:
Enter the 30 data values in the first column of SPSS
Data View
Tab
Variable View
and name this variable
receipts
Adjust
Decimals
to 3 decimal points
Type
Admission Receipts
($ mn)
in the
Label
column for output viewer
Return to
Data View
and click
A
nalyze
on the menu bar
Click the second menu
D
e
scriptive Statistics
Click
F
requencies …
Move
Admission Receipts
to the
Variable(s)
list by clicking the arrow button
Click
S
tatistics …
button at the top of the dialog box
Now, you can select the descriptive statistics according to what the question requires. For this practice question, it requires central tendency, dispersion, percentile and distribution statistics, so we click all the boxes
except for
P
ercentile(s): and Va
l
ues are group midpoints
.
Click
Continue
to return to the
Frequencies
dialog box
Click
OK
to generate descriptive statistic output which is pasted below:
The first table provides summary statistics and the second table lists frequencies, relative frequencies and cumulative frequencies. The statistics required for solving this problem are highlighted in red.
Statistics
Admission Receipts
N
Valid
30
Missing
0
Mean
18.76333
Std. Error of Mean
1.278590
Median
17.30000
Mode
19.000
Std. Deviation
7.003127
Variance
49.043782
Skewness
1.734
Std. Error of Skewness
.427
Kurtosis
5.160
Std. Error of Kurtosis
.833
Range
35.000
Minimum
9.500
Maximum
44.500
Sum
562.900
Percentiles
10
10.61000
20
14.40000
25
15.35000
30
15.50000
40
15.84000
50
17.30000
60
19.00000
70
19.75000
75
22.82500
80
24.10000
90
26.69000
Admission Receipts
Frequency
Percent
Valid Percent
Cumulative Percent
Valid
9.500
1
3.3
3.3
3.3
9.900
1
3.3
3.3
6.7
10.600
1
3.3
3.3
10.0
10.700
1
3.3
3.3
13.3
11.900
1
3.3
3.3
16.7
14.200
1
3.3
3.3
20.0
15.2.
Answer the questions in one paragraph 4-5 sentences. · Why did t.docxboyfieldhouse
Answer the questions in one paragraph 4-5 sentences.
· Why did the class collectively sign a blank check? Was this a wise decision; why or why not? we took a decision all the class without hesitation
· What is something that I said individuals should always do; what is it; why wasn't it done this time? Which mitigation strategies were used; what other strategies could have been used/considered? individuals should always participate in one group and take one decision
SAMPLING MEAN:
DEFINITION:
The term sampling mean is a statistical term used to describe the properties of statistical distributions. In statistical terms, the sample meanfrom a group of observations is an estimate of the population mean. Given a sample of size n, consider n independent random variables X1, X2... Xn, each corresponding to one randomly selected observation. Each of these variables has the distribution of the population, with mean and standard deviation. The sample mean is defined to be
WHAT IT IS USED FOR:
It is also used to measure central tendency of the numbers in a database. It can also be said that it is nothing more than a balance point between the number and the low numbers.
HOW TO CALCULATE IT:
To calculate this, just add up all the numbers, then divide by how many numbers there are.
Example: what is the mean of 2, 7, and 9?
Add the numbers: 2 + 7 + 9 = 18
Divide by how many numbers (i.e., we added 3 numbers): 18 ÷ 3 = 6
So the Mean is 6
SAMPLE VARIANCE:
DEFINITION:
The sample variance, s2, is used to calculate how varied a sample is. A sample is a select number of items taken from a population. For example, if you are measuring American people’s weights, it wouldn’t be feasible (from either a time or a monetary standpoint) for you to measure the weights of every person in the population. The solution is to take a sample of the population, say 1000 people, and use that sample size to estimate the actual weights of the whole population.
WHAT IT IS USED FOR:
The sample variance helps you to figure out the spread out in the data you have collected or are going to analyze. In statistical terminology, it can be defined as the average of the squared differences from the mean.
HOW TO CALCULATE IT:
Given below are steps of how a sample variance is calculated:
· Determine the mean
· Then for each number: subtract the Mean and square the result
· Then work out the mean of those squared differences.
To work out the mean, add up all the values then divide by the number of data points.
First add up all the values from the previous step.
But how do we say "add them all up" in mathematics? We use the Roman letter Sigma: Σ
The handy Sigma Notation says to sum up as many terms as we want.
· Next we need to divide by the number of data points, which is simply done by multiplying by "1/N":
Statistically it can be stated by the following:
·
· This value is the variance
EXAMPLE:
Sam has 20 Rose Bushes.
The number of flowers on each b.
SAMPLING MEAN DEFINITION The term sampling mean is.docxagnesdcarey33086
SAMPLING MEAN:
DEFINITION:
The term sampling mean is a statistical term used to describe the properties of statistical
distributions. In statistical terms, the sample mean from a group of observations is an
estimate of the population mean . Given a sample of size n, consider n independent random
variables X1, X2... Xn, each corresponding to one randomly selected observation. Each of these
variables has the distribution of the population, with mean and standard deviation . The
sample mean is defined to be
WHAT IT IS USED FOR:
It is also used to measure central tendency of the numbers in a database. It can also be said that
it is nothing more than a balance point between the number and the low numbers.
HOW TO CALCULATE IT:
To calculate this, just add up all the numbers, then divide by how many numbers there are.
Example: what is the mean of 2, 7, and 9?
Add the numbers: 2 + 7 + 9 = 18
Divide by how many numbers (i.e., we added 3 numbers): 18 ÷ 3 = 6
So the Mean is 6
SAMPLE VARIANCE:
DEFINITION:
The sample variance, s2, is used to calculate how varied a sample is. A sample is a select number
of items taken from a population. For example, if you are measuring American people’s weights,
it wouldn’t be feasible (from either a time or a monetary standpoint) for you to measure the
weights of every person in the population. The solution is to take a sample of the population, say
1000 people, and use that sample size to estimate the actual weights of the whole population.
WHAT IT IS USED FOR:
The sample variance helps you to figure out the spread out in the data you have collected or are
going to analyze. In statistical terminology, it can be defined as the average of the squared
differences from the mean.
HOW TO CALCULATE IT:
Given below are steps of how a sample variance is calculated:
• Determine the mean
• Then for each number: subtract the Mean and square the result
• Then work out the mean of those squared differences.
To work out the mean, add up all the values then divide by the number of data points.
First add up all the values from the previous step.
But how do we say "add them all up" in mathematics? We use the Roman letter Sigma: Σ
The handy Sigma Notation says to sum up as many terms as we want.
• Next we need to divide by the number of data points, which is simply done by
multiplying by "1/N":
Statistically it can be stated by the following:
•
• This value is the variance
EXAMPLE:
Sam has 20 Rose Bushes.
The number of flowers on each bush is
9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4
Work out the sample variance
Step 1. Work out the mean
In the formula above, µ (the Greek letter "mu") is the mean of all our values.
For this example, the data points are: 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4
The mean is:
(9+2+5+4+12+7+8+11+9+3+7+4+12+5+4+10+9+6+9+4) / 20 = 140/20 = 7
So:
µ.
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxEduSkills OECD
Andreas Schleicher presents at the OECD webinar ‘Digital devices in schools: detrimental distraction or secret to success?’ on 27 May 2024. The presentation was based on findings from PISA 2022 results and the webinar helped launch the PISA in Focus ‘Managing screen time: How to protect and equip students against distraction’ https://www.oecd-ilibrary.org/education/managing-screen-time_7c225af4-en and the OECD Education Policy Perspective ‘Students, digital devices and success’ can be found here - https://oe.cd/il/5yV
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
3. Describing the Center of a Data Set with the arithmetic mean The population mean is denoted by µ , is the average of all x values in the entire population.
4.
5.
6.
7.
8. Describing the Center of a Data Set with the median The sample median is obtained by first ordering the n observations from smallest to largest (with any repeated values included, so that every sample observation appears in the ordered list). Then
9. Example of Median Calculation Consider the Fancytown data. First, we put the data in numerical increasing order to get 231,000 285,000 287,000 294,000 297,000 299,000 312,000 313,000 315,000 317,000 Since there are 10 (even) data values, the median is the mean of the two values in the middle.
10. Example of Median Calculation Consider the Lowtown data. We put the data in numerical increasing order to get 93,000 95,000 97,000 99,000 100,000 110,000 113,000 121,000 122,000 2,000,000 Since there are 10 (even) data values, the median is the mean of the two values in the middle.
19. Categorical Data - Sample Proportion If we look at the student data sample, consider the variable gender and treat being female as a success, we have 25 of the sample of 79 students are female, so the sample proportion (of females) is
29. Quartiles and IQR Example With 15 data values, the median is the 8 th value. Specifically, the median is 10. 2, 4, 7, 8, 9, 10, 10, 10, 11, 12, 12, 14, 15, 19, 25 Lower quartile = 8 Upper quartile = 14 Iqr = 14 - 8 = 6 Median Lower Half Upper Half Lower quartile Q 1 Upper quartile Q 3
30.
31.
32.
33.
34.
35.
36. Modified Boxplot Example Here is the modified boxplot for the student age data. Smallest data value that isn’t an outlier Largest data value that isn’t an outlier Mild Outliers Extreme Outliers 15 20 25 30 35 40 45 50
37. Modified Boxplot Example Here is the same boxplot reproduced with a vertical orientation. 50 45 40 35 30 25 20 15
38. Comparative Boxplot Example By putting boxplots of two separate groups or subgroups we can compare their distributional behaviors. Notice that the distributional pattern of female and male student weights have similar shapes, although the females are roughly 20 lbs lighter (as a group). 100 120 140 160 180 200 220 240 Females Males G e n d e r Student Weight
45. Z Scores The z score is how many standard deviations the observation is from the mean. A positive z score indicates the observation is above the mean and a negative z score indicates the observation is below the mean.
46. Z Scores Computing the z score is often referred to as standardization and the z score is called a standardized score .
47. Example A sample of GPAs of 38 statistics students appear below (sorted in increasing order) 2.00 2.25 2.36 2.37 2.50 2.50 2.60 2.67 2.70 2.70 2.75 2.78 2.80 2.80 2.82 2.90 2.90 3.00 3.02 3.07 3.15 3.20 3.20 3.20 3.23 3.29 3.30 3.30 3.42 3.46 3.48 3.50 3.50 3.58 3.75 3.80 3.83 3.97
48. Example The following stem and leaf indicates that the GPA data is reasonably symmetric and unimodal. 2 0 2 233 2 55 2 667777 2 88899 3 0001 3 2222233 3 444555 3 7 3 889 Stem: Units digit Leaf: Tenths digit
50. Example Notice that the empirical rule gives reasonably good estimates for this example. 38/38 = 100% 99.7% within 3 standard deviations of the mean 37/38 = 97% 95% within 2 standard deviations of the mean 27/38 = 71% 68% within 1 standard deviation of the mean Actual Empirical Rule Interval
51. Comparison of Chebyshev’s Rule and the Empirical Rule The following refers to the weights in the sample of 79 students. Notice that the stem and leaf diagram suggest the data distribution is unimodal but is positively skewed because of the outliers on the high side. Nevertheless, the results for the Empirical Rule are good. 10 3 11 37 12 011444555 13 000000455589 14 000000000555 15 000000555567 16 000005558 17 0000005555 18 0358 19 5 20 00 21 0 22 55 23 79 Stem: Hundreds & tens digits Leaf: Units digit
52. Comparison of Chebyshev’ Rule and the Empirical Rule Notice that even with moderate positive skewing of the data, the Empirical Rule gave a much more usable and meaningful result. 99.7% 95% 68% Empirical Rule 79/79 = 100% 88.8% within 3 standard deviations of the mean 75/79 = 94.9% 75% within 2 standard deviations of the mean 56/79 = 70.9% 0% within 1 standard deviation of the mean Actual Chebyshev’s Rule Interval