3. Population: A population consists of all
elements – individuals, items, or objects – whose
characteristics are being studied. The population
that is being studied is also called the target
population.
Or
The entire category under consideration. Or the
complete set of elements being studied. The
population size is usually indicated by a capital N.
Examples: every lawyer in the United States;
all single women in the United States.
3
Key Terms
4. Key Terms
Sample. A portion of the population selected for study is referred
to as a sample.
or
That portion of the population that is available, or to be made
available, for analysis. A good sample is representative of the
population. We will learn about probability samples and how they
provide assurance that a sample is indeed representative. The
sample size is shown as lower case n.
If your company manufactures one million laptops, they might take a
sample of say, 500, of them to test quality. The population size is N =
1,000,000 and the sample size is n= 500.
Census: A survey that includes every member of the population is called
a census. The technique of collecting information from a portion of the
population is called a sample survey.
7. TYPES OF STATISTICS
Descriptive Statistics consists of methods for organizing,
displaying, and describing data by using tables, graphs, and
summary measures. Those statistics that summarize a
sample of numerical data in terms of averages and other
measures for the purpose of description.
Descriptive statistics, as opposed to inferential statistics,
are not concerned with the theory and methodology for
drawing inferences that extend beyond the particular set
of data examined.
Thus, a teacher who gives a class, of say, 35 students,
an exam is interested in the descriptive statistics to
assess the performance of the class. What was the class
average, the median grade, the standard deviation,
etc.? The teacher is not interested in making any
inferences to some larger population.
9. Example of inferential statistics from quality control:
GE manufactures LED bulbs and wants to know how
many are defective. Suppose one million bulbs a year
are produced in its new plant in Staten Island. The
company might sample, say, 500 bulbs to estimate the
proportion of defectives.
N = 1,000,000 and n = 500
If 5 out of 500 bulbs tested are defective, the sample
proportion of defectives will be 1% (5/500). This statistic
may be used to estimate the true proportion of defective
bulbs (the population proportion).
In this case, the sample proportion is used to make
inferences about the population proportion.
9
TYPES OF STATISTICS
10. POPULATION VERSUS SAMPLE
A sample that represents the characteristics of
the population as closely as possible is called a
representative sample.
A sample drawn in such a way that each
element of the population has a chance of being
selected is called a random sample. If all
samples of the same size selected from a
population have the same chance of being
selected, we call it simple random sampling.
Such a sample is called a simple random
sample.
Sample with replacement
Sample without replacement
11. BASIC TERMS
An element or member of a sample or
population is a specific subject or object (for
example, a person, firm, item, state, or country)
about which the information is collected.
A variable is a characteristic under study that
assumes different values for different elements. In
contrast to a variable, the value of a constant is
fixed.
The value of a variable for an element is called an
observation or measurement.
A data set is a collection of observations on one
or more variables.
16. Quantitative Variables
Discrete variables A variable whose values
are countable is called a discrete variable.
In other words, a discrete variable can
assume only certain values with no
intermediate values.
Example: How many courses have you
taken at this College? ____
17. Quantitative Variables
Continuous variables A variable that can assume
any numerical value over a certain interval or intervals
is called a continuous variable.
Arise from a measuring process.
Example: How much do you weigh? ____
One way to determine whether data is continuous, is
to ask yourself whether you can add several decimal
places to the answer.
For example, you may weigh 150 pounds but in
actuality may weigh 150.23568924567 pounds.
On the other hand, if you have 2 children, you do
not have 2.3217638 children.
24. Primary data. This is data that has been
compiled by the researcher using such techniques
as surveys, experiments, depth interviews,
observation, focus groups.
Types of surveys. A lot of data is obtained
using surveys. Each survey type has advantages
and disadvantages.
Mail: lowest rate of response; usually the lowest cost
Personally administered: can “probe”; most costly;
interviewer effects (the interviewer might influence the
response)
Telephone: fastest
Web: fast and inexpensive
Introduction 24
Primary vs. Secondary Data
25. Secondary data. This is data that has been
compiled or published elsewhere, e.g.,
census data.
The trick is to find data that is useful. The data was
probably collected for some purpose other than
helping to solve the researcher’s problem at hand.
Advantages: It can be gathered quickly and
inexpensively. It enables researchers to build on
past research.
Problems: Data may be outdated. Variation in
definition of terms. Different units of measurement.
May not be accurate (e.g., census undercount).
Introduction 25
Primary vs. Secondary Data