SlideShare a Scribd company logo
7 HYPOTHETICALS AND YOU TESTING YOUR
QUESTIONS
7: MEDIA LIBRARY
Premium Videos
Core Concepts in Stats Video
· Probability and Hypothesis Testing
Lightboard Lecture Video
· Hypothesis Testing
Difficulty Scale
(don’t plan on going out tonight)
WHAT YOU WILL LEARN IN THIS CHAPTER
· Understanding the difference between a sample and a
population
· Understanding the importance of the null and research
hypotheses
· Using criteria to judge a good hypothesis
SO YOU WANT TO BE A SCIENTIST
You might have heard the term hypothesis used in other classes.
You may even have had to formulate one for a research project
you did for another class, or you may have read one or two in a
journal article. If so, then you probably have a good idea what a
hypothesis is. For those of you who are unfamiliar with this
often-used term, a hypothesis is basically “an educated guess.”
Its most important role is to reflect the general problem
statement or question that was the motivation for asking the
research question in the first place.
That’s why taking the care and time to formulate a really
precise and clear research question is so important. This
research question will guide your creation of a hypothesis, and
in turn, the hypothesis will determine the techniques you will
use to test it and answer the question that was originally asked.
So, a good hypothesis translates a problem statement or a
research question into a format that makes it easier to examine.
This format is called a hypothesis. We will talk about what
makes a hypothesis a good one later in this chapter. Before that,
let’s turn our attention to the difference between a sample and a
population. This is an important distinction, because while
hypotheses usually describe a population,
hypothesis testing deals with a sample and then the results are
generalized to the larger population. We also address the two
main types of hypotheses (the null hypothesis and the research
hypothesis). But first, let’s formally define some simple terms
that we have used earlier in Statistics for People Who (Think
They) Hate Statistics.
SAMPLES AND POPULATIONS
As a good scientist, you would like to be able to say that if
Method A is better than Method B in your study, this is true
forever and always and for all people in the universe, right?
Indeed. And, if you do enough research on the relative merits of
Methods A and B and test enough people, you may someday be
able to say that.
But don’t get too excited, because it’s unlikely you will ever be
able to speak with such confidence. It takes too much money
($$$) and too much time (all those people!) to do all that
research, and besides, it’s not even necessary. Instead, you can
just select a representative sample from the population and test
your hypothesis about the relative merits of Methods A and B
on that sample.
Given the constraints of never enough time and never enough
research funds, with which almost all scientists live, the next
best strategy is to take a portion of a larger group of
participants and do the research with that smaller group. In this
context, the larger group is referred to as a population, and the
smaller group selected from that population is referred to as
a sample. Statistics as a field, in fact, is all about looking at a
sample and inferring to the population it represents. Indeed, the
word statistic technically means a number that describes a
sample (and the word we use for a number that describes a
population is parameter).
A measure of how well a sample approximates the
characteristics of a population is called sampling error.
Sampling error is basically the difference between the values of
the sample statistic and the population parameter. The higher
the sampling error, the less precision you have in sampling, and
the more difficult it will be to make the case that what you find
in the sample indeed reflects what you expected to find in the
population. And just as there are measures of variability
regarding distributions, so are there measures of the variability
of this difference between a sample measure and a population
measure. This is often called the standard error—it’s basically
the standard deviation of the difference between these two
values.
Samples should be selected from populations in such a way that
the sample matches as closely as possible the characteristics of
the population. You know, to minimize the sampling error. The
goal is to have the sample be as much like the population as
possible. The most important implication of ensuring similarity
between the two is that the research results based on the sample
can be generalized to the population. When the sample
accurately represents the population, the results of the study are
said to have a high degree of generalizability.
A high degree of generalizability is an important quality of
good research because it means that the time and effort (and
$$$) that went into the research may have implications for
groups of people other than the original participants.
It’s easy to equate “big” with “representative.” Keep in mind
that it is far more important to have an accurately representative
sample than it is to have a big sample (people often think that
big is better—only true on Thanksgiving, by the way). Having
lots and lots of participants in a sample may be very impressive,
but if the participants do not represent the larger population,
then the research will have little value.
THE NULL HYPOTHESIS
Okay. So we have a sample of participants selected from a
population, and to begin the test of our research hypothesis, we
first formulate the null hypothesis.
The null hypothesis is an interesting little creature. If it could
talk, it would say something like “I represent no relationship
between the variables that you are studying.” In other words,
null hypotheses are statements of equality demonstrated by the
following real-life null hypotheses taken from a variety of
popular social and behavioral science journals. Names have
been changed to protect the innocent.
· There will be no difference between the average score of 9th
graders and the average score of 12th graders on a memory test.
· There is no difference between the effectiveness of
community-based, long-term care and the effectiveness of in-
home, long-term care in promoting the social activity of older
adults when measured using the Margolis Scale of Social
Activities.
· There is no relationship between reaction time and problem-
solving ability.
· There is no difference between high- and low-income families
in the amount of assistance families offer their children in
school-related activities.
What these four null hypotheses have in common is that they all
contain a statement that two or more things are equal or
unrelated (that’s the “no difference” and “no relationship” part)
to each other.
The Purposes of the Null Hypothesis
What are the basic purposes of the null hypothesis? The null
hypothesis acts as both a starting point and a benchmark against
which the actual outcomes of a study can be measured.
Let’s examine each of these purposes in more detail.
First, the null hypothesis acts as a starting point because it is
the state of affairs that is accepted as true in the absence of any
other information. For example, let’s look at the first null
hypothesis we stated earlier:
There will be no difference between the average score of 9th
graders and the average score of 12th graders on a memory test.
Given absolutely no other knowledge of 9th and 12th graders’
memory skills, you have no reason to believe that there will be
differences between the two groups, right? If you know nothing
about the relationship between these variables, the best you can
do is guess. And that’s taking a chance. You might speculate as
to why one group might outperform another, using theory or
common sense, but if you have no evidence a priori (“from
before”), then what choice do you have but to assume that they
are equal?
This lack of a relationship as a starting point is a hallmark of
this whole topic. Until you prove that there is a difference, you
have to assume that there is no difference. And a statement of
no difference or no relationship is exactly what the null
hypothesis is all about. Such a statement ensures that (as
members of the scientific community) we are starting on a level
playing field with no bias toward one or the other direction as
to how the test of our hypothesis will turn out.
Furthermore, if there are any differences between these two
groups, then you have to assume that these differences are due
to the most attractive explanation for differences between any
groups on any variable—chance! That’s right: Given no other
information, chance is always the most likely and attractive
explanation for the observed differences between two groups or
the relationship between variables. Chance explains what we
cannot. You might have thought of chance as the odds of
winning that $5,000 jackpot at the penny slots, but we’re
talking about chance as all that other “stuff” that clouds the
picture and makes it even more difficult to understand the
“true” nature of relationships between variables.
For example, you could take a group of soccer players and a
group of football players and compare their running speeds,
thinking about whether playing soccer or playing football makes
athletes faster. But look at all the factors we don’t know about
that could contribute to differences. Who is to know whether
some soccer players practice more or whether some football
players are stronger or whether both groups are receiving
additional training of different types?
What’s more, perhaps the way their speed is being measured
leaves room for chance; a faulty stopwatch or a windy day can
contribute to differences unrelated to true running speed. As
good researchers, our job is to eliminate chance factors from
explaining observed differences and to evaluate other factors
that might contribute to group differences, such as intentional
training or nutrition programs, and see how they affect speed.
The point is, if we find differences between groups and the
differences are not due to training, then we have no choice but
to attribute the difference to chance. And, by the way, you
might find it useful to think of chance as being somewhat
equivalent to the idea of error. When we can control sources of
error, the likelihood that we can offer a meaningful explanation
for some outcome increases.
The second purpose of the null hypothesis is to provide a
benchmark against which observed outcomes can be compared
to see if these differences are due to some other factor. The null
hypothesis helps to define a range within which any observed
differences between groups can be attributed to chance (which
is the null hypothesis’s contention) or are due to something
other than chance (which perhaps would be the result of the
manipulation of some variable, such as training in our example
of the soccer and football players).
Most research studies have an implied null hypothesis, and you
may not find it clearly stated in a research report or journal
article. Instead, you’ll find the research hypothesis clearly
stated, and this is where we now turn our attention.CORE
CONCEPTS IN STATS VIDEOProbability and Hypothesis
Testing
Hypothesis testing involves analyzing data to test our research
hypothesis. We've collected data and are ready to analyze it.
We'll need to complete three steps. First, examine the data by
creating tables and figures in order to understand it. Second,
calculate descriptive statistics-- measures of central tendency
and variability-- to describe the data. Finally, calculate
inferential statistics to test our research hypothesis. Think of
this process like a criminal trial. State an initial assumption
and a mutually exclusive alternative to this assumption.
Analyze the collected evidence, the data. In this scenario, the
jury has two decisions to make when deciding whether or not
to reject the initial assumption-- guilty or not guilty. These
three steps-- making an initial assumption-- known as the null
hypothesis-- and an alternative to that assumption; analyzing
the collected data; and making a decision whether to reject or
accept the initial assumption-- or null hypothesis-- are all
critical parts of the process of hypothesis testing. The
decision you make is based on the probability of obtaining a
value of a statistic. If your statistic has a low probability of
occurring, the likelihood of this happening is so low because
of chance you reject the initial assumption, or null hypothesis.
This is similar to what happens in jury trials, where if the jury
decides that the likelihood an innocent person did these things
is so low, they'll reject the presumption of innocence and
instead decide on a guilty verdict. What do we mean by "low
probability"? If the probability of a statistic is less than to
reject the null hypothesis. Think about flipping coins. The
first step is to state the null and the alternative hypotheses.
In flipping coins, we start with the null hypothesis assumption
that coins are fair. There is an equal probability of getting
heads or tails. The mutually exclusive alternative to this
assumption is that coins are not fair and there is not an equal
probability of getting a heads or tails. The second step would
be to calculate a statistic. In flipping coins, you could flip a
sample of coins and then count the number of heads. The
next step is to make a decision about the null hypothesis.
You can either reject or accept the null hypothesis-- that coins
are fair. If the number of heads in your sample of coin flips
has a low probability of occurring-- less than that coins are
fair in favor of the alternative hypothesis, that coins are not
fair. We use these examples to illustrate the process of
hypothesis testing. But the reality is that research isn't about
flipping coins and counting the number of heads. And that's
what research is about-- collecting data from samples.
THE RESEARCH HYPOTHESIS
Whereas a null hypothesis is usually a statement of no
relationship between variables or that a certain value is zero,
a research hypothesis is usually a definite statement that a
relationship exists between variables. For example, for each of
the null hypotheses stated earlier, here is a corresponding
research hypothesis. Notice that we said “a” and not “the”
corresponding research hypothesis because there certainly could
be more than one research hypothesis for any one null
hypothesis.
· The average score of 9th graders is different from the average
score of 12th graders on a memory test.
· The effectiveness of community-based, long-term care is
different from the effectiveness of in-home, long-term care in
promoting the social activity of older adults when measured
using the Margolis Scale of Social Activities.
· Slower reaction time and problem-solving ability are
positively related.
· There is a difference between high- and low-income families
in the amount of assistance families offer to their children in
school-related activities.
Each of these four research hypotheses has one thing in
common: They are all statements of inequality. They posit a
relationship between variables and not an equality, as the null
hypothesis does.
The nature of this inequality can take two different forms—a
directional or a nondirectional research hypothesis. If the
research hypothesis posits no direction to the inequality (such
as only saying “different from”), the hypothesis is a
nondirectional research hypothesis. If the research hypothesis
posits a direction to the inequality (such as “more than” or “less
than”), the research hypothesis is a directional research
hypothesis.
The Nondirectional Research Hypothesis
A nondirectional research hypothesis reflects a difference
between groups, but the direction of the difference is not
specified.
For example, the following research hypothesis is
nondirectional in that the direction of the difference between
the two groups is not specified:
The average score of 9th graders is different from the average
score of 12th graders on a memory test.
The hypothesis is a research hypothesis because it states that
there is a difference, and it is nondirectional because it says
nothing about the direction of that difference.
A nondirectional research hypothesis, like this one, would be
represented by the following equation:
(7.1)
H1:¯¯¯X9≠¯¯¯X12,H1:X¯9≠X¯12,
where
· H1 represents the symbol for the first (of possibly several)
research hypotheses,
· ¯¯¯X9X¯9 represents the average memory score for the
sample of 9th graders,
· ¯¯¯X12X¯12 represents the average memory score for the
sample of 12th graders, and
· ≠ means “is not equal to.”
The Directional Research Hypothesis
A directional research hypothesis reflects a difference between
groups, and the direction of the difference is specified.
For example, the following research hypothesis is directional
because the direction of the difference between the two groups
is specified:
The average score of 12th graders is greater than the average
score of 9th graders on a memory test.
One is hypothesized to be greater than (not just different from)
the other.
Examples of two other directional hypotheses are these:
· A is greater than B (or A > B).
· B is greater than A (or A < B).
These both represent inequalities (greater than or less than). A
directional research hypothesis such as the one described
earlier, where 12th graders are hypothesized to score better than
9th graders, would be represented by the following equation:
(7.2)
H1:¯¯¯X12>¯¯¯X9,H1:X¯12>X¯9,
where
· H1 represents the symbol for the first (of possibly several)
research hypotheses,
· ¯¯¯X9X¯9 represents the average memory score for the
sample of 9th graders,
· ¯¯¯X12X¯12 represents the average memory score for the
sample of 12th graders, and
· > means “is greater than.”
What is the purpose of the research hypothesis? It is this
hypothesis that is directly tested as an important step in the
research process. The results of this test are compared with
what you expect if you were wrong (reflecting the null
hypothesis) to see which of the two is the more attractive
explanation for any differences between groups or variables you
might observe.
Table 7.1 gives the four null hypotheses and accompanying
directional and nondirectional research hypotheses.
Another way to talk about directional and nondirectional
hypotheses is to talk about one- and two-tailed tests. A one-
tailed test (reflecting a directional hypothesis) posits a
difference in a particular direction, such as when we
hypothesize that Group 1 will score higher than Group 2. A two-
tailed test (reflecting a nondirectional hypothesis) posits a
difference but in no particular direction. We talk about “tails”
because we often understand statistical results by applying them
to a normal curve that has two “tails.” The importance of this
distinction begins when you test different types of hypotheses
(one- and two-tailed) and establish probability levels for
rejecting or not rejecting the null hypothesis. More about this
in Chapters 8 and 9. Promise.
Table 7.1 ⬢ Null Hypotheses and Corresponding Research
Hypotheses
Null Hypothesis
Nondirectional Research Hypothesis
Directional Research Hypothesis
There will be no difference in the average score of 9th graders
and the average score of 12th graders on a memory test.
Twelfth graders and 9th graders will differ on a memory test.
Twelfth graders will have a higher average score on a memory
test than will 9th graders.
There is no difference between the effectiveness of community-
based, long-term care for older adults and the effectiveness of
in-home, long-term care for older adults when measured using
the Margolis Scale of Social Activities.
The effect of community-based, long-term care for older adults
is different from the effect of in-home, long-term care for older
adults when measured using the Margolis Scale of Social
Activities.
Older adults exposed to community-based, long-term care
score higher on the Margolis Scale of Social Activities than do
older adults receiving in-home, long-term care.
There is no relationship between reaction time and problem-
solving ability.
There is a relationship between reaction time and problem-
solving ability.
There is a positive relationship between reaction time and
problem-solving ability.
There is no difference between high- and low-income families
in the amount of assistance families offer their children in
educational activities.
The amount of assistance offered by high-income families to
their children in educational activities is different from the
amount of support offered by low-income families to their
children in educational activities.
The amount of assistance offered by high-income families to
their children in educational activities is more than the amount
of support offered by low-income families to their children in
educational activities.Some Differences Between the Null
Hypothesis and the Research Hypothesis
Besides the null hypothesis usually representing an equality and
the research hypothesis usually representing an inequality, the
two types of hypotheses differ in several other important ways.
First, for a bit of review, the two types of hypotheses differ in
that one (the null hypothesis) usually states that there is no
relationship between variables (an equality), whereas the
research hypothesis usually states that there is a relationship
between the variables (an inequality). This is the primary
difference.
Second, null hypotheses always refer to the population, whereas
research hypotheses usually refer to the sample. We select a
sample of participants from a much larger population. We then
try to generalize the results from the sample back to the
population. If you remember your basic philosophy and logic
(you did take these courses, right?), you’ll remember that going
from small (as in a sample) to large (as in a population) is a
process of induction.
Third, because the entire population cannot be directly tested
(again, it is impractical, uneconomical, and often impossible),
you can’t say with 100% certainty that there is no real
difference between segments of the population on some
variable. Rather, you have to infer it (indirectly) from the
results of the test of the research hypothesis, which is based on
the sample. Hence, the null hypothesis must be indirectly tested,
and the research hypothesis can be directly tested.
Fourth, in statistics, the null hypothesis is written with an equal
sign, while the research hypothesis is written with a not equal
to, greater than, or less than sign.
Fifth, null hypotheses are always written using Greek symbols,
and research hypotheses are always written using Roman
symbols. Thus, the null hypothesis that the average score for
9th graders is equal to that of 12th graders is represented like
this:
(7.3)
H0:μ9=μ12,H0:μ9=μ12,
where
· H0 represents the null hypothesis,
· µ9 represents the theoretical average for the population of 9th
graders, and
· µ12 represents the theoretical average for the population of
12th graders.
The research hypothesis that the average score for a sample of
12th graders is greater than the average score for a sample of
9th graders is shown in Formula 7.2 (presented earlier).
Finally, because you cannot directly test the null hypothesis, it
is an implied hypothesis. But the research hypothesis is explicit
and is stated as such. This is another reason why you rarely see
null hypotheses stated in research reports and will very often
see a statement (be it in symbols or words) of the research
hypothesis.
LIGHTBOARD LECTURE VIDEO
Hypothesis Testing
Hypothesis testing is a weird sort of statistics thing. We
pretend that we think something is true, when in fact we think
the opposite is true. Let me show you what I mean. Here is a
correlation coefficient. And your research hypothesis might be
hey, A and B are correlated. In reality, the correlation is
either greater than isn't. It's either greater than Your null
hypothesis is going to be that it's equal to But what you really
think is that it's greater than So you have what's really out
there, and you can't affect that at all. Then you have your
guesses. And your guesses are either going to be that it's
greater than exactly equal to All you have to prove is that it's
greater than You don't have to prove it's exactly .26 or any of
that stuff. Now the guess that says it's greater than your
research hypothesis. That's what you really think. The guess
that says it's equal to that's your null hypothesis. So you've
got these two situations-- whatever is true in the real world
and whatever your guess was. So there's really only four
possibilities, right? You could be right. You could be wrong.
There's two ways to be right and two ways to be wrong. Let's
think. If your guess is that it's the correlation is greater than
and it really is greater than If your guess is that is greater than
and it's not greater than then that's really sad and you're
wrong. If your guess is the null hypothesis, which is how we
typically do things, you're guessing that the correlation is
exactly equal to And if it really is, that, strangely enough,
also makes you really happy. But if you're guessing that the
correlation is exactly equal to that makes you sad. So a
researcher usually has this research hypothesis-- in this case,
that the correlation is greater than And if they're right, that's
the best possible outcome.
WHAT MAKES A GOOD HYPOTHESIS?
You now know that hypotheses are educated guesses—a starting
point for a lot more to come. As with any guess, some
hypotheses are better than others right from the start. We can’t
stress enough how important it is to ask the question you want
answered and to keep in mind that any hypothesis you present is
a direct extension of the original question you asked. This
question will reflect your personal interests and motivation and
your understanding of what research has been done previously.
With that in mind, here are criteria you might use to decide
whether a hypothesis you read in a research report or one that
you formulate is acceptable.
To illustrate, let’s use an example of a study that examines the
effects of providing afterschool child care to employees who
work late on the parents’ adjustment to work. Here is a well-
written directional research hypothesis:
Parents who enroll their children in afterschool programs will
have a more positive attitude toward work, as measured by the
Attitude Toward Work survey, than will parents who do not
enroll their children in such programs.
Here are the criteria.
First, a good hypothesis is stated in declarative form and not as
a question. (It ends with a period or, if you’re really excited, an
exclamation mark!) While the preceding hypothesis may have
started in the researcher’s mind as the question “What are the
benefits of afterschool programs at work … ?” it was not posed
because hypotheses are most effective when they make a clear
and forceful statement.
Second, a good hypothesis posits an expected relationship
between variables. The hypothesis in our example clearly
describes the expected relationships among afterschool child
care and parents’ attitude. These variables are being tested to
see if one (enrollment in the afterschool program) has an effect
on the others (attitude).
Notice the word expected in the second criterion. Defining an
expected relationship is intended to prevent a fishing trip to
look for any relationships that may be found (sometimes called
the “shotgun” approach), which may be tempting but is not very
productive. You do get somewhere using the shotgun approach,
but because you don’t know where you started, you have no idea
where you end up.
In the fishing-trip approach, you throw out your line and take
anything that bites. You collect data on as many things as you
can, regardless of your interest or even whether collecting the
data is a reasonable part of a scientific investigation. Or, to use
a shotgun analogy, you load up them …
6
As you think about surveying clients, donors, or the general
public you may start by figuring out whom to contact.
Contacting everyone is seldom practical or necessary. You may
use too many resources and take up too much time. Instead you
will want to contact a sample of clients, donors, or residents.
This chapter covers the principles guiding the selection of a
sample. Knowing how to design a sample is only the beginning.
You need to identify individual sample members, contact them,
and encourage them to respond. Technology is rapidly changing
strategies for contacting sample members and encouraging them
to respond. Survey organizations are studying the impact of
changing lifestyles and new communications technologies. In
this chapter we focus on the basic sampling and data collection
strategies. At the end of the chapter we refer you to how-to
books and other sources to fill in the gaps. Our approach keeps
us from overwhelming you with information that may have a
short shelf life.
Sampling is an economical and effective way to learn about
individuals and organizations. This is true if you seek
information from individuals, case records, agency
representatives, or computerized datasets. Even with a relatively
small group, sampling enables you to gather information
relatively quickly. Depending on how you select the sample you
may be able to use the findings to generalize beyond the
sampled individuals, organizations, or other units. Because
sampling is a jargon-rich topic, we begin by defining some key
terms.
First, you want to recognize the difference between a population
and a sample. The population is the total set of units that you
are interested in. A population is often composed of individuals,
but it may consist of other units such as organizations,
households, records, or computers. A sample is a subset of the
population. Closely connected to the concepts of population and
sample are the terms parameter and statistics. A parameter is a
characteristic of the entire population. A statistic is a
characteristic of a sample. The percentage of citizens in a city
favoring a fund to finance affordable housing is a parameter.
The percentage of sampled residents who favor the fund is a
statistic. Depending on how you select the sample you may be
able to use the statistic and the sampling error to estimate the
parameter. The sampling error, which is discussed in more
detail in , is the difference between the parameter and the
statistic and is used to estimate the parameter.
One of your first steps is to precisely define the population
represented by the sample, such as “all adults over the age of 21
living in Clark County on July 1.” After you define your
population your next step is to find a sampling frame.
A sampling frame is a list of names of individuals or
organizations in your population. A sampling frame should
contain all members of your population. Complete lists seldom
exist. So, in reality the sampling frame is the list of potential
sample members. Sampling frames are not perfect. They may
aggregate units, list units more than once, omit units, or include
units that are not part of the population. A list of building
addresses, for example, may include apartment buildings, but
not individual units. If you plan to sample individual
residences, this sampling frame would be inadequate.
The unit of analysis indicates what the data represent. You may
think of the unit of analysis as what constitutes a case when you
enter data on a spreadsheet. If you collect data from housing
agencies, the sampling unit is housing agency. If you report the
number of employees and the size of the budget the unit of
analysis is housing agency. If you collect data on individual
clients from the sampled agencies, then “client” would be the
unit of analysis. illustrates the components of a sample.
FIGURE 6.1
From Population to Sampled Unit
A sample design describes the strategy for selecting the sample
members from the population. Designs are classified as
probability and nonprobability. With a probability sample, each
unit of the population has a known, nonzero chance of being in
the sample. Probability samples use randomization to avoid
selection biases and use statistical theory to estimate
parameters. You cannot estimate parameters with nonprobability
samples. Both types of designs have value but for different
purposes and in different situations.
AN APPLICATION OF SAMPLING TERMINOLOGY
• Population. All participants who have completed the Clark
County Job Training Program within the past 2 years
• Parameter. Average starting salaries of all graduates of the
Clark County Job Training Program within the past 2 years
• Sampling frame. All (8) graduation lists from the past 2 years
• Sampling design. Probability sample
• Sample. 120 graduates randomly selected from sampling
frame
• Unit of analysis. Individual graduates
• Statistic. The average starting salary of the 120 graduates was
$25,000 ■
With a probability sample, you can calculate the chance that a
member of the population has of being sampled. We discuss
four common probability sampling designs: simple random
sampling, systematic sampling, stratified sampling, and cluster
sampling. These designs are often used together. Stratified and
cluster samples also use simple random sampling or systematic
sampling to select subjects. Multistage sampling combines
cluster sampling with other designs.
Simple random sampling requires that each unit in the
population has an equal probability of being in the sample.
Drawing names from a hat is the prototypical example of a
simple random sample. An analogous strategy is used to draw
lottery numbers. You do not need to put names in a hat or
marked balls in a tumbler. Rather you can use technology to
help you draw a random sample. If the cases are contained in an
electronic database you may use a computer program to select
your sample. These programs use an algorithm to create a
random sample. If the cases are not stored in an electronic
database you can number the cases. In this method, using your
calculator or a list of random numbers, you select a set of
random numbers. Finally, you match each selected random
number to the case with the same number.
If the cases are not stored electronically systematic sampling is
an acceptable alternative. It is easier and normally produces
results comparable to those of a simple random sample. To
construct a systematic sample, you divide the number of units in
the sampling frame (N) by the number desired for the sample
(n). The resulting number is called the skip interval (k). If a
sampling frame consists of 50,000 units and a sample of 1,000
is desired, the skip interval equals 50 (50,000 divided by
1,000). You select a random number, go to the sampling frame,
and use the random number to select the first case. You then
pick every kth unit for the sample. In our example every 50th
case would be chosen. If the random number 45 was selected the
cases 45, 95, 145, and so on would be in the sample. With
systematic sampling, treat the list as circular, so the last listed
unit is followed by the first. You should go through the entire
sampling frame at least once.
Systematic sampling has one potentially serious problem, that
of periodicity. If the items are listed in a pattern and the skip
interval coincides with the pattern, the sample will be biased.
Consider what could happen in sampling the daily activity logs
of a 911 call center. If the skip interval was 7 the activity logs
in the sample would all be for the same day, that is, all
Mondays or all Tuesdays, and so forth. A skip interval of any
multiple of 7 would have the same result. Periodicity is
relatively rare. If you notice something strange about your
sample, for example, that it is all female, includes only top
administrators, or has only corner houses, check for periodicity.
An easy fix that often works is to double the size of your skip
interval (k × 2). Go through the sampling frame once to get half
your sample, then choose another random starting point and go
through it a second time.
Systematic sampling may be the only feasible way to get a
probability sample of a population of unknown size, such as
people attending a community festival. You estimate the
population size, that is, the number of attendees, determine a
skip interval, and pick a random beginning point. If you plan to
sample every 20th person and start with the 6th person, the 6th
person to arrive (or depart) would be selected, as would the
26th person, the 46th person, and so on, until the end of the
sampling period.
Stratified sampling ensures that a sample adequately represents
selected groups in the population. You should consider using
stratified sampling if you plan to compare groups, if you need
to focus on a group that is a small proportion of your
population, or if your sampling frame is already divided by
groups. First, you divide or classify the population into strata,
or groups, on some characteristic such as gender, age, or
institutional affiliation. Every member of the population should
be in one and only one stratum. Use either simple random
sampling or systematic sampling to draw samples from each
stratum.
Stratified samples may be proportional or disproportional.
In proportional stratified samples the same sampling fraction is
applied to each stratum. The sample size for each stratum is
proportional to its size in the population. If, for example, you
wanted to compare three groups of employees—professional
staff, technical staff, and clerical staff—you would designate
each group as a stratum. You would select your samples by
taking the same percentage of members from each stratum, say
10 percent of the professional staff, 10 percent of the technical
staff, and 10 percent of the clerical staff. The resulting sample
would consist of three strata, each equal in size to the stratum’s
proportion of the total population. illustrates a proportional
stratified sample.
The percentage of members selected for the sample need not be
the same for each stratum. In disproportional stratified
sampling, the same sampling fraction is not applied to all strata.
Disproportional sampling is useful when a characteristic of
interest occurs infrequently in the population, making it
unlikely that a simple random or a proportional sample will
contain enough members with the characteristic to allow full
analysis. For example, a mentoring program may want to
compare recruitment and retention of Spanish-speaking
volunteers with that of non–Spanish speakers. You will want a
sample large enough so that you can analyze and compare the
two groups. If 5 percent of its volunteers are Spanish speakers,
a random sample of 200 volunteers should have about 10
Spanish speakers—too few cases for analysis. Therefore, you
might survey a larger percentage of Spanish-speaking
volunteers and a smaller percentage of other volunteers; you
could decide that half the sample should be Spanish speakers
and an equal number of non-Spanish speakers, thus over-
representing Spanish speakers. A higher percentage of Spanish
speakers will have been sampled and non-Spanish speakers
under-represented.
EXAMPLE 6.1 AN APPLICATION OF PROPORTIONAL
STRATIFIED SAMPLING
• Problem. The director of volunteer recruitment for the state
office of a large nonprofit organization wanted information
about volunteer applicants to the organization over the previous
4 years.
• Population. All applications to the volunteer service of the
nonprofit organization for each of the past selects 4 years.
• Sampling frame. Agency’s electronic file of
applications(organized by application date).
• Sampling design. Proportional stratified sample with year of
application as strata. A computer program select a random
sample of applications for each year, using the same sampling
fraction for each group (). ■
TABLE 6.1
Proportional Stratified Sampling
Strata
Number of Applications
Sampling Fraction (%)
Number in Sample
Year 1
350
15
53
Year 2
275
15
41
Year 3
250
15
38
Year 4
230
15
35
Total
1,150
167
Each strata in a disproportional stratified sample constitutes a
separate sample. To conduct your analysis you must keep the
samples separate as you compare the Spanish-speaking
volunteers with non–Spanish speakers. To determine
characteristics of all sampled volunteers you must weigh the
samples. First, determine each stratum’s sample size for a
proportional sample. To calculate the weight for each stratum,
divide the size of the proportional sample by the size of the
disproportional sample.
When you combine the samples, each Spanish speaker’s
responses would be multiplied or weighted by 0.1 and each
response by non–Spanish speakers would be multiplied by 1.9.
Cluster and multistage sampling take advantage of the fact that
members of a population can be located within groups or
clusters, such as cities and counties. Cluster sampling is useful
if a sampling frame does not exist or is impractical to use. For
cluster and multistage samples, we randomly select clusters and
then units in the selected clusters. For example, for a study of
Boys and Girls Clubs we might first select a sample of counties.
Either our final sample would consist of all the Boys and Girls
Clubs in the sampled counties or we would select a sample of
clubs from these counties. Unlike stratified samples, which
sample members from each stratum, cluster samples sample
only members from the selected clusters.
Multistage sampling is a variant of cluster sampling. It proceeds
in stages as you sample units dispersed over a large geographic
area. The following example shows how you could design a
sample to survey residents in a state’s long-term care facilities:
■ Stage One. Draw a probability sample of counties, that is,
you choose large clusters containing smaller clusters.
■ Stage Two. From your sample of counties choose a
probability sample of incorporated areas.
■ Stage Three. For your sampled incorporated areas obtain lists
of long-term care facilities located in them.
■ Stage Four. Select a sampling strategy
■ Alternative 1. Select all residents in the facilities identified
at Stage 3 or
■ Alternative 2. Select a sample of long-term facilities and
then select all residents in the selected facilities or
■ Alternative 3. Select a sample of long-term facilities and
select a sample of residents in the selected facilities.
The units selected at each stage are called sampling units. The
sampling unit may not be the same as the unit of analysis.
Different sampling units are selected at each stage. In this
example, the unit of analysis is residents in long-term care
facility. However, different sampling units were selected at
each stage of the process. Note that you can incorporate
different sampling strategies into this design. For example, you
may use stratified sampling at Stage One to make sure that you
have counties for each of the state’s major regions, such as
upstate or downstate, or the mountains and the beaches. You
may use simple random sampling to choose specific residents
for your sample.
Cluster sampling is recommended if your population is
distributed over a large geographic area. Without the ability to
limit the sample to discrete areas the costs and logistics would
make probability sampling difficult, if not impossible. If site
visits are needed to collect data, visiting a sample spread thinly
over a wide area will be extremely expensive. Cluster sampling
can help reduce this cost. Cluster sampling also helps
compensate for the lack of a sampling frame. Combining cluster
and multistage sampling requires an investigator to develop a
sampling frame for just the last stages of the process.
Although cluster and multistage sampling methods reduce travel
time and costs, they require a larger sample than other methods
for the same level of accuracy. In the multistage process,
probability samples are selected at each stage. Each time a
sample is selected there is some sampling error; thus, the
overall sampling error is likely to be larger than if a random
sample of the same size were drawn.
Nonprobability samples cannot produce estimates with
mathematical precision, because the sample members do not
have a known chance of being selected. Nonprobability
sampling is useful when designing a probability sampling is not
possible, studying small or hard-to-reach populations, doing an
exploratory or preliminary study, or conducting in-depth
interviews or focus groups. In the following paragraphs we
describe four common nonprobability designs: availability
sampling, quota sampling, purposive sampling, and snowball
sampling.
Availability Sampling: Availability or convenience sampling is
done when cases are selected because they are easily accessed.
Subjects may be selected because they can be contacted with
little effort. For example, if an emergency shelter wanted to
interview clients about the service availability and quality,
everyone at the shelter on a given day might be invited to
participate.
The obvious flaw of availability sampling is that it may exclude
cases that represent the target population, and the findings
cannot be generalized. The findings do not describe the
knowledge, attitudes, beliefs, or behaviors of others in the
target population. Consider the study at the emergency shelter.
You may avoid approaching certain people. For instance,
individuals who are sleeping or nodding off may not be asked to
participate. These individuals may be clients of a methadone
treatment program (methadone causes severe drowsiness), and
their observations about drug treatment services may go
unheard. If you conduct the study in the summer, the population
may be very different from the winter population. The day staff
may have a reputation that affects who stays around the shelter
during the day.
Quota Sampling: Quota sampling, often used in market research,
is less common in social science studies. Quota samples attempt
to overcome the major limitation of availability samples by
defining the percentage of members to be sampled from
specified groups. An agency considering opening a child care
center in an ethnically diverse community may want to survey
local families. You could design a quota sample to ensure that
you get input from families of all ethnicities. If the community
is 25% Asian, 60% White, 10% African American, and 5%
other, a sample of 200 should have 50 Asians, 120 Whites, 20
African Americans, and 10 from other groups. If the selection
of sample members is based on convenience and judgment, they
may also have characteristics that make them more
approachable. This adds a potential bias to the sample.
While your sample may be representative of one variable, it is
not necessarily representative of other variables. For example,
your sample of families based on their ethnicity probably isn’t
representative of age or income. As part of designing a quota
sample you need to determine the most meaningful variable and
its variation in the population. It may be easy to find
information on some variables (e.g., race, gender, age) but more
difficult to find out about other characteristics such as
incidence of substance abuse or child-rearing practices.
Purposive Sampling: Purposive sampling selects cases based on
specialized knowledge, distinct experience, or unique position.
You might use a purposive sample to study very successful
mentoring programs, people who underwent innovative medical
treatments, or women governors. You might select cases to
capture maximum variation of a phenomenon. The selected
cases may provide rich, detailed information about the
phenomenon of interest. Some researchers have argued that we
can learn more from very successful programs and limit their
samples to such programs. A variation of this theme is to
compare a set of very successful programs with unsuccessful
ones.
If you want to conduct in-depth interviews or focus groups you
may want to select respondents deliberately. You will want to
carefully select whom to interview. You will want to interview
individuals who can provide insight about the phenomenon
being studied, are willing to talk, and have diverse perspectives.
Such individuals are referred to as key informants. Good
informants will have given the subject matter some thought and
can express their thoughts, feelings, and opinions.
Snowball Sampling: Snowball sampling starts by finding one
member of the population of interest, speaking to him or her,
and asking that person to identify other members of the
population. You can use it to identify organizations or
individuals. This process is continued until the desired sample
has been identified. The number of members in the sample
“snowballs.” Snowball sampling is most often used to contact
individuals or groups that are hard to reach, difficult to identify,
or tightly interconnected. For example, if you wanted to
interview sex workers, you might identify a sex worker who is
willing to talk to you and to identify other sex workers. In
addition to identifying other sex workers, an informant may also
vouch for your credibility.
A concern when using snowball sampling is that the individuals
who are referred have not consented to being identified. This
may not be a problem for some populations, but it is for others.
Consequently, for this type of sampling, perhaps more than for
others, you should remind any informant of the need to protect
people’s privacy and have the informant seek permission before
sharing names and contact information.
When conducting qualitative research including interviews and
focus groups, determining how many cases to include in the
sample may be difficult. Completeness and saturation serve as a
guide for knowing when to stop selecting
participants.Completeness suggests that the subjects have given
you a clear, well-defined perspective of the theme or
issue. Saturation means that you are confident that little new
information will be learned from more interviews or focus
groups. Once you find that your newest subjects are sharing the
same ideas, themes, and perspectives as previous participants,
you can stop.
The appropriate sample size for quantitative surveys is based on
several factors. The first is how much sampling error you are
willing to accept, since accuracy is important in determining
sample size. Greater accuracy usually can be obtained by taking
larger samples. The confidence you wish to have in the results
and the variability within your target population also play a role
in determining sample size. Both the desired degree of
confidence and the population variability are related to
accuracy. We discuss these terms in more detail and their
relation to sample size in .
People unfamiliar with sampling theory often assume that a
major factor determining sample size is the size of the entire
population. That is, they may believe that representing a
population of 500,000 requires a correspondingly larger sample
than representing a population of 20,000. Generally, larger
samples will yield better estimates of population parameters
than will smaller samples. Yet, increasing the size of the sample
beyond a certain point results in very little improvement and
also, additional units bring additional expense. Thus you must
balance the need for accuracy against the need to control costs.
Further, with very large samples, the quality of the data may
even decrease. Note that the relationship between sample size
and accuracy applies only to probability samples. Sample size
for nonprobability sampling must be governed by other
considerations, such as the opportunity to get an in-depth
understanding of a problem and possible solutions.
As you think about sample size keep in mind the study’s
purpose. Unless its purpose is well defined and the validity of
your measurements has been established, conducting a
preliminary study with a small sample may be more efficient
than spending resources on a larger sample.
Sampling errors come about when we draw a sample rather than
study the entire population. If a probability sampling method
has been used, the resulting error can be estimated. This is the
powerful advantage of probability sampling. But other types of
errors can cause a sample statistic to be inaccurate.
Nonsampling errors result from flaws in the sampling design or
from faulty implementation. To reduce nonsampling errors you
must attend carefully to the selection of the sampling frame, the
implementation of the design, and the quality of the measures.
Nonsampling errors are serious, and their impact on the results
of a study may be unknown. Taking a larger sample may not
decrease nonsampling error; the errors may actually increase
with a larger sample. For example, coding and transcription
errors may increase if staff rushes to complete a large number
of forms. Similarly, a large sample may result in fewer attempts
to reach respondents who are not at home, thus increasing the
nonresponse rate.
If members of the sample who respond are consistently different
from nonrespondents, the sample will be biased. Other
nonsampling errors include unreliable or invalid measurements,
mistakes in recording and transcribing data, and failure to
follow up on nonrespondents. If a sampling frame excludes
certain members of a subgroup, such as low- or high-income
individuals, substantial bias may be built into the sample.
You can collect data using mail surveys, telephone interviews,
e-mail surveys, Web-based surveys, in-person interviews, or a
combination of these. To decide which to use you may weigh
time, cost, and your need for a probability sample. To get a
probability sample you need to consider if you can find an
appropriate sampling frame, contact sample members, and
encourage their responses. Your well-designed probability
sample may become a nonprobability sample if you have a poor
sampling frame, are unable to contact members of the sample,
or have a low response rate.
One of the first things to ask yourself is why will a subject
answer your questions. You may find that clients, donors, or
staff will respond if they see the value of the study and your
request isn’t burdensome. The general public may not be so
inclined. Survey response rates have declined substantially in
recent years, which is attributed both to less success in
contacting respondents and to more people refusing to
participate. Low response rates, the percentage of the sample
that responds, increase costs and call into question accuracy of
results. Without careful planning, you risk a low response rate
and wasting your time.
The next question you may ask yourself about contacting your
subjects is “what will I say?” While the idea of contacting
strangers to ask survey or interview questions may seem
daunting, it’s actually pretty easy once you have a planned
study. Much of what you will say when contacting subjects can
be taken from your informed consent form discussed in . When
you contact subjects, the most important thing to do is to tell
them who you are, what you want from them, how long it will
take, and what they will get out of it. Most people are very busy
but many will take the time to help you if the time seems
reasonable and the study worthwhile.
In this chapter we focus on traditional data collection strategies
and what you will want to consider in choosing a strategy. We
do not cover the specific details of how to conduct a survey:
there are excellent “how to” guides that you should consult if
you plan a specific study. We spend little time on the
opportunities for and challenges to data collection, such as
identifying and contacting potential respondents, offered by
new communications technologies. Research on new
technologies is in its infancy, too soon for us to know how they
can affect your work. For instance, when considering a
telephone survey you may want to investigate including cell
phones in your sample. In addition, with the availability of
inexpensive online programs to design and administer online
surveys and the diffusion of Internet access, you may consider
conducting an online survey, which opens up opportunities to
include graphics and even video clips. As the costs and logistics
of in-person interviewing climb, you may explore opportunities
to conduct interviews and focus groups via video conferencing.
To take advantage of new technologies and their impact may
require you to keep abreast of research being conducted by
survey organizations, such as the Institute for Social Research
at the University of Michigan.
Mail surveys come to mind when we think about self-
administered surveys. As is true of all surveying techniques,
mail surveys have their unique advantages and disadvantages.
All surveys require time, as you write questions, design a
questionnaire, and find a sampling frame with contact …
PPOL 505
Practice Exercise Instructions
You will have 5 practice exercise assignments throughout this
course. Each assignment will consist of questions related to the
course material for that module/week.
Practice exercises are your opportunity to practically apply the
course materials. You will use SPSS or the course texts to
answer various application questions related to course content.
Assignments will vary in length. See the specific instructions
for each assignment.
All questions and sections of questions must be clearly labeled
in your Word document. Copy any required SPSS tables into
your Word document and label them appropriately. Show your
work when required. Answers that are not clearly labeled will
not receive credit.
For each set of practice exercises, submit your assignment by
11:59 p.m. (ET) on Sunday of the assigned module/week.
Practice Exercises Grading Rubric
Criteria
Levels of Achievement
Content
(70%)
Advanced
92-100%
Proficient
84-91%
Developing
1-83%
Not present
Total
Organization
45 to 49 points
· Student provides a complete, detailed, and accurate response
to each of the questions or concepts assigned.
· Student provides accurate SPSS output when required.
41 to 44.5 points
· Student responds accurately to each of the questions or
concepts assigned.
· Student provides accurate SPSS output when required.
1 to 40.5 points
· The student has accuracy issues with their answers to some of
the questions or concepts assigned.
· Student either provides inaccurate SPSS output or does not
provide output where required.
0 points
Not present
Structure (30%)
Advanced
92-100%
Proficient
84-91%
Developing
1-83%
Not present
Total
Grammar, Spelling, & Turabian
19.5 to 21 points
Minimal to no errors in grammar, spelling, or Turabian.
17.5 to 19 points
Some errors in grammar, spelling, or Turabian.
1 to 17 points
Numerous errors in grammar, spelling, or Turabian.
0 points
Not present
Professor Comments:
Total:
/100

More Related Content

Similar to 7 HYPOTHETICALS AND YOU TESTING YOUR QUESTIONS7 MEDIA LIBRARY.docx

Net set research methodology
Net set research  methodologyNet set research  methodology
Net set research methodologyamitkuls
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingpraveen3030
 
Psychological Research - Tagged.pdfp
Psychological Research - Tagged.pdfpPsychological Research - Tagged.pdfp
Psychological Research - Tagged.pdfphwn5gfwms4
 
Chap3_ business reaserch
Chap3_ business reaserchChap3_ business reaserch
Chap3_ business reaserchUmair Riaz
 
Essay On Juvenile Incarceration
Essay On Juvenile IncarcerationEssay On Juvenile Incarceration
Essay On Juvenile IncarcerationLissette Hartman
 
Google scholarAdvantages of using Google scholar for research·.docx
Google scholarAdvantages of using Google scholar for research·.docxGoogle scholarAdvantages of using Google scholar for research·.docx
Google scholarAdvantages of using Google scholar for research·.docxwhittemorelucilla
 
CourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDFCourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDFJasmine Tamanaha
 
Mba724 s2 w1 elements of scientific research
Mba724 s2 w1 elements of scientific researchMba724 s2 w1 elements of scientific research
Mba724 s2 w1 elements of scientific researchRachel Chung
 
WHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docx
WHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docxWHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docx
WHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docxphilipnelson29183
 
Research Methodology Module-05
Research Methodology Module-05Research Methodology Module-05
Research Methodology Module-05Kishor Ade
 

Similar to 7 HYPOTHETICALS AND YOU TESTING YOUR QUESTIONS7 MEDIA LIBRARY.docx (19)

Net set research methodology
Net set research  methodologyNet set research  methodology
Net set research methodology
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Articulo 50 palabras
Articulo 50 palabras Articulo 50 palabras
Articulo 50 palabras
 
Poli sci.original
Poli sci.originalPoli sci.original
Poli sci.original
 
Psychological Research - Tagged.pdfp
Psychological Research - Tagged.pdfpPsychological Research - Tagged.pdfp
Psychological Research - Tagged.pdfp
 
Chap3_ business reaserch
Chap3_ business reaserchChap3_ business reaserch
Chap3_ business reaserch
 
Research Methods in psychology
Research Methods in psychologyResearch Methods in psychology
Research Methods in psychology
 
Research Designs.pdf
Research Designs.pdfResearch Designs.pdf
Research Designs.pdf
 
Essay On Juvenile Incarceration
Essay On Juvenile IncarcerationEssay On Juvenile Incarceration
Essay On Juvenile Incarceration
 
What is research
What is researchWhat is research
What is research
 
Google scholarAdvantages of using Google scholar for research·.docx
Google scholarAdvantages of using Google scholar for research·.docxGoogle scholarAdvantages of using Google scholar for research·.docx
Google scholarAdvantages of using Google scholar for research·.docx
 
Chapter 1 - AP Psychology
Chapter 1 - AP PsychologyChapter 1 - AP Psychology
Chapter 1 - AP Psychology
 
CourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDFCourseProjectReviewPaper.jktamanaPDF
CourseProjectReviewPaper.jktamanaPDF
 
Chapter1
Chapter1Chapter1
Chapter1
 
Mba724 s2 w1 elements of scientific research
Mba724 s2 w1 elements of scientific researchMba724 s2 w1 elements of scientific research
Mba724 s2 w1 elements of scientific research
 
WHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docx
WHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docxWHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docx
WHERE TO START CHP. 2LEARNING OBJECTIVES· Discuss how a hypo.docx
 
Formulación de hipotesis
Formulación de hipotesisFormulación de hipotesis
Formulación de hipotesis
 
Research Methodology Module-05
Research Methodology Module-05Research Methodology Module-05
Research Methodology Module-05
 
Chapter 1 Ap Psych- Research Methods
Chapter 1 Ap Psych- Research MethodsChapter 1 Ap Psych- Research Methods
Chapter 1 Ap Psych- Research Methods
 

More from taishao1

9.1 Give examples of applications of IPsec.9.2 What servic.docx
9.1 Give examples of applications of IPsec.9.2 What servic.docx9.1 Give examples of applications of IPsec.9.2 What servic.docx
9.1 Give examples of applications of IPsec.9.2 What servic.docxtaishao1
 
a brief description of two roles that a forensic psychology professi.docx
a brief description of two roles that a forensic psychology professi.docxa brief description of two roles that a forensic psychology professi.docx
a brief description of two roles that a forensic psychology professi.docxtaishao1
 
A 65-year-old obese African American male patient presents to hi.docx
A 65-year-old obese African American male patient presents to hi.docxA 65-year-old obese African American male patient presents to hi.docx
A 65-year-old obese African American male patient presents to hi.docxtaishao1
 
A case study assessing risk and proposing security for some chosen o.docx
A case study assessing risk and proposing security for some chosen o.docxA case study assessing risk and proposing security for some chosen o.docx
A case study assessing risk and proposing security for some chosen o.docxtaishao1
 
A 3-4-page single-spaced essay that draws on materials posted in m.docx
A 3-4-page single-spaced essay that draws on materials posted in m.docxA 3-4-page single-spaced essay that draws on materials posted in m.docx
A 3-4-page single-spaced essay that draws on materials posted in m.docxtaishao1
 
a 350+ word summary based on Malinowski’s writings that answers the .docx
a 350+ word summary based on Malinowski’s writings that answers the .docxa 350+ word summary based on Malinowski’s writings that answers the .docx
a 350+ word summary based on Malinowski’s writings that answers the .docxtaishao1
 
A - Historian Marybeth Hamilton characterizes the mainstreams obses.docx
A - Historian Marybeth Hamilton characterizes the mainstreams obses.docxA - Historian Marybeth Hamilton characterizes the mainstreams obses.docx
A - Historian Marybeth Hamilton characterizes the mainstreams obses.docxtaishao1
 
918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docx
918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docx918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docx
918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docxtaishao1
 
8B20M017 Teaching Note SOBEYS—PROJECT SUNRIS.docx
8B20M017  Teaching Note  SOBEYS—PROJECT SUNRIS.docx8B20M017  Teaching Note  SOBEYS—PROJECT SUNRIS.docx
8B20M017 Teaching Note SOBEYS—PROJECT SUNRIS.docxtaishao1
 
875 words Include citations for all unoriginal ideas, .docx
875 words Include citations for all unoriginal ideas, .docx875 words Include citations for all unoriginal ideas, .docx
875 words Include citations for all unoriginal ideas, .docxtaishao1
 
81Journal of International Studies© Foundation .docx
81Journal of International Studies© Foundation .docx81Journal of International Studies© Foundation .docx
81Journal of International Studies© Foundation .docxtaishao1
 
82916 f_ch02.docx 45 Chapter 2 From Public Regulat.docx
82916 f_ch02.docx  45 Chapter 2 From Public Regulat.docx82916 f_ch02.docx  45 Chapter 2 From Public Regulat.docx
82916 f_ch02.docx 45 Chapter 2 From Public Regulat.docxtaishao1
 
8.1 What is the difference between RFC 5321 and RFC 53228.docx
8.1 What is the difference between RFC 5321 and RFC 53228.docx8.1 What is the difference between RFC 5321 and RFC 53228.docx
8.1 What is the difference between RFC 5321 and RFC 53228.docxtaishao1
 
8. On the likely aftermath and long term effects of the coronavi.docx
8. On the likely aftermath and long term effects of the coronavi.docx8. On the likely aftermath and long term effects of the coronavi.docx
8. On the likely aftermath and long term effects of the coronavi.docxtaishao1
 
8 pagesSelect an agency of the US government and a specific pu.docx
8 pagesSelect an agency of the US government and a specific pu.docx8 pagesSelect an agency of the US government and a specific pu.docx
8 pagesSelect an agency of the US government and a specific pu.docxtaishao1
 
8 pages Select an agency of the US government and a specific p.docx
8 pages Select an agency of the US government and a specific p.docx8 pages Select an agency of the US government and a specific p.docx
8 pages Select an agency of the US government and a specific p.docxtaishao1
 
8-1 Discussion Immigration LawsIt is a violation of the immigra.docx
8-1 Discussion Immigration LawsIt is a violation of the immigra.docx8-1 Discussion Immigration LawsIt is a violation of the immigra.docx
8-1 Discussion Immigration LawsIt is a violation of the immigra.docxtaishao1
 
776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docx
776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docx776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docx
776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docxtaishao1
 
8 New Scientist 4 January 2020THE tropics are the most.docx
8  New Scientist  4 January 2020THE tropics are the most.docx8  New Scientist  4 January 2020THE tropics are the most.docx
8 New Scientist 4 January 2020THE tropics are the most.docxtaishao1
 
78 CONTRADICTIONS The development of the cotton expo.docx
78  CONTRADICTIONS The development of the cotton expo.docx78  CONTRADICTIONS The development of the cotton expo.docx
78 CONTRADICTIONS The development of the cotton expo.docxtaishao1
 

More from taishao1 (20)

9.1 Give examples of applications of IPsec.9.2 What servic.docx
9.1 Give examples of applications of IPsec.9.2 What servic.docx9.1 Give examples of applications of IPsec.9.2 What servic.docx
9.1 Give examples of applications of IPsec.9.2 What servic.docx
 
a brief description of two roles that a forensic psychology professi.docx
a brief description of two roles that a forensic psychology professi.docxa brief description of two roles that a forensic psychology professi.docx
a brief description of two roles that a forensic psychology professi.docx
 
A 65-year-old obese African American male patient presents to hi.docx
A 65-year-old obese African American male patient presents to hi.docxA 65-year-old obese African American male patient presents to hi.docx
A 65-year-old obese African American male patient presents to hi.docx
 
A case study assessing risk and proposing security for some chosen o.docx
A case study assessing risk and proposing security for some chosen o.docxA case study assessing risk and proposing security for some chosen o.docx
A case study assessing risk and proposing security for some chosen o.docx
 
A 3-4-page single-spaced essay that draws on materials posted in m.docx
A 3-4-page single-spaced essay that draws on materials posted in m.docxA 3-4-page single-spaced essay that draws on materials posted in m.docx
A 3-4-page single-spaced essay that draws on materials posted in m.docx
 
a 350+ word summary based on Malinowski’s writings that answers the .docx
a 350+ word summary based on Malinowski’s writings that answers the .docxa 350+ word summary based on Malinowski’s writings that answers the .docx
a 350+ word summary based on Malinowski’s writings that answers the .docx
 
A - Historian Marybeth Hamilton characterizes the mainstreams obses.docx
A - Historian Marybeth Hamilton characterizes the mainstreams obses.docxA - Historian Marybeth Hamilton characterizes the mainstreams obses.docx
A - Historian Marybeth Hamilton characterizes the mainstreams obses.docx
 
918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docx
918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docx918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docx
918 Riders to the Sea CATHLEEN (slowly and clearly) An Id.docx
 
8B20M017 Teaching Note SOBEYS—PROJECT SUNRIS.docx
8B20M017  Teaching Note  SOBEYS—PROJECT SUNRIS.docx8B20M017  Teaching Note  SOBEYS—PROJECT SUNRIS.docx
8B20M017 Teaching Note SOBEYS—PROJECT SUNRIS.docx
 
875 words Include citations for all unoriginal ideas, .docx
875 words Include citations for all unoriginal ideas, .docx875 words Include citations for all unoriginal ideas, .docx
875 words Include citations for all unoriginal ideas, .docx
 
81Journal of International Studies© Foundation .docx
81Journal of International Studies© Foundation .docx81Journal of International Studies© Foundation .docx
81Journal of International Studies© Foundation .docx
 
82916 f_ch02.docx 45 Chapter 2 From Public Regulat.docx
82916 f_ch02.docx  45 Chapter 2 From Public Regulat.docx82916 f_ch02.docx  45 Chapter 2 From Public Regulat.docx
82916 f_ch02.docx 45 Chapter 2 From Public Regulat.docx
 
8.1 What is the difference between RFC 5321 and RFC 53228.docx
8.1 What is the difference between RFC 5321 and RFC 53228.docx8.1 What is the difference between RFC 5321 and RFC 53228.docx
8.1 What is the difference between RFC 5321 and RFC 53228.docx
 
8. On the likely aftermath and long term effects of the coronavi.docx
8. On the likely aftermath and long term effects of the coronavi.docx8. On the likely aftermath and long term effects of the coronavi.docx
8. On the likely aftermath and long term effects of the coronavi.docx
 
8 pagesSelect an agency of the US government and a specific pu.docx
8 pagesSelect an agency of the US government and a specific pu.docx8 pagesSelect an agency of the US government and a specific pu.docx
8 pagesSelect an agency of the US government and a specific pu.docx
 
8 pages Select an agency of the US government and a specific p.docx
8 pages Select an agency of the US government and a specific p.docx8 pages Select an agency of the US government and a specific p.docx
8 pages Select an agency of the US government and a specific p.docx
 
8-1 Discussion Immigration LawsIt is a violation of the immigra.docx
8-1 Discussion Immigration LawsIt is a violation of the immigra.docx8-1 Discussion Immigration LawsIt is a violation of the immigra.docx
8-1 Discussion Immigration LawsIt is a violation of the immigra.docx
 
776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docx
776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docx776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docx
776! CHAPTER 26 Rococo to Neoclassicism The 18th Century in E.docx
 
8 New Scientist 4 January 2020THE tropics are the most.docx
8  New Scientist  4 January 2020THE tropics are the most.docx8  New Scientist  4 January 2020THE tropics are the most.docx
8 New Scientist 4 January 2020THE tropics are the most.docx
 
78 CONTRADICTIONS The development of the cotton expo.docx
78  CONTRADICTIONS The development of the cotton expo.docx78  CONTRADICTIONS The development of the cotton expo.docx
78 CONTRADICTIONS The development of the cotton expo.docx
 

Recently uploaded

[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online PresentationGDSCYCCE
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345beazzy04
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxbennyroshan06
 
The Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational ResourcesThe Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational Resourcesaileywriter
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleCeline George
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...Sayali Powar
 
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...Nguyen Thanh Tu Collection
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPCeline George
 
Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportAvinash Rai
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptSourabh Kumar
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaasiemaillard
 
Gyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptxGyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptxShibin Azad
 
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdfINU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdfbu07226
 
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxslides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxCapitolTechU
 
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptxMatatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptxJenilouCasareno
 
Benefits and Challenges of Using Open Educational Resources
Benefits and Challenges of Using Open Educational ResourcesBenefits and Challenges of Using Open Educational Resources
Benefits and Challenges of Using Open Educational Resourcesdimpy50
 
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxJose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxricssacare
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersPedroFerreira53928
 

Recently uploaded (20)

[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation[GDSC YCCE] Build with AI Online Presentation
[GDSC YCCE] Build with AI Online Presentation
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
The Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational ResourcesThe Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational Resources
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
Industrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training ReportIndustrial Training Report- AKTU Industrial Training Report
Industrial Training Report- AKTU Industrial Training Report
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Gyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptxGyanartha SciBizTech Quiz slideshare.pptx
Gyanartha SciBizTech Quiz slideshare.pptx
 
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdfINU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf
 
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
Operations Management - Book1.p  - Dr. Abdulfatah A. SalemOperations Management - Book1.p  - Dr. Abdulfatah A. Salem
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
 
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxslides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
 
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptxMatatag-Curriculum and the 21st Century Skills Presentation.pptx
Matatag-Curriculum and the 21st Century Skills Presentation.pptx
 
Benefits and Challenges of Using Open Educational Resources
Benefits and Challenges of Using Open Educational ResourcesBenefits and Challenges of Using Open Educational Resources
Benefits and Challenges of Using Open Educational Resources
 
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptxJose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
Jose-Rizal-and-Philippine-Nationalism-National-Symbol-2.pptx
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 

7 HYPOTHETICALS AND YOU TESTING YOUR QUESTIONS7 MEDIA LIBRARY.docx

  • 1. 7 HYPOTHETICALS AND YOU TESTING YOUR QUESTIONS 7: MEDIA LIBRARY Premium Videos Core Concepts in Stats Video · Probability and Hypothesis Testing Lightboard Lecture Video · Hypothesis Testing Difficulty Scale (don’t plan on going out tonight) WHAT YOU WILL LEARN IN THIS CHAPTER · Understanding the difference between a sample and a population · Understanding the importance of the null and research hypotheses · Using criteria to judge a good hypothesis SO YOU WANT TO BE A SCIENTIST You might have heard the term hypothesis used in other classes. You may even have had to formulate one for a research project you did for another class, or you may have read one or two in a journal article. If so, then you probably have a good idea what a hypothesis is. For those of you who are unfamiliar with this often-used term, a hypothesis is basically “an educated guess.” Its most important role is to reflect the general problem statement or question that was the motivation for asking the research question in the first place. That’s why taking the care and time to formulate a really precise and clear research question is so important. This research question will guide your creation of a hypothesis, and in turn, the hypothesis will determine the techniques you will use to test it and answer the question that was originally asked. So, a good hypothesis translates a problem statement or a research question into a format that makes it easier to examine. This format is called a hypothesis. We will talk about what
  • 2. makes a hypothesis a good one later in this chapter. Before that, let’s turn our attention to the difference between a sample and a population. This is an important distinction, because while hypotheses usually describe a population, hypothesis testing deals with a sample and then the results are generalized to the larger population. We also address the two main types of hypotheses (the null hypothesis and the research hypothesis). But first, let’s formally define some simple terms that we have used earlier in Statistics for People Who (Think They) Hate Statistics. SAMPLES AND POPULATIONS As a good scientist, you would like to be able to say that if Method A is better than Method B in your study, this is true forever and always and for all people in the universe, right? Indeed. And, if you do enough research on the relative merits of Methods A and B and test enough people, you may someday be able to say that. But don’t get too excited, because it’s unlikely you will ever be able to speak with such confidence. It takes too much money ($$$) and too much time (all those people!) to do all that research, and besides, it’s not even necessary. Instead, you can just select a representative sample from the population and test your hypothesis about the relative merits of Methods A and B on that sample. Given the constraints of never enough time and never enough research funds, with which almost all scientists live, the next best strategy is to take a portion of a larger group of participants and do the research with that smaller group. In this context, the larger group is referred to as a population, and the smaller group selected from that population is referred to as a sample. Statistics as a field, in fact, is all about looking at a sample and inferring to the population it represents. Indeed, the word statistic technically means a number that describes a sample (and the word we use for a number that describes a population is parameter). A measure of how well a sample approximates the
  • 3. characteristics of a population is called sampling error. Sampling error is basically the difference between the values of the sample statistic and the population parameter. The higher the sampling error, the less precision you have in sampling, and the more difficult it will be to make the case that what you find in the sample indeed reflects what you expected to find in the population. And just as there are measures of variability regarding distributions, so are there measures of the variability of this difference between a sample measure and a population measure. This is often called the standard error—it’s basically the standard deviation of the difference between these two values. Samples should be selected from populations in such a way that the sample matches as closely as possible the characteristics of the population. You know, to minimize the sampling error. The goal is to have the sample be as much like the population as possible. The most important implication of ensuring similarity between the two is that the research results based on the sample can be generalized to the population. When the sample accurately represents the population, the results of the study are said to have a high degree of generalizability. A high degree of generalizability is an important quality of good research because it means that the time and effort (and $$$) that went into the research may have implications for groups of people other than the original participants. It’s easy to equate “big” with “representative.” Keep in mind that it is far more important to have an accurately representative sample than it is to have a big sample (people often think that big is better—only true on Thanksgiving, by the way). Having lots and lots of participants in a sample may be very impressive, but if the participants do not represent the larger population, then the research will have little value. THE NULL HYPOTHESIS Okay. So we have a sample of participants selected from a population, and to begin the test of our research hypothesis, we first formulate the null hypothesis.
  • 4. The null hypothesis is an interesting little creature. If it could talk, it would say something like “I represent no relationship between the variables that you are studying.” In other words, null hypotheses are statements of equality demonstrated by the following real-life null hypotheses taken from a variety of popular social and behavioral science journals. Names have been changed to protect the innocent. · There will be no difference between the average score of 9th graders and the average score of 12th graders on a memory test. · There is no difference between the effectiveness of community-based, long-term care and the effectiveness of in- home, long-term care in promoting the social activity of older adults when measured using the Margolis Scale of Social Activities. · There is no relationship between reaction time and problem- solving ability. · There is no difference between high- and low-income families in the amount of assistance families offer their children in school-related activities. What these four null hypotheses have in common is that they all contain a statement that two or more things are equal or unrelated (that’s the “no difference” and “no relationship” part) to each other. The Purposes of the Null Hypothesis What are the basic purposes of the null hypothesis? The null hypothesis acts as both a starting point and a benchmark against which the actual outcomes of a study can be measured. Let’s examine each of these purposes in more detail. First, the null hypothesis acts as a starting point because it is the state of affairs that is accepted as true in the absence of any other information. For example, let’s look at the first null hypothesis we stated earlier: There will be no difference between the average score of 9th graders and the average score of 12th graders on a memory test. Given absolutely no other knowledge of 9th and 12th graders’ memory skills, you have no reason to believe that there will be
  • 5. differences between the two groups, right? If you know nothing about the relationship between these variables, the best you can do is guess. And that’s taking a chance. You might speculate as to why one group might outperform another, using theory or common sense, but if you have no evidence a priori (“from before”), then what choice do you have but to assume that they are equal? This lack of a relationship as a starting point is a hallmark of this whole topic. Until you prove that there is a difference, you have to assume that there is no difference. And a statement of no difference or no relationship is exactly what the null hypothesis is all about. Such a statement ensures that (as members of the scientific community) we are starting on a level playing field with no bias toward one or the other direction as to how the test of our hypothesis will turn out. Furthermore, if there are any differences between these two groups, then you have to assume that these differences are due to the most attractive explanation for differences between any groups on any variable—chance! That’s right: Given no other information, chance is always the most likely and attractive explanation for the observed differences between two groups or the relationship between variables. Chance explains what we cannot. You might have thought of chance as the odds of winning that $5,000 jackpot at the penny slots, but we’re talking about chance as all that other “stuff” that clouds the picture and makes it even more difficult to understand the “true” nature of relationships between variables. For example, you could take a group of soccer players and a group of football players and compare their running speeds, thinking about whether playing soccer or playing football makes athletes faster. But look at all the factors we don’t know about that could contribute to differences. Who is to know whether some soccer players practice more or whether some football players are stronger or whether both groups are receiving additional training of different types? What’s more, perhaps the way their speed is being measured
  • 6. leaves room for chance; a faulty stopwatch or a windy day can contribute to differences unrelated to true running speed. As good researchers, our job is to eliminate chance factors from explaining observed differences and to evaluate other factors that might contribute to group differences, such as intentional training or nutrition programs, and see how they affect speed. The point is, if we find differences between groups and the differences are not due to training, then we have no choice but to attribute the difference to chance. And, by the way, you might find it useful to think of chance as being somewhat equivalent to the idea of error. When we can control sources of error, the likelihood that we can offer a meaningful explanation for some outcome increases. The second purpose of the null hypothesis is to provide a benchmark against which observed outcomes can be compared to see if these differences are due to some other factor. The null hypothesis helps to define a range within which any observed differences between groups can be attributed to chance (which is the null hypothesis’s contention) or are due to something other than chance (which perhaps would be the result of the manipulation of some variable, such as training in our example of the soccer and football players). Most research studies have an implied null hypothesis, and you may not find it clearly stated in a research report or journal article. Instead, you’ll find the research hypothesis clearly stated, and this is where we now turn our attention.CORE CONCEPTS IN STATS VIDEOProbability and Hypothesis Testing Hypothesis testing involves analyzing data to test our research hypothesis. We've collected data and are ready to analyze it. We'll need to complete three steps. First, examine the data by creating tables and figures in order to understand it. Second, calculate descriptive statistics-- measures of central tendency and variability-- to describe the data. Finally, calculate inferential statistics to test our research hypothesis. Think of this process like a criminal trial. State an initial assumption
  • 7. and a mutually exclusive alternative to this assumption. Analyze the collected evidence, the data. In this scenario, the jury has two decisions to make when deciding whether or not to reject the initial assumption-- guilty or not guilty. These three steps-- making an initial assumption-- known as the null hypothesis-- and an alternative to that assumption; analyzing the collected data; and making a decision whether to reject or accept the initial assumption-- or null hypothesis-- are all critical parts of the process of hypothesis testing. The decision you make is based on the probability of obtaining a value of a statistic. If your statistic has a low probability of occurring, the likelihood of this happening is so low because of chance you reject the initial assumption, or null hypothesis. This is similar to what happens in jury trials, where if the jury decides that the likelihood an innocent person did these things is so low, they'll reject the presumption of innocence and instead decide on a guilty verdict. What do we mean by "low probability"? If the probability of a statistic is less than to reject the null hypothesis. Think about flipping coins. The first step is to state the null and the alternative hypotheses. In flipping coins, we start with the null hypothesis assumption that coins are fair. There is an equal probability of getting heads or tails. The mutually exclusive alternative to this assumption is that coins are not fair and there is not an equal probability of getting a heads or tails. The second step would be to calculate a statistic. In flipping coins, you could flip a sample of coins and then count the number of heads. The next step is to make a decision about the null hypothesis. You can either reject or accept the null hypothesis-- that coins are fair. If the number of heads in your sample of coin flips has a low probability of occurring-- less than that coins are fair in favor of the alternative hypothesis, that coins are not fair. We use these examples to illustrate the process of hypothesis testing. But the reality is that research isn't about flipping coins and counting the number of heads. And that's what research is about-- collecting data from samples.
  • 8. THE RESEARCH HYPOTHESIS Whereas a null hypothesis is usually a statement of no relationship between variables or that a certain value is zero, a research hypothesis is usually a definite statement that a relationship exists between variables. For example, for each of the null hypotheses stated earlier, here is a corresponding research hypothesis. Notice that we said “a” and not “the” corresponding research hypothesis because there certainly could be more than one research hypothesis for any one null hypothesis. · The average score of 9th graders is different from the average score of 12th graders on a memory test. · The effectiveness of community-based, long-term care is different from the effectiveness of in-home, long-term care in promoting the social activity of older adults when measured using the Margolis Scale of Social Activities. · Slower reaction time and problem-solving ability are positively related. · There is a difference between high- and low-income families in the amount of assistance families offer to their children in school-related activities. Each of these four research hypotheses has one thing in common: They are all statements of inequality. They posit a relationship between variables and not an equality, as the null hypothesis does. The nature of this inequality can take two different forms—a directional or a nondirectional research hypothesis. If the research hypothesis posits no direction to the inequality (such as only saying “different from”), the hypothesis is a nondirectional research hypothesis. If the research hypothesis posits a direction to the inequality (such as “more than” or “less than”), the research hypothesis is a directional research hypothesis. The Nondirectional Research Hypothesis A nondirectional research hypothesis reflects a difference between groups, but the direction of the difference is not
  • 9. specified. For example, the following research hypothesis is nondirectional in that the direction of the difference between the two groups is not specified: The average score of 9th graders is different from the average score of 12th graders on a memory test. The hypothesis is a research hypothesis because it states that there is a difference, and it is nondirectional because it says nothing about the direction of that difference. A nondirectional research hypothesis, like this one, would be represented by the following equation: (7.1) H1:¯¯¯X9≠¯¯¯X12,H1:X¯9≠X¯12, where · H1 represents the symbol for the first (of possibly several) research hypotheses, · ¯¯¯X9X¯9 represents the average memory score for the sample of 9th graders, · ¯¯¯X12X¯12 represents the average memory score for the sample of 12th graders, and · ≠ means “is not equal to.” The Directional Research Hypothesis A directional research hypothesis reflects a difference between groups, and the direction of the difference is specified. For example, the following research hypothesis is directional because the direction of the difference between the two groups is specified: The average score of 12th graders is greater than the average score of 9th graders on a memory test. One is hypothesized to be greater than (not just different from) the other. Examples of two other directional hypotheses are these: · A is greater than B (or A > B). · B is greater than A (or A < B). These both represent inequalities (greater than or less than). A directional research hypothesis such as the one described
  • 10. earlier, where 12th graders are hypothesized to score better than 9th graders, would be represented by the following equation: (7.2) H1:¯¯¯X12>¯¯¯X9,H1:X¯12>X¯9, where · H1 represents the symbol for the first (of possibly several) research hypotheses, · ¯¯¯X9X¯9 represents the average memory score for the sample of 9th graders, · ¯¯¯X12X¯12 represents the average memory score for the sample of 12th graders, and · > means “is greater than.” What is the purpose of the research hypothesis? It is this hypothesis that is directly tested as an important step in the research process. The results of this test are compared with what you expect if you were wrong (reflecting the null hypothesis) to see which of the two is the more attractive explanation for any differences between groups or variables you might observe. Table 7.1 gives the four null hypotheses and accompanying directional and nondirectional research hypotheses. Another way to talk about directional and nondirectional hypotheses is to talk about one- and two-tailed tests. A one- tailed test (reflecting a directional hypothesis) posits a difference in a particular direction, such as when we hypothesize that Group 1 will score higher than Group 2. A two- tailed test (reflecting a nondirectional hypothesis) posits a difference but in no particular direction. We talk about “tails” because we often understand statistical results by applying them to a normal curve that has two “tails.” The importance of this distinction begins when you test different types of hypotheses (one- and two-tailed) and establish probability levels for rejecting or not rejecting the null hypothesis. More about this in Chapters 8 and 9. Promise. Table 7.1 ⬢ Null Hypotheses and Corresponding Research Hypotheses
  • 11. Null Hypothesis Nondirectional Research Hypothesis Directional Research Hypothesis There will be no difference in the average score of 9th graders and the average score of 12th graders on a memory test. Twelfth graders and 9th graders will differ on a memory test. Twelfth graders will have a higher average score on a memory test than will 9th graders. There is no difference between the effectiveness of community- based, long-term care for older adults and the effectiveness of in-home, long-term care for older adults when measured using the Margolis Scale of Social Activities. The effect of community-based, long-term care for older adults is different from the effect of in-home, long-term care for older adults when measured using the Margolis Scale of Social Activities. Older adults exposed to community-based, long-term care score higher on the Margolis Scale of Social Activities than do older adults receiving in-home, long-term care. There is no relationship between reaction time and problem- solving ability. There is a relationship between reaction time and problem- solving ability. There is a positive relationship between reaction time and problem-solving ability. There is no difference between high- and low-income families in the amount of assistance families offer their children in educational activities. The amount of assistance offered by high-income families to their children in educational activities is different from the amount of support offered by low-income families to their children in educational activities. The amount of assistance offered by high-income families to their children in educational activities is more than the amount of support offered by low-income families to their children in educational activities.Some Differences Between the Null
  • 12. Hypothesis and the Research Hypothesis Besides the null hypothesis usually representing an equality and the research hypothesis usually representing an inequality, the two types of hypotheses differ in several other important ways. First, for a bit of review, the two types of hypotheses differ in that one (the null hypothesis) usually states that there is no relationship between variables (an equality), whereas the research hypothesis usually states that there is a relationship between the variables (an inequality). This is the primary difference. Second, null hypotheses always refer to the population, whereas research hypotheses usually refer to the sample. We select a sample of participants from a much larger population. We then try to generalize the results from the sample back to the population. If you remember your basic philosophy and logic (you did take these courses, right?), you’ll remember that going from small (as in a sample) to large (as in a population) is a process of induction. Third, because the entire population cannot be directly tested (again, it is impractical, uneconomical, and often impossible), you can’t say with 100% certainty that there is no real difference between segments of the population on some variable. Rather, you have to infer it (indirectly) from the results of the test of the research hypothesis, which is based on the sample. Hence, the null hypothesis must be indirectly tested, and the research hypothesis can be directly tested. Fourth, in statistics, the null hypothesis is written with an equal sign, while the research hypothesis is written with a not equal to, greater than, or less than sign. Fifth, null hypotheses are always written using Greek symbols, and research hypotheses are always written using Roman symbols. Thus, the null hypothesis that the average score for 9th graders is equal to that of 12th graders is represented like this: (7.3) H0:μ9=μ12,H0:μ9=μ12,
  • 13. where · H0 represents the null hypothesis, · µ9 represents the theoretical average for the population of 9th graders, and · µ12 represents the theoretical average for the population of 12th graders. The research hypothesis that the average score for a sample of 12th graders is greater than the average score for a sample of 9th graders is shown in Formula 7.2 (presented earlier). Finally, because you cannot directly test the null hypothesis, it is an implied hypothesis. But the research hypothesis is explicit and is stated as such. This is another reason why you rarely see null hypotheses stated in research reports and will very often see a statement (be it in symbols or words) of the research hypothesis. LIGHTBOARD LECTURE VIDEO Hypothesis Testing Hypothesis testing is a weird sort of statistics thing. We pretend that we think something is true, when in fact we think the opposite is true. Let me show you what I mean. Here is a correlation coefficient. And your research hypothesis might be hey, A and B are correlated. In reality, the correlation is either greater than isn't. It's either greater than Your null hypothesis is going to be that it's equal to But what you really think is that it's greater than So you have what's really out there, and you can't affect that at all. Then you have your guesses. And your guesses are either going to be that it's greater than exactly equal to All you have to prove is that it's greater than You don't have to prove it's exactly .26 or any of that stuff. Now the guess that says it's greater than your research hypothesis. That's what you really think. The guess that says it's equal to that's your null hypothesis. So you've got these two situations-- whatever is true in the real world and whatever your guess was. So there's really only four possibilities, right? You could be right. You could be wrong. There's two ways to be right and two ways to be wrong. Let's
  • 14. think. If your guess is that it's the correlation is greater than and it really is greater than If your guess is that is greater than and it's not greater than then that's really sad and you're wrong. If your guess is the null hypothesis, which is how we typically do things, you're guessing that the correlation is exactly equal to And if it really is, that, strangely enough, also makes you really happy. But if you're guessing that the correlation is exactly equal to that makes you sad. So a researcher usually has this research hypothesis-- in this case, that the correlation is greater than And if they're right, that's the best possible outcome. WHAT MAKES A GOOD HYPOTHESIS? You now know that hypotheses are educated guesses—a starting point for a lot more to come. As with any guess, some hypotheses are better than others right from the start. We can’t stress enough how important it is to ask the question you want answered and to keep in mind that any hypothesis you present is a direct extension of the original question you asked. This question will reflect your personal interests and motivation and your understanding of what research has been done previously. With that in mind, here are criteria you might use to decide whether a hypothesis you read in a research report or one that you formulate is acceptable. To illustrate, let’s use an example of a study that examines the effects of providing afterschool child care to employees who work late on the parents’ adjustment to work. Here is a well- written directional research hypothesis: Parents who enroll their children in afterschool programs will have a more positive attitude toward work, as measured by the Attitude Toward Work survey, than will parents who do not enroll their children in such programs. Here are the criteria. First, a good hypothesis is stated in declarative form and not as a question. (It ends with a period or, if you’re really excited, an exclamation mark!) While the preceding hypothesis may have started in the researcher’s mind as the question “What are the
  • 15. benefits of afterschool programs at work … ?” it was not posed because hypotheses are most effective when they make a clear and forceful statement. Second, a good hypothesis posits an expected relationship between variables. The hypothesis in our example clearly describes the expected relationships among afterschool child care and parents’ attitude. These variables are being tested to see if one (enrollment in the afterschool program) has an effect on the others (attitude). Notice the word expected in the second criterion. Defining an expected relationship is intended to prevent a fishing trip to look for any relationships that may be found (sometimes called the “shotgun” approach), which may be tempting but is not very productive. You do get somewhere using the shotgun approach, but because you don’t know where you started, you have no idea where you end up. In the fishing-trip approach, you throw out your line and take anything that bites. You collect data on as many things as you can, regardless of your interest or even whether collecting the data is a reasonable part of a scientific investigation. Or, to use a shotgun analogy, you load up them … 6 As you think about surveying clients, donors, or the general public you may start by figuring out whom to contact. Contacting everyone is seldom practical or necessary. You may use too many resources and take up too much time. Instead you will want to contact a sample of clients, donors, or residents. This chapter covers the principles guiding the selection of a sample. Knowing how to design a sample is only the beginning. You need to identify individual sample members, contact them, and encourage them to respond. Technology is rapidly changing
  • 16. strategies for contacting sample members and encouraging them to respond. Survey organizations are studying the impact of changing lifestyles and new communications technologies. In this chapter we focus on the basic sampling and data collection strategies. At the end of the chapter we refer you to how-to books and other sources to fill in the gaps. Our approach keeps us from overwhelming you with information that may have a short shelf life. Sampling is an economical and effective way to learn about individuals and organizations. This is true if you seek information from individuals, case records, agency representatives, or computerized datasets. Even with a relatively small group, sampling enables you to gather information relatively quickly. Depending on how you select the sample you may be able to use the findings to generalize beyond the sampled individuals, organizations, or other units. Because sampling is a jargon-rich topic, we begin by defining some key terms. First, you want to recognize the difference between a population and a sample. The population is the total set of units that you are interested in. A population is often composed of individuals, but it may consist of other units such as organizations, households, records, or computers. A sample is a subset of the population. Closely connected to the concepts of population and sample are the terms parameter and statistics. A parameter is a characteristic of the entire population. A statistic is a characteristic of a sample. The percentage of citizens in a city favoring a fund to finance affordable housing is a parameter. The percentage of sampled residents who favor the fund is a statistic. Depending on how you select the sample you may be able to use the statistic and the sampling error to estimate the parameter. The sampling error, which is discussed in more detail in , is the difference between the parameter and the statistic and is used to estimate the parameter.
  • 17. One of your first steps is to precisely define the population represented by the sample, such as “all adults over the age of 21 living in Clark County on July 1.” After you define your population your next step is to find a sampling frame. A sampling frame is a list of names of individuals or organizations in your population. A sampling frame should contain all members of your population. Complete lists seldom exist. So, in reality the sampling frame is the list of potential sample members. Sampling frames are not perfect. They may aggregate units, list units more than once, omit units, or include units that are not part of the population. A list of building addresses, for example, may include apartment buildings, but not individual units. If you plan to sample individual residences, this sampling frame would be inadequate. The unit of analysis indicates what the data represent. You may think of the unit of analysis as what constitutes a case when you enter data on a spreadsheet. If you collect data from housing agencies, the sampling unit is housing agency. If you report the number of employees and the size of the budget the unit of analysis is housing agency. If you collect data on individual clients from the sampled agencies, then “client” would be the unit of analysis. illustrates the components of a sample. FIGURE 6.1 From Population to Sampled Unit A sample design describes the strategy for selecting the sample members from the population. Designs are classified as probability and nonprobability. With a probability sample, each unit of the population has a known, nonzero chance of being in the sample. Probability samples use randomization to avoid selection biases and use statistical theory to estimate parameters. You cannot estimate parameters with nonprobability samples. Both types of designs have value but for different purposes and in different situations. AN APPLICATION OF SAMPLING TERMINOLOGY
  • 18. • Population. All participants who have completed the Clark County Job Training Program within the past 2 years • Parameter. Average starting salaries of all graduates of the Clark County Job Training Program within the past 2 years • Sampling frame. All (8) graduation lists from the past 2 years • Sampling design. Probability sample • Sample. 120 graduates randomly selected from sampling frame • Unit of analysis. Individual graduates • Statistic. The average starting salary of the 120 graduates was $25,000 ■ With a probability sample, you can calculate the chance that a member of the population has of being sampled. We discuss four common probability sampling designs: simple random sampling, systematic sampling, stratified sampling, and cluster sampling. These designs are often used together. Stratified and cluster samples also use simple random sampling or systematic sampling to select subjects. Multistage sampling combines cluster sampling with other designs. Simple random sampling requires that each unit in the population has an equal probability of being in the sample. Drawing names from a hat is the prototypical example of a simple random sample. An analogous strategy is used to draw lottery numbers. You do not need to put names in a hat or marked balls in a tumbler. Rather you can use technology to help you draw a random sample. If the cases are contained in an electronic database you may use a computer program to select your sample. These programs use an algorithm to create a random sample. If the cases are not stored in an electronic database you can number the cases. In this method, using your calculator or a list of random numbers, you select a set of random numbers. Finally, you match each selected random number to the case with the same number. If the cases are not stored electronically systematic sampling is an acceptable alternative. It is easier and normally produces
  • 19. results comparable to those of a simple random sample. To construct a systematic sample, you divide the number of units in the sampling frame (N) by the number desired for the sample (n). The resulting number is called the skip interval (k). If a sampling frame consists of 50,000 units and a sample of 1,000 is desired, the skip interval equals 50 (50,000 divided by 1,000). You select a random number, go to the sampling frame, and use the random number to select the first case. You then pick every kth unit for the sample. In our example every 50th case would be chosen. If the random number 45 was selected the cases 45, 95, 145, and so on would be in the sample. With systematic sampling, treat the list as circular, so the last listed unit is followed by the first. You should go through the entire sampling frame at least once. Systematic sampling has one potentially serious problem, that of periodicity. If the items are listed in a pattern and the skip interval coincides with the pattern, the sample will be biased. Consider what could happen in sampling the daily activity logs of a 911 call center. If the skip interval was 7 the activity logs in the sample would all be for the same day, that is, all Mondays or all Tuesdays, and so forth. A skip interval of any multiple of 7 would have the same result. Periodicity is relatively rare. If you notice something strange about your sample, for example, that it is all female, includes only top administrators, or has only corner houses, check for periodicity. An easy fix that often works is to double the size of your skip interval (k × 2). Go through the sampling frame once to get half your sample, then choose another random starting point and go through it a second time. Systematic sampling may be the only feasible way to get a probability sample of a population of unknown size, such as people attending a community festival. You estimate the population size, that is, the number of attendees, determine a skip interval, and pick a random beginning point. If you plan to sample every 20th person and start with the 6th person, the 6th person to arrive (or depart) would be selected, as would the
  • 20. 26th person, the 46th person, and so on, until the end of the sampling period. Stratified sampling ensures that a sample adequately represents selected groups in the population. You should consider using stratified sampling if you plan to compare groups, if you need to focus on a group that is a small proportion of your population, or if your sampling frame is already divided by groups. First, you divide or classify the population into strata, or groups, on some characteristic such as gender, age, or institutional affiliation. Every member of the population should be in one and only one stratum. Use either simple random sampling or systematic sampling to draw samples from each stratum. Stratified samples may be proportional or disproportional. In proportional stratified samples the same sampling fraction is applied to each stratum. The sample size for each stratum is proportional to its size in the population. If, for example, you wanted to compare three groups of employees—professional staff, technical staff, and clerical staff—you would designate each group as a stratum. You would select your samples by taking the same percentage of members from each stratum, say 10 percent of the professional staff, 10 percent of the technical staff, and 10 percent of the clerical staff. The resulting sample would consist of three strata, each equal in size to the stratum’s proportion of the total population. illustrates a proportional stratified sample. The percentage of members selected for the sample need not be the same for each stratum. In disproportional stratified sampling, the same sampling fraction is not applied to all strata. Disproportional sampling is useful when a characteristic of interest occurs infrequently in the population, making it unlikely that a simple random or a proportional sample will contain enough members with the characteristic to allow full analysis. For example, a mentoring program may want to compare recruitment and retention of Spanish-speaking volunteers with that of non–Spanish speakers. You will want a
  • 21. sample large enough so that you can analyze and compare the two groups. If 5 percent of its volunteers are Spanish speakers, a random sample of 200 volunteers should have about 10 Spanish speakers—too few cases for analysis. Therefore, you might survey a larger percentage of Spanish-speaking volunteers and a smaller percentage of other volunteers; you could decide that half the sample should be Spanish speakers and an equal number of non-Spanish speakers, thus over- representing Spanish speakers. A higher percentage of Spanish speakers will have been sampled and non-Spanish speakers under-represented. EXAMPLE 6.1 AN APPLICATION OF PROPORTIONAL STRATIFIED SAMPLING • Problem. The director of volunteer recruitment for the state office of a large nonprofit organization wanted information about volunteer applicants to the organization over the previous 4 years. • Population. All applications to the volunteer service of the nonprofit organization for each of the past selects 4 years. • Sampling frame. Agency’s electronic file of applications(organized by application date). • Sampling design. Proportional stratified sample with year of application as strata. A computer program select a random sample of applications for each year, using the same sampling fraction for each group (). ■ TABLE 6.1 Proportional Stratified Sampling Strata Number of Applications Sampling Fraction (%) Number in Sample Year 1 350 15 53 Year 2
  • 22. 275 15 41 Year 3 250 15 38 Year 4 230 15 35 Total 1,150 167 Each strata in a disproportional stratified sample constitutes a separate sample. To conduct your analysis you must keep the samples separate as you compare the Spanish-speaking volunteers with non–Spanish speakers. To determine characteristics of all sampled volunteers you must weigh the samples. First, determine each stratum’s sample size for a proportional sample. To calculate the weight for each stratum, divide the size of the proportional sample by the size of the disproportional sample. When you combine the samples, each Spanish speaker’s responses would be multiplied or weighted by 0.1 and each response by non–Spanish speakers would be multiplied by 1.9. Cluster and multistage sampling take advantage of the fact that members of a population can be located within groups or clusters, such as cities and counties. Cluster sampling is useful if a sampling frame does not exist or is impractical to use. For cluster and multistage samples, we randomly select clusters and then units in the selected clusters. For example, for a study of Boys and Girls Clubs we might first select a sample of counties. Either our final sample would consist of all the Boys and Girls
  • 23. Clubs in the sampled counties or we would select a sample of clubs from these counties. Unlike stratified samples, which sample members from each stratum, cluster samples sample only members from the selected clusters. Multistage sampling is a variant of cluster sampling. It proceeds in stages as you sample units dispersed over a large geographic area. The following example shows how you could design a sample to survey residents in a state’s long-term care facilities: ■ Stage One. Draw a probability sample of counties, that is, you choose large clusters containing smaller clusters. ■ Stage Two. From your sample of counties choose a probability sample of incorporated areas. ■ Stage Three. For your sampled incorporated areas obtain lists of long-term care facilities located in them. ■ Stage Four. Select a sampling strategy ■ Alternative 1. Select all residents in the facilities identified at Stage 3 or ■ Alternative 2. Select a sample of long-term facilities and then select all residents in the selected facilities or ■ Alternative 3. Select a sample of long-term facilities and select a sample of residents in the selected facilities. The units selected at each stage are called sampling units. The sampling unit may not be the same as the unit of analysis. Different sampling units are selected at each stage. In this example, the unit of analysis is residents in long-term care facility. However, different sampling units were selected at each stage of the process. Note that you can incorporate different sampling strategies into this design. For example, you may use stratified sampling at Stage One to make sure that you have counties for each of the state’s major regions, such as upstate or downstate, or the mountains and the beaches. You may use simple random sampling to choose specific residents for your sample. Cluster sampling is recommended if your population is distributed over a large geographic area. Without the ability to limit the sample to discrete areas the costs and logistics would
  • 24. make probability sampling difficult, if not impossible. If site visits are needed to collect data, visiting a sample spread thinly over a wide area will be extremely expensive. Cluster sampling can help reduce this cost. Cluster sampling also helps compensate for the lack of a sampling frame. Combining cluster and multistage sampling requires an investigator to develop a sampling frame for just the last stages of the process. Although cluster and multistage sampling methods reduce travel time and costs, they require a larger sample than other methods for the same level of accuracy. In the multistage process, probability samples are selected at each stage. Each time a sample is selected there is some sampling error; thus, the overall sampling error is likely to be larger than if a random sample of the same size were drawn. Nonprobability samples cannot produce estimates with mathematical precision, because the sample members do not have a known chance of being selected. Nonprobability sampling is useful when designing a probability sampling is not possible, studying small or hard-to-reach populations, doing an exploratory or preliminary study, or conducting in-depth interviews or focus groups. In the following paragraphs we describe four common nonprobability designs: availability sampling, quota sampling, purposive sampling, and snowball sampling. Availability Sampling: Availability or convenience sampling is done when cases are selected because they are easily accessed. Subjects may be selected because they can be contacted with little effort. For example, if an emergency shelter wanted to interview clients about the service availability and quality, everyone at the shelter on a given day might be invited to participate. The obvious flaw of availability sampling is that it may exclude cases that represent the target population, and the findings cannot be generalized. The findings do not describe the knowledge, attitudes, beliefs, or behaviors of others in the
  • 25. target population. Consider the study at the emergency shelter. You may avoid approaching certain people. For instance, individuals who are sleeping or nodding off may not be asked to participate. These individuals may be clients of a methadone treatment program (methadone causes severe drowsiness), and their observations about drug treatment services may go unheard. If you conduct the study in the summer, the population may be very different from the winter population. The day staff may have a reputation that affects who stays around the shelter during the day. Quota Sampling: Quota sampling, often used in market research, is less common in social science studies. Quota samples attempt to overcome the major limitation of availability samples by defining the percentage of members to be sampled from specified groups. An agency considering opening a child care center in an ethnically diverse community may want to survey local families. You could design a quota sample to ensure that you get input from families of all ethnicities. If the community is 25% Asian, 60% White, 10% African American, and 5% other, a sample of 200 should have 50 Asians, 120 Whites, 20 African Americans, and 10 from other groups. If the selection of sample members is based on convenience and judgment, they may also have characteristics that make them more approachable. This adds a potential bias to the sample. While your sample may be representative of one variable, it is not necessarily representative of other variables. For example, your sample of families based on their ethnicity probably isn’t representative of age or income. As part of designing a quota sample you need to determine the most meaningful variable and its variation in the population. It may be easy to find information on some variables (e.g., race, gender, age) but more difficult to find out about other characteristics such as incidence of substance abuse or child-rearing practices. Purposive Sampling: Purposive sampling selects cases based on specialized knowledge, distinct experience, or unique position. You might use a purposive sample to study very successful
  • 26. mentoring programs, people who underwent innovative medical treatments, or women governors. You might select cases to capture maximum variation of a phenomenon. The selected cases may provide rich, detailed information about the phenomenon of interest. Some researchers have argued that we can learn more from very successful programs and limit their samples to such programs. A variation of this theme is to compare a set of very successful programs with unsuccessful ones. If you want to conduct in-depth interviews or focus groups you may want to select respondents deliberately. You will want to carefully select whom to interview. You will want to interview individuals who can provide insight about the phenomenon being studied, are willing to talk, and have diverse perspectives. Such individuals are referred to as key informants. Good informants will have given the subject matter some thought and can express their thoughts, feelings, and opinions. Snowball Sampling: Snowball sampling starts by finding one member of the population of interest, speaking to him or her, and asking that person to identify other members of the population. You can use it to identify organizations or individuals. This process is continued until the desired sample has been identified. The number of members in the sample “snowballs.” Snowball sampling is most often used to contact individuals or groups that are hard to reach, difficult to identify, or tightly interconnected. For example, if you wanted to interview sex workers, you might identify a sex worker who is willing to talk to you and to identify other sex workers. In addition to identifying other sex workers, an informant may also vouch for your credibility. A concern when using snowball sampling is that the individuals who are referred have not consented to being identified. This may not be a problem for some populations, but it is for others. Consequently, for this type of sampling, perhaps more than for others, you should remind any informant of the need to protect people’s privacy and have the informant seek permission before
  • 27. sharing names and contact information. When conducting qualitative research including interviews and focus groups, determining how many cases to include in the sample may be difficult. Completeness and saturation serve as a guide for knowing when to stop selecting participants.Completeness suggests that the subjects have given you a clear, well-defined perspective of the theme or issue. Saturation means that you are confident that little new information will be learned from more interviews or focus groups. Once you find that your newest subjects are sharing the same ideas, themes, and perspectives as previous participants, you can stop. The appropriate sample size for quantitative surveys is based on several factors. The first is how much sampling error you are willing to accept, since accuracy is important in determining sample size. Greater accuracy usually can be obtained by taking larger samples. The confidence you wish to have in the results and the variability within your target population also play a role in determining sample size. Both the desired degree of confidence and the population variability are related to accuracy. We discuss these terms in more detail and their relation to sample size in . People unfamiliar with sampling theory often assume that a major factor determining sample size is the size of the entire population. That is, they may believe that representing a population of 500,000 requires a correspondingly larger sample than representing a population of 20,000. Generally, larger samples will yield better estimates of population parameters than will smaller samples. Yet, increasing the size of the sample beyond a certain point results in very little improvement and also, additional units bring additional expense. Thus you must balance the need for accuracy against the need to control costs. Further, with very large samples, the quality of the data may
  • 28. even decrease. Note that the relationship between sample size and accuracy applies only to probability samples. Sample size for nonprobability sampling must be governed by other considerations, such as the opportunity to get an in-depth understanding of a problem and possible solutions. As you think about sample size keep in mind the study’s purpose. Unless its purpose is well defined and the validity of your measurements has been established, conducting a preliminary study with a small sample may be more efficient than spending resources on a larger sample. Sampling errors come about when we draw a sample rather than study the entire population. If a probability sampling method has been used, the resulting error can be estimated. This is the powerful advantage of probability sampling. But other types of errors can cause a sample statistic to be inaccurate. Nonsampling errors result from flaws in the sampling design or from faulty implementation. To reduce nonsampling errors you must attend carefully to the selection of the sampling frame, the implementation of the design, and the quality of the measures. Nonsampling errors are serious, and their impact on the results of a study may be unknown. Taking a larger sample may not decrease nonsampling error; the errors may actually increase with a larger sample. For example, coding and transcription errors may increase if staff rushes to complete a large number of forms. Similarly, a large sample may result in fewer attempts to reach respondents who are not at home, thus increasing the nonresponse rate. If members of the sample who respond are consistently different from nonrespondents, the sample will be biased. Other nonsampling errors include unreliable or invalid measurements, mistakes in recording and transcribing data, and failure to follow up on nonrespondents. If a sampling frame excludes certain members of a subgroup, such as low- or high-income individuals, substantial bias may be built into the sample.
  • 29. You can collect data using mail surveys, telephone interviews, e-mail surveys, Web-based surveys, in-person interviews, or a combination of these. To decide which to use you may weigh time, cost, and your need for a probability sample. To get a probability sample you need to consider if you can find an appropriate sampling frame, contact sample members, and encourage their responses. Your well-designed probability sample may become a nonprobability sample if you have a poor sampling frame, are unable to contact members of the sample, or have a low response rate. One of the first things to ask yourself is why will a subject answer your questions. You may find that clients, donors, or staff will respond if they see the value of the study and your request isn’t burdensome. The general public may not be so inclined. Survey response rates have declined substantially in recent years, which is attributed both to less success in contacting respondents and to more people refusing to participate. Low response rates, the percentage of the sample that responds, increase costs and call into question accuracy of results. Without careful planning, you risk a low response rate and wasting your time. The next question you may ask yourself about contacting your subjects is “what will I say?” While the idea of contacting strangers to ask survey or interview questions may seem daunting, it’s actually pretty easy once you have a planned study. Much of what you will say when contacting subjects can be taken from your informed consent form discussed in . When you contact subjects, the most important thing to do is to tell them who you are, what you want from them, how long it will take, and what they will get out of it. Most people are very busy but many will take the time to help you if the time seems reasonable and the study worthwhile. In this chapter we focus on traditional data collection strategies and what you will want to consider in choosing a strategy. We do not cover the specific details of how to conduct a survey: there are excellent “how to” guides that you should consult if
  • 30. you plan a specific study. We spend little time on the opportunities for and challenges to data collection, such as identifying and contacting potential respondents, offered by new communications technologies. Research on new technologies is in its infancy, too soon for us to know how they can affect your work. For instance, when considering a telephone survey you may want to investigate including cell phones in your sample. In addition, with the availability of inexpensive online programs to design and administer online surveys and the diffusion of Internet access, you may consider conducting an online survey, which opens up opportunities to include graphics and even video clips. As the costs and logistics of in-person interviewing climb, you may explore opportunities to conduct interviews and focus groups via video conferencing. To take advantage of new technologies and their impact may require you to keep abreast of research being conducted by survey organizations, such as the Institute for Social Research at the University of Michigan. Mail surveys come to mind when we think about self- administered surveys. As is true of all surveying techniques, mail surveys have their unique advantages and disadvantages. All surveys require time, as you write questions, design a questionnaire, and find a sampling frame with contact … PPOL 505 Practice Exercise Instructions You will have 5 practice exercise assignments throughout this course. Each assignment will consist of questions related to the course material for that module/week. Practice exercises are your opportunity to practically apply the course materials. You will use SPSS or the course texts to answer various application questions related to course content. Assignments will vary in length. See the specific instructions for each assignment.
  • 31. All questions and sections of questions must be clearly labeled in your Word document. Copy any required SPSS tables into your Word document and label them appropriately. Show your work when required. Answers that are not clearly labeled will not receive credit. For each set of practice exercises, submit your assignment by 11:59 p.m. (ET) on Sunday of the assigned module/week. Practice Exercises Grading Rubric Criteria Levels of Achievement Content (70%) Advanced 92-100% Proficient 84-91% Developing 1-83% Not present Total Organization 45 to 49 points · Student provides a complete, detailed, and accurate response to each of the questions or concepts assigned. · Student provides accurate SPSS output when required. 41 to 44.5 points · Student responds accurately to each of the questions or concepts assigned. · Student provides accurate SPSS output when required. 1 to 40.5 points · The student has accuracy issues with their answers to some of the questions or concepts assigned.
  • 32. · Student either provides inaccurate SPSS output or does not provide output where required. 0 points Not present Structure (30%) Advanced 92-100% Proficient 84-91% Developing 1-83% Not present Total Grammar, Spelling, & Turabian 19.5 to 21 points Minimal to no errors in grammar, spelling, or Turabian. 17.5 to 19 points Some errors in grammar, spelling, or Turabian. 1 to 17 points Numerous errors in grammar, spelling, or Turabian. 0 points Not present Professor Comments: Total: /100