Experimental research allows researchers to test cause-and-effect relationships by manipulating independent variables and observing their effects on dependent variables. Key aspects of experimental research include identifying independent and dependent variables, using control groups, random assignment, and controlling for extraneous variables. Researchers must also consider issues of internal and external validity when designing and interpreting experiments. Common experimental designs include pre-experimental, true experimental, quasi-experimental, single-subject, and factorial designs. Proper data analysis and replication of results are important for validating experimental findings.
2. Experimental Research vs. Other Methods
2
ā
ā
Cantest for cause/effect relationships
Manipulation of independent variable(s)
Simply put:
Decisions about the forms and values of the IV, as
well as about which group receives which treatment
are at the solediscretion of the researcher
3. Variables in Experimental Research
3
ā IndependentVariable:
ā
ā
Experimental Variable, Cause, or Treatment
The activity or characteristic the researcher believes
makes a diļ¬erence
ā DependentVariable:
ā
ā
ā
ā
Criterion Variable, Eļ¬ect, or Posttest
Outcome of the study
Diļ¬erence in group(s) that occurs as a result of the
manipulation of the IV
Only constraint: must represent a measurable
outcome
4. Characteristics of Experimental Research
4
ā Demanding & Productive, but...
ā Produce the soundest evidence of hypothesized cause-eļ¬ect
relationships
ā Difference between Correlational & Experimental
Research:
ā
ā Correlational can be used to predict a specific score for a
specific individual
Experimental predicts more global results*
5. Steps in Experimental Research Study
5
1. Select and define problem.
2.Select subjects and [measurement] instruments.
3.Select design.
4.Execute procedures.
5.Analyze data.
6.Formulate conclusions.
6. Role of the Researcher
6
ā
ā
ā
ā
Forms or selects groups
Decides what will happen to each group
Attempts to control all variables and factors
Observes and measures eļ¬ect on the groups
Every eļ¬ort is made to make sure the 2groups have
equivalent variablesāexcept for the independent variable.
7. Two Groups
7
ā
ā
Experimental Group
ā Receives the new treatment being investigated
Control Group
ā
ā
Receives a diļ¬erent treatment; or
Receives same treatment as usual (i.e. is left alone)
The Control Group is needed in order to identify/measure any
diļ¬erences observed as a result of the diļ¬ering treatments
8. Potential Issues in Experimental Research
8
ā Experimental treatment not given adequate time to
take effect
ā Experimental group should be exposed to treatment for a long
enough period of time for the treatment to work
ā Treatments received by the 2groups are not
ādifferent enoughā
ā No diļ¬erence between the groups will be found if the
experimental treatment and the control treatment are too
similar
9. Experimental Validity
9
ā Experiments are considered valid if:
ā The results obtained are only due to the manipulation
of the independent variable
ā Two conditions must be met:
ā Experiment has internal validity
ā Experiment has external validity
10. Internal Validity
10
ā Observed diļ¬erences on the dependent variable are the direct result of the
researcherās manipulation of the independent variable.
ā Campbell & Stanley (1971) identified 8threats to internal validity:
ā
ā
ā
ā
ā
ā
ā
ā
History - becomes more likely the longer a study is; caused by external events.
Maturation - physical/mental changes occurring in subjects over time; more likely to occur when study is extended over a
long period of time.
Testing (pretest sensitization) - result of higher scores on a posttest due to participants having taken a pretest; unlike
above, more likely to occur when there are short intervals between testing.
Instrumentation - lack of consistency between measuring instruments; data collection leads to unreliable/invalid results.
Statistical Regression - tendency for some scores to move towards the mean score; participants who score the highest and
lowest on a pretest are more likely to score lower and higher (respectively) on a posttest.
Diļ¬erential Selection of Subjects - diļ¬erences already present between two pre-formed groups could account for
diļ¬erences in posttest results.
Mortality (attrition) - occurs most often in long-term studies; refers to participants who drop out of a group potentially
sharing some characteristic that aļ¬ects the significance of the study.*
Selection-Maturation Interaction, Etc. - if pre-formed groups are used, one group may be at an (dis)advantage due to
factors of maturation; the āetc.ā refers to the fact that selection can also interact in this way with other factors such as
history, testing, instrumentation, etc.
11. External Validity
11
ā Results of the experiment are generalizable to groups and environments outside of the experiment;
results of the study can be reconfirmed with other groups, in other settings, and at other times (if
the conditions are similar to those present in the experiment).
ā Bracht & Glass (1968)identified 6 threats to external validity:
ā Pretest-Treatment Interaction - participants react diļ¬erently to a treatment because they have been pretested; pretests may alert
participants to the make-up of the treatment; therefore, results can only be generalized to other pretested groups.
ā Multiple-Treatment Interference - the same participants receive the same treatment in succession; eļ¬ects are carried-over from the first
treatment making it hard to determine the eļ¬ectiveness of the second treatment.
ā Selection-Treatment Interaction - occurs when participants are not randomly selected for the treatments they receive; can occur when
participants are a pre-formed group or an individual; limits the generalizability of the results.
ā Specificity of Variables - does not depend on the experimental design chosen; threatens validity when a study is conducted:
ā
ā
ā
ā
ā
with a specific kind of subject;
based on a particular definition of the independent variable;
using specific measuring instruments;
at a specific time; and
under a specific set of circumstances.
ā Experimenter Eļ¬ects - experimenter unintentionally aļ¬ects the implementation of the studyās procedures, the behavior of the participants,
or the assessment of participant behavior, thereby aļ¬ecting the results of the study.
ā Reactive Arrangements - factors associated with how a study is conducted eļ¬ectively influence the feelings and attitudes of the
participants; aļ¬ects generalizability of the results.
12. Extraneous Variables
12
ā The control of extraneous variables is vital to the success of
an experiment.
ā Extraneous variables can be controlled through:
ā
ā
ā
ā
ā Randomization - subjects should be randomly selected for participation and randomly assigned to groups; randomizing
selection should be attempted whenever possible
Matching - researcher pairs up participants with matching (similar) scores or characteristics (gender, IQ, location), then
randomly assigns each participant to a diļ¬erent group than their counterpart; this ensures that the pair with matching
IQ scores are not in the same group
Comparing homogenous groups or subgroups - group participants according to their similarity/fit into a variable
subgroup (IQ, SAT score); randomly assign half of the subgroup to the experimental group, and the other half of the
subgroup to the control group
Using subjects as their own controls - the same participants get both treatments (one treatment at a time); controls for
participant diļ¬erences; can result (negatively) in carry-over eļ¬ects between the treatments
Analysis of covariance - statistically equate randomly formed groups on a particular variable; can be used to adjust for
large diļ¬erences in pretest scores between groups
13. Group Designs
13
ā Two classes of experimental designs:
ā Single-Variable: one independent variable; IV is manipulated
ā Three typesā
ā
ā
ā
Pre-experimental
True experimental*
Quasi-experimental
ā Factorial: two or more independent variables; at least one IV
is manipulated
ā
ā
ā Elaborate on single-variable designs;
Investigates each variable independently and in interaction
with other variables;
Skyās the limit**
14. Pre-Experimental Designs
14
ā One-Shot Case Study ā
ā One group exposed to one treatment then given posttest
ā
ā
Donāt know level of group knowledge before the treatment!
Sources of invalidity are not controlled!
ā One-Group Pretest-Posttest Design ā
ā One group pretested, exposed to one treatment, then posttested
ā
ā
Still a number of factors aļ¬ecting validity that are not controlled!
Other factors may influence any diļ¬erences observed between the pretest and posttest
ā Static-Group Comparison ā
ā At least two groups; first receives new treatment; second receives usual
treatment; both posttested
ā
ā
Purpose of control group is to show how the experimental (first) group would have performed had
they not received the new treatment
Eļ¬ective only to the degree that the two groups are equal to each other
15. True Experimental Designs
15
ā Pretest-Posttest Control Group Design ā
ā At least two randomly-assigned groups; both pretested for dependent variable;
one group then receives the new treatment; then both groups are posttested.
ā
ā
Internal invalidity fully controlled by: random assignment, pretesting, & inclusion of a control group
Potential risk of interaction between the pretest and the treatment*
ā Posttest-Only Control Group Design ā
ā Same as pretest-posttest design, except there is no pretest
ā
ā
Subjects randomly assigned; exposed to independent variable; then posttested
Mortality is not controlled for (no pretest), but may not be a problem anyway
ā Solomon Four-Group Design ā
ā Random assignment of participants to one of four groups
ā
ā
ā
ā
Two groups are pretested; two groups are not pretested
One pretested group & one unpretested group receive the experimental treatment
All four groups are posttested
Combination of the two designs (above) - eliminates both sources of internal invalidity!
16. Quasi-Experimental Designs
16
ā Nonequivalent Control Group Design ā
ā Two or more existing groups pretested; administered treatment; and posttested.
ā
ā
Participantsā assignment to groups is not random; assignment of treatments to groups is random
Invalidity sources include: regression, selection-treatment interactions (maturation, history, and testing)
ā Time-Series Design ā
ā One group repeatedly pretested; administered treatment; repeatedly posttested.
ā
ā
Elaboration of the one-group pretest-posttest design; involves testing (pre- and post-) more than once
Advantage lies in confidence gained through significant improvement of group scores between pretests and posttests
ā Counterbalanced Designs ā
ā All groups received all treatments; each group receives treatment in a diļ¬erent
order than others.
ā
ā
ā
ā
Any number of groups can be involved; limited only by the number of treatments; # of groups =# of treatments
Order of each groupsā receipt of treatment is determined randomly; each group is posttested following each treatment
Pretest usually not possible and/or feasible; often used on existing groups
Weakness lies in potential for multiple-treatment interference; thus, should only be used when this is not a concern
17. Factorial Designs
17
ā
ā
Two or more independent variables; at least one is
manipulated by researcher
Term āfactorialā comes from the use of multiple variables
with multiple levels
ā
ā
2x 2factorial design*
Can get very complicated (2 x 3, 3x 2, etc.)!
ā
ā
Often employed after using a single-variable design;
ā āVariables do not operate in isolationā
Studies how variables behave at diļ¬erent levels**
18. Single-Subject Experimental Designs
18
ā
ā
ā
Also referred to as āsingle-case experimental designsā
Used when sample size =1; or for multiple individuals
considered as 1 group
ā Variation of the time-series design
Typically used asastudy of behavioral change in an
individual
ā Participant is own control; exposed to both nontreatment &
treatment phases;
ā Individualās performance measured repeatedly during all phases
ā Nontreatment phase =A; Treatment phase =B
19. Validity in Single-Subject Experiments
19
ā ExternalValidity
ā
ā
Frequent criticism due to lack of generalizability
Can be counteracted through replication
ā InternalValidity
ā Repeated and Reliable Measurement
ā If results are to be trusted, treatment must follow exact same procedures every time
ā Baseline Stability
ā Provides basis for assessing the eļ¬ectiveness of the treatment; must do enough
baseline measurements to establish a pattern*
ā The Single Variable Rule
ā Only one variable should be manipulated at any one time!
20. Types of Single-Subject Designs
20
ā A-B-AWithdrawal Designs --
ā The A-B* Design
ā
ā
Establishment of baseline stability; treatment given
Improvement during treatment =eļ¬ectiveness of treatment
ā The A-B-A Design
ā
ā
Adds a second baseline measurement to the A-B design
Improves validity IF behavior improves during the B phase, and subsequently
deteriorates during the second A phase
ā The A-B-A-B Design
ā
ā
ā
Adds a second treatment phase to the A-B-A design
Could add strength to experiment IF behavior improves during treatment twice!
Eliminates ethical concerns from A-B-A design (ending with participant not
receiving potentially eļ¬ective treatment)
21. Types of Single-Subject Designs (contād)
21
ā Multiple-Baseline Designs
ā
ā
ā
Alternative to the A-B design
Used when unable to withdraw the treatment, or when it would be unethical to do so
Three basic types: across behaviors, across subjects, and across settings*
ā AlternatingTreatments Design
ā
ā
ā
Only valid design for assessing eļ¬ectiveness of 2+treatments in a single-subject
context
Rapid alternation of treatments for a single subject
Treatments are alternated randomly
ā
ā
ā
Notice: no withdrawal phase, no baseline phase.
Allows for the study of multiple treatments quickly and eļ¬ciently
Could introduce multiple-treatment interference
22. Data Analysis/Interpretation
22
ā Typically involves graphically-represented results
ā Design must be evaluated for adequacy; then
treatment eļ¬ectiveness is assessed
ā Clinical Significance vs. Statistical Significance
ā t and F tests can be used to test for statistical
significance
23. Replicating Results
23
ā As results are replicated, confidence in the procedures used grows
ā Direct replication
ā
ā
Replication by the same investigator in the same setting
[Note] the same or diļ¬erent participants may be used
ā
ā
Simultaneous replication
ā Same problem; same location; and same time
Systematic replication
ā Direct replication with diļ¬erent investigators, behaviors, or settings
ā Clinical replication
ā
ā
Treatment package with 2+treatments.*
Designed for participants with complex behavior disorders
24. Example of Experimental Research
24
ā Brain-Computer Interface Project
ā University of Illinois at Urbana-Champaign
ā
ā
ā
Collected brain signals through EEG
Used one group of 9 individuals
Allowed āpracticeā session before testing, but no
pretest was conducted
25. Infamous Cases of Unethical Research
ā Tuskegee Syphilis Study (1932-1972)
ā Nearly 400 African-American men were infected with syphilis
ā Study conducted by Public Health Service
ā Led to the 1979 Belmont Report (modern foundation for ethical research of
human subjects)
ā Milgram Obedience toAuthority Study (began 1961;
made public 1963)
ā Residents of New Haven, CT recruited to participate in astudy of āmemory and
learningā
ā Participants asked to inflict electric shocks in increasing voltages based on
ālearnerāsā incorrect answers (maximum voltage of 450 volts)
ā Study conducted at Yale University; intended to determine whether ordinary
people would follow orders they considered immoral (i.e. Nazi Holocaust/Adolf
Eichmann)
ā Stanford Prison Experiment (1971)
ā
ā
ā
24 students chosen as āprisoners,ā while 9 āguardsā were assigned to 3shifts
Shut down after 6 days (originally intended to take 2weeks) due to a
deterioration of the experimentās conditions and structure
Both prisoners and guards adapted to their given roles--guards becoming
authoritarian and prisoners becoming passive
25
26. References
26
Gay, L. R. (1996). Educational research : competencies for analysis and application /L.R. Gay (5th
ed.): Englewood Cliļ¬s, N.J. : Merrill, 1996.
Milgram experiment. (2011, February 7). In Wikipedia, The Free Encyclopedia. Retrieved from http://
en.wikipedia.org/w/index.php?title=Milgram_experiment&oldid=412574744.
Stanford prison experiment. (2011, February 11). In Wikipedia, The Free Encyclopedia. Retrieved
from http://en.wikipedia.org/w/index.php?title=Stanford_prison_experiment&oldid=413232983.
Omar, C., Akce, A., Johnson, M., Bretl, T., Rui, M., Maclin, E. (2011).A Feedback Information-
Theoretic Approach to the Design of Brain-Computer Interfaces. [Article]. International Journal of
Human-Computer Interaction, 27(1), 5-23. doi: 10.1080/10447318.2011.535749.
Tuskegee syphilis experiment. (2011, February 3). In Wikipedia, The Free Encyclopedia. Retrieved
from http://en.wikipedia.org/w/index.php?title=Tuskegee_syphilis_experiment&oldid=411791432.