1. Pendael Zephania Machafuko
Department of Biometry and Mathematics
Sokoine University ofAgriculture
Mobile phone: +255655397495
:+255688397495
Email address: p_zephania@yahoo.com
βnot ability to reproduce but ability to produceβ
Design and Analysis of Experiments
(MTH201 Lecture Notes)
2. Course objective
01/11/2013Design andAnalysis of Experiments2
ο Student be able to design an experiment in context of his/her
specialization using statistical concepts
ο Student should be able to differentiate different types of
experimental designs
ο Student be able to appropriately allocate treatments to
experimental units and identify possible confounders
ο Student be able to perform analysis of variance to determine the
treatment effects and examine internal and external validity of an
experiment
3. Mode of teaching and assessment
01/11/2013Design andAnalysis of Experiments3
ο Lectures, seminars and presentations
ο Final examination will contribute 60% of the end of semester
marks
ο Seminar reports and presentations will contribute 20% of the
end of semester marks
ο Tests will contribute 20% of the end of semester marks
4. Scientific studies
01/11/2013Design andAnalysis of Experiments4
ο Simple and effective statistical analysis
ο Understanding of subject matter
ο Provide precise parameter estimates
ο Improved statistical power
5. Overview of Experimental Design
Experimental study Observational study
01/11/2013Design andAnalysis of Experiments6
οΆ Cause-effect relationship between
response and explanatory variables
οΆ Are comparative in nature
οΆ Explanatory factor levels referred
to treatment
οΆ Unit of analysis referred to as
experimental unit
οΆ Randomization βassigning
treatment levels to experimental
units at random
οΆ Predictor variables can be can be
controlled
οΆAssociation between explanatory
and response variables
οΆNot comparative
οΆNo randomization
οΆPredictor variables cannot be
controlled by investigator
6. Application of Experimental Design
01/11/2013Design andAnalysis of Experiments7
ο Improve performance of a process or system
ο Reduced variability and closer conformance to nominal or target
requirements
ο Reduced development time
ο Reduced overall cost
7. Treatment
01/11/2013Design andAnalysis of Experiments8
ο Complete description of what will be applied to the experimental
unit
ο Treatments are applications that can stimulate response e.g. wheat
varieties, diets, fertilizers, nutrients
ο Treatment to be considered in an experiment constitute
combination of the levels of factors e.g. fertilizers (nitrogen,
phosphate, potassium), and soil type (loam, clay, sand)
8. Factor
01/11/2013Design andAnalysis of Experiments9
ο Explanatory variable (s) manipulated by the experimenter
ο Levels of a factor-the values of a specific factor e.g. cattle breed
with levels Boran, Nndama, Freshian
9. Examples of experimental units
01/11/2013Design andAnalysis of Experiments10
ο Plots in agricultural experiments
ο Pots in greenhouse experiments
ο Pens or individual animals in animal experiments
ο Farms or farmers in non-farm survey/trials
ο Patients in medical trials
ο Farms in disease survey/trials
12. Response variable
01/11/2013Design andAnalysis of Experiments13
ο Measured as the outcome of interest in the experiment. E.g.
weight gained by calves after diet use
ο In many agriculture experiments the yield of experimental units
to treatments is mostly a measurement of interest e.g. yield of
wheat, milk yield.
13. Response variable(1)
01/11/2013Design andAnalysis of Experiments14
ο Differences in the response variable from different experimental
units subjected to the same treatment may be due to number of
small uncontrollable differences versus slight differences in
Environment- temperature, soil conditions (fertility, acidity,
human), pests, diseases
Raw materials-slight differences in seed condition
Management regimes
14. Experimental error
01/11/2013Design andAnalysis of Experiments15
ο All variations that can be attributed to the effects of all non-
treatment factors and other unidentified disturbance factor(s)
15. Contribution of statistics to
experimentation
01/11/2013Design andAnalysis of Experiments16
ο Planning the experiment so that appropriate data can be
generated
ο Knowing the mechanism generated data help to identify
appropriate statistical methods
ο Attain valid and objective conclusions
16. Principles of Experimental Design
01/11/2013Design andAnalysis of Experiments17
ο±Replication
ο±Randomization
ο±Blocking
17. replication
01/11/2013Design andAnalysis of Experiments18
ο Number of times each treatment is repeated
ο Instead of having a single large plot of each treatment, there are
several smaller ones known as replicates
ο The difference in responses for the same treatment is due to
experimental error
ο Experimental error must be small for a well designed study
18. Why replicates?
01/11/2013Design andAnalysis of Experiments19
ο Replication is desirable because it
Enlarges scope of investigation
Enhances precision and overall efficiency
Minimizes experimental error because it reduces plot size to a
precision-enhancing form
Permits determination of experimental error
19. Properties of replication
01/11/2013Design andAnalysis of Experiments20
basic unit of measurement for determining whether the
observed differences in the data are really statistically
different
Permits precise estimation of treatment effect if sample mean
is used to estimate the effect of a factor, e.g., if π2
is the
variance of an individual observation and there are n
replicates, the variance of the sample mean π π¦
2
=
π2
π
20. randomization
01/11/2013Design andAnalysis of Experiments21
ο Act of assigning treatments to the experimental units purely on
the basis of chance i.e. every treatment has equal chance of being
allocated to any given plot
ο Statistical methods require that the observations be
independently random variables
ο Averaging out the effects of extraneous factors present i.e.,
systematic effects are not under the control of the investigator
ο Statistical estimation and tests of hypothesis on effects are
theoretically valid
21. Why randomize?
01/11/2013Design andAnalysis of Experiments22
ο Overcome systematic effects
ο Avoid selection bias
ο Minimize accidental bias
ο Stop experimental cheating (for good or bad)
ο Ensure no particular patterns in treatment allocation
22. How to randomize
01/11/2013Design andAnalysis of Experiments23
ο Table of random numbers
ο Computer package
ο Randomization schemes, such as simple and permuted blocks
23. blocking
01/11/2013Design andAnalysis of Experiments24
ο Heterogeneous experimental units are divided into
homogeneous subgroups called blocks to facilitate isolation of
block variation that could distort treatment effects
ο Heterogeneity may be due to soil fertility, land gradient, animal
weights, age, etc.
ο Used to improve the precision when comparisons among the
factors of interest are made.
ο Reduce or eliminate the variability transmitted from nuisance
factors i.e., factors that influence experimental response
24. Blocking variables (1)
01/11/2013Design andAnalysis of Experiments25
ο In agricultural experiments;
Soil type or fertility level
Extent and nature of previous cropping
Degree of pest infestation
Direction of wind in wind-control pest disease trial
Moisture level
26. Why blocking?
01/11/2013Design andAnalysis of Experiments27
ο Blocking is an error-control strategy that when used effectively
reduces error variances
increases precision
Reliability of estimates of effects
27. Advantages of blocking
01/11/2013Design andAnalysis of Experiments28
ο Guarantee that the same number of two different
homogeneous groups will receive each treatment
ο Increases the range of validity for the conclusions from the
experiment i.e., provide sufficient variability between groups
of experimental units in different groups for a wider range of
generalizability
ο High precision because of small experimental errors within
blocks
28. Experimental validity
01/11/2013Design andAnalysis of Experiments29
ο Assessment of the quality of an experimental design requires
knowledge of the factors that influence or cause variation in the
measured outcomes
ο Two concepts to consider
Internal validity
conclusion can be made only about the relationship between
dependent and independent variables
External validity
Conclusion from the experiment can be appropriately generalized
to a wider situation of interest
29. assignment
01/11/2013Design andAnalysis of Experiments30
ο With respect to your profession design an experiment based
on the following;
οΆ experimental units
οΆ treatments
οΆ response variable
οΆ use three principles of experimental design
οΆ is that experiment valid external?
οΆ state the assumptions of your experiment
οΆ suggest the appropriate statistical methodology
30. Types of experimental design
01/11/2013Design andAnalysis of Experiments31
ο Some basic designs commonly used in field experiments;
Single level experimental units designs
Completely randomized designs
Randomized complete block designs
Latin squares designs
Multiple level experimental units designs
Split-plot Designs
On-farm experiments
Inter-cropping
Repeated measures experiments
31. Single level experimental units designs
01/11/2013Design andAnalysis of Experiments32
ο Treatments applied to the plots and measurements taken on the
plots
32. Completely Randomized Design
01/11/2013Design andAnalysis of Experiments33
ο Levels of treatment are randomly assigned to the experimental
units (no allocation restrictions)
ο Expected effects are from between and within treatment
differences only
ο Within variation due to experimental units behaving differently
under the same treatment
ο Experimental units assumed to be homogeneous or similar in their
reaction to same treatment stimulus
ο Basic CRD has one treatment with L levels and n replicates
33. CRD Example
01/11/2013Design andAnalysis of Experiments34
ο Suppose that a study involves three varieties of wheat and there
are 27 plots available
ο In equal replication, the three wheat varieties will be randomly
allocated to the plots, 9 for each. π = ππΏ (balanced design)
ο In unequal allocation then we may have 11 plots variety 1, 7 plots
variety 2 and 9 plots variety3. π = ππ
πΏ
π=1 (unbalanced
design)
34. Prospects and problems of CRD
advantages disadvantages
01/11/2013Design andAnalysis of Experiments35
ο Easy to set up and analyze
ο Provide maximum number of
degrees of freedom for
estimation of error variation
ο Missing values cause no
difficulty
ο Suitable only for
homogeneous experimental
material
ο Suitable only for small
numbers of treatments
35. CRD Model
01/11/2013Design andAnalysis of Experiments36
ο Model
-Yield=overall mean+ treatment+ exper. Error i.e., π¦ππ = π + ππ + πππ where π = 1,2, β¦ , πΏ πππ π = 1,2, β¦ , ππ
ο Assumptions
additive effects
Independent homogeneous independent error terms
Constant variance of error terms
Normal error terms
ο Analysis to obtain
Treatment effects
Experimental error variance
Test of treatment effects
44. CRD hypothesis for cell means model
01/11/2013Design andAnalysis of Experiments45
ο π» π: π1 = π2 = π3 = β― = ππ
ο Treatment means are the same
ο π» π: π1 β π2 β π3 β β― β ππ
ο Treatment means are not the same
ο Sπππππππππππ πππ£ππ = 5%
ο Test statistic is the ratio of two variances πΉπ =
ππππ
πππΈ
β πΉ(π1, π2)
ο Decision if πΉπ > πΉ(π1, π2) reject π» π at
Ξ±% π πππππππππππ πππ£ππ
ο πΉπ < πΉ(π1, π2) do not reject π» π
ο Conclusion: There is statistical evidence that treatment means
are not equal
45. CRD hypothesis for cell means model
01/11/2013Design andAnalysis of Experiments46
ο πΉπΆ = 2.199, πΉ4,10 = 3.48
ο Since πΉπΆ < πΉ4,10, we do not reject π» π that treatment
means are the same at 5% level of significance.
ο Conclusion.There is no statistical evidence that the
treatment means are different.
49. Completely Randomized Block Design
(CRBD)
01/11/2013Design andAnalysis of Experiments50
ο The RCBD is the standard design for agricultural experiments
ο Goal is to improve the experiment by reducing the amount of
variability affecting the treatments
ο Field is divided into units to account for any variation in the field
ο Treatments are assigned at random within blocks of adjacent
plots, each treatment once per block
ο Number of blocks is the number of replications
ο Very important in improving experiments as it allows some
control of uncontrolled variation
50. CRBD (1)
01/11/2013Design andAnalysis of Experiments51
ο Any treatment can be adjacent to any other treatment, but not to
the same treatment within the block
ο Used to control variation in an experiment by accounting for
spatial effects.
51. CRBD (2)
01/11/2013Design andAnalysis of Experiments52
ο βcompleteβ each block contains all the treatments
ο Variability arising from a nuisance factor can affect the results
ο Has an effect on response but not of interest
ο Unknown and uncontrolled
ο Randomization can help to eliminate
ο Known but uncontrollable-analysis of covariance
ο Known and controllable-blocking systematically eliminate its
effect
52. CRBD Example
01/11/2013Design andAnalysis of Experiments53
ο Experiment was planned for execution in three batches to
accommodate goats that kidded at different times
ο Each batch on its own can be considered as a completely
randomized design
ο Together they form a randomized block design with batch taking
the role of block
53. CRBD Model
01/11/2013Design andAnalysis of Experiments54
ο Model
Yield=mean+treatment+block+error, i.e.,
π¦ππ = π + ππ + π½π + πππ , π = 1,2, β¦ , πΏ, π½ = 1, 2, β¦ , π
ο Assumption
Additive effects
Independent error terms
Constant variance of error terms
Normal distribution of error terms
No block-treatment interactions
ο Analysis to obtain
Treatment effects
Experimental error variance
Tests of treatment and block effects
58. Prospects and problems of RBD
Advantages disadvantages
01/11/2013Design andAnalysis of Experiments59
ο Control local variability
ο Accommodate any number of
replications
ο Different experimental
techniques can be used in
different blocks
ο Simple analysis
ο Not feasible for large number
of treatments as block size is
increased thus reducing plot
homogeneity
ο Invalid results if assumed
block homogeneity is violated
59. Statistical assumptions
01/11/2013Design andAnalysis of Experiments60
ο Variance of the error term is constant, regardless of factor level
i.e.,
π2
πππ = π2
πππ = π2
ο Error terms are normally distributed, this means that,
observations and error terms are linearly related
ο Error terms are independent i.e., error term of an outcome of
any trial has no effect on the error of any other trial for the same
factor level
ο ANOVA model is πππ β π(ππ, π2
)
60. RBD example
01/11/2013Design andAnalysis of Experiments61
ο An experiment was designed to study the performance of four
different detergents for cleaning clothes.The following
βcleanlinessβ readings (higher=cleaner) were obtained using a
special device for three different types of common stains. Is there
a significant difference among the detergents?
61. Why blocking?
01/11/2013Design andAnalysis of Experiments62
ο Homogeneous experimental units
ο Experimental error as small as possible
ο Improves the accuracy of the comparisons among treatments
62. Latin Square Design
01/11/2013Design andAnalysis of Experiments63
ο Randomized block design use only one blocking variable
ο It is not appropriate where there are more than two blocking
variables need to be controlled
ο When there are two blocking variables and treatments the design
that can handle such a case is the LATIN SQUARE DESIGN
ο In Latin square design each treatment occurs once, and only
once, in each row and column
63. Building Latin Square Design
01/11/2013Design andAnalysis of Experiments64
ο For π treatments, there are π2
observations
ο Observations are placed in π rows and π columns which form
π* π grid, in such a way that each treatment occurs once, and
only once, in each row and column.
ο For 4 treatments π΄, π΅, πΆ, π· and two factors to control. Latin
square design is
68. Example -LSD
01/11/2013Design andAnalysis of Experiments69
ο Consider an experiment to investigate the effect of four different
diets on milk production of cows.There are four cows in the
study. During each lactation period the cows receive a different
diet.Assume that there is a washout period between diets so that
previous diet does not affect future results. Lactation period and
cows are used as blocking variables
69. Factorial Design
01/11/2013Design andAnalysis of Experiments70
ο Two or more factors can be studied simultaneously
ο Every combination of the factors is studied in every trial
ο Given two factors π΄ πππ π΅, π€ππ‘β πππ£πππ π πππ π, each
replicate contain all the π β π treatment combinations
ο The effect of factor π΄ is the change in response due to a change
in the level of π΄