Statistical models in R

Christophe Lalanne
October 22, 2013

Synopsis

The analysis of variance is not a mathematical theorem, but rather a convenient method of arranging the arithmetic. Ronald Fisher (1890-1962)

design of experiments • split-apply-combine • one-way and two-way ANOVA • interaction

Lectures: OpenIntro Statistics, 4.2, 5.2, 5.5.

Design of experiments

Maximize precision while minimizing number of trials.

Implementation of an organized set of experimental units to characterize the effect of certain treatments or combination of treatments, on one or more response variables.

Taking into account one or more nuisance factors for the establishment of experimental design: organize sources of unwanted variation so that we can say that they affect treatment equivalently, making the comparison between treatments possible.

Some examples

  • Parallel (independent) groups
  • Completely randomized design
  • Incomplete block design (e.g., Latin square)
  • Split-plot design
  • Repeated measures, including cross-over trials

Randomization (random allocation of units to treatments–experimental vs. quasi-experimental design), factorial arrangement of treatments, and blocking (grouping of similar units based on known but irrelevant characteristics) are keys components of experimental design (Montgomery, 2012).

Test of null hypothesis

Null hypotheses associated to the full factorial design are given below:

  • \( H_0^A:\, \alpha_1=\alpha_2=\dots=\alpha_a \) (a-1 dof), No effect of A
  • \( H_0^B:\, \beta_1=\beta_2=\dots=\beta_b \) (b-1 dof), No effect of B
  • \( H_0^{AB}:\, \gamma_{11}=\gamma_{13}=\dots=\gamma_{ab} \) ((a-1)(b-1) dof), No interaction between A and B

The ratio of Mean Squares corresponding to each factor and that of the residuals can be tested with Fisher-Snedecor F-tests.

Interaction between factors

anova

References

Chambers J and Hastie T (1992). Statistical Models in S. Wadsworth & Brooks. ISBN: 0534167649.

Montgomery D (2012). Design and Analysis of Experiments, 8th edition. John Wiley & Sons.

Sokal R and Rohlf F (1995). Biometry, 3rd edition. WH Freeman and Company.

Wickham H (2011). “The Split-Apply-Combine Strategy for Data Analysis.” Journal of Statistical Software, 40(1).

Wilkinson G and Rogers C (1973). “Symbolic description of factorial models for analysis of variance.” Applied Statistics, 22, pp. 392-399.