Example Consider the following AOV of yield data from a randomized block experiment to measure yields of 8 lucerne varieties from 3 replicates.
The statistical model is :-
| Source | df | SS | MS | F | E(MS) |
| replicate | 2 | 41,091 | 20,545 | 2.4 |
|
| variety | 7 | 75,437 | 10,777 | 1.26 |
|
| residuals | 14 | 120,218 | 8,587 |
The variety effects are:-
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| 0 | -69 | -46 | 90 | 81 | 71 | 60 | 22 |
Is there sufficient evidence to say that the observed differences are due to systematic effects? If there is not sufficient evidence to say this, we are obliged to take the position that the observed differences could have arisen from random sampling from a population for which there were no variety effects.
If the null hypothesis that
is true,
The ratio of these 2 mean squares is a random variable whose distribution is
named the
distribution. We assess the null hypothesis using the
statistic which in this case is 1.26 and the probability of getting this or more extreme by sampling from a population with
is 0.34. Thus we have no strong evidence to support the alternate hypothesis that not all
.
Given the effort of conducting an experiment, this is a disappointing result. Specific contrasts amongst the varieties should be tested but if there were no contrasts specified at the design stage, this ``data-snooping'' may also be misleading. A post mortem poses the following questions.
The quantile of
demarking the 5% critical region is
.
To be 95% sure that the observed differences were not due to chance, the observed
has to exceed
. The probability of rejection of the
null hypothesis when the differences are actually not zero is called
POWER and is measured by the area under the non-central
density
to the right of
.
The
curves for this example are shown in Figure 6.1 where
the vertical lines is
. The calculated value of power is 0.1 which is very poor. A better design which will reduce experiment error is required.
For post mortems of AOV's, the non-centrality parameter is estimated by
We revisit the topic of POWER in Statistical Inference.