The Theorem stated below is often referred to as the Fisher-Neyman criterion.
Theorem
7..1
Let
(or X) denote a random sample from a
distribution with density function f(x;
). Then the statistic
T
t(X) is a sufficient statistic for
if and only if we can
find two functions
and
such that
(An aside; heuristic explanation.)
Factorisation is a way of separating the random and non-random components. Only when
is comprehensive enough such that
Proof. For the continuous case, a proof is given in HC 7.2,
Theorem 1, where their
is our
and their
is our
.
To use the factorization criterion, we examine the joint density function,
f(x;
) and see whether there is any factorization of the type
required in terms of some function t(x). It is usually not easy to use
the factorization criterion to show that a statistic
is not
sufficient.
Note that the family of distributions may be indexed by a vector
parameter
, in which case the statistic
in the definition of
sufficiency can be a vector function of observations, for example,
(
) or (
,
).
Example
7..3
We will consider again example 1.1, using the
factorization criterion.
The joint probability density function of
is
Example
7..4
(Example 4 in HC, 7.2, p319)
Let
be a random sample from a
distribution where
is known. Show that
is
sufficient for
.
Now
Read HC 7.2, Examples 5, 6.
Example
7..5
Consider a random sample of size
from the uniform distribution,
. We will use the factorization
criterion to find a sufficient statistic for
.
The joint density function is
Comment. Note that the joint pdf is
which is just a function of
, so it would appear that any
statistic is sufficient. The fallacy in this argument is that the joint density function is
not always given by
, but is equal to zero for
. So it really is not just a function
of
. However, we can get it into the required form
by taking
. [Note that if
then all the
.]
Although the factorization criterion works here and in other cases where the range space depends on the parameter, one has to be careful, and it is often safer to find the conditional density for the sample given the statistic, rather than use the factorization criterion. This is done below.
The joint pdf of the ordered sample
is