In the next chapter we will be considering properties of estimators. One of these properties involves the variance of an estimator and our desire to choose an estimator with variance as small as possible. Some concepts and results that will be used there are introduced in this section. In particular, we will consider the notion of information in a sample, and how we measure this information when data from several experiments is combined.
Consider a distribution indexed by a real parameter
and suppose
and
are independent sets of data,
then the likelihood of the combined sample is the product of the likelihoods
of the two individual samples. That is,
Definition
7..8
The score of a sample, denoted by
is defined by
Some properties of V are given below.
Rigorous proofs of these results depend on fulfillment of conditions (sometimes
referred to as regularity conditions) that permit interchange of integration
and differentiation operations, and on the existence and integrability of the
various partial derivatives. The proofs are not required
in this course but an outline of the proof of (7.7) is given on page
.
Properties of V
| (7.8) |
| (7.9) |
| (7.10) |
![]() |
(7.11) |
| (7.12) |
Comment on (i) and (vi).
A typical example where the ``regularity conditions'' don't hold is the
case where
is distributed U(0,
. When the range space of
depends on
, the order of integration (over X) and differentiation (with
respect to
) can't usually be interchanged, as is done in proving
(i) and (vi). In particular, for
a sample of size
from
, we have
, and
Example
7..10
For
a random sample from a
N(
,
) distribution, find
(a) the information for
;
(b) the information for
.
Example
7..11
Compute the information on
from
Bernoulli
trials with probability of success equal to
. Now
Outline of Proof of 7.7
Comments