| (3.8) |
| (3.9) |
A candidate test statistic is thus
| (3.10) |
Although z would obey a Gaussian distribution if the individual
measurements entering
do, t will not be Gaussian distributed. Instead, the distribution
in t is determined by the ratio of the Gaussian distribution to
the square root of the chisquare distribution which characterizes deviations
of the sample estimate of the standard deviation from the true standard
deviation:3.1
| (3.11) |
where
is distributed as
.
The distribution in t can be derived from this ratio (as shown, e.g., in Brownlee 1965):
| (3.12) |
where
is the number of degrees of freedom, which will be (n-1) in the
case where
is the average of n independent measurements. The variance of the
t distribution is given by
| (3.13) |
and for large numbers of degrees of freedom the t distribution approaches a Gaussian distribution with this variance.
The t statistic supports a test of the hypothesis that the mean is
without requiring that the true standard deviation
be known. Only the sample standard deviation is needed. The change from
a Gaussian distribution function is most significant for small numbers
of degrees of freedom. Figure 2.1 showed the cumulative form of this distribution
function for various degrees of freedom and compared its shape to that
of the Gaussian distribution.
| (3.14) |
with N-2 degrees of freedom. This can be used to test the probability
that a value as large as t will be obtained if the true mean is
1.
| (3.15) |
Then
| (3.16) |
is distributed according to the t distribution with (n1+n2-1)
degrees of freedom, where n1 and n2
are the respective degrees of freedom in the two experiments. A common
mistake is to assume that the degrees of freedom should be 1 for this case
because there are two measurements and one difference. Recall that the
reason for introducing the t distribution was to account for lack
of knowledge of the true standard deviation in the phenomenon being observed.
If there is a large set of measurements, this standard deviation is known
much better than if there are only a few, and this improved knowledge must
be reflected in the number of degrees of freedom in the comparison.
Next: 3.6
Confidence intervals Up: 3.
Probability Distribution Functions Previous: 3.4
Poisson distribution