]> Special Properties of Normal Samples
  1. Virtual Laboratories
  2. 6. Random Samples
  3. 1
  4. 2
  5. 3
  6. 4
  7. 5
  8. 6
  9. 7

5. Special Properties of Normal Samples

Suppose that X X 1 X 2 X n is a random sample from the normal distribution with mean μ and standard deviation σ . In this section, we will establish several special properties of the sample mean, the sample variance, and some other important statistics. Since the sample (and in particular the sample size n ) is fixed, it will be suppressed in the notation.

The Sample Mean

First recall that the sample mean is

M 1 n i 1 n X i

The distribution of M follows easily from basic properties of independent normal variables:

Show that M is normally distributed with mean μ and variance σ 2 n .

The standard score of M is given as follows. This statistic will appear in several of the derivations in this section.

Z M μ σ n

Show that Z has the standard normal distribution.

The Sample Variance

The Distribution of the Special Sample Variance

Recall that if μ is known, a natural estimator of the variance σ 2 is the statistic

W 2 1 n i 1 n X i μ 2

Although the assumption that μ is known is usually artificial, W 2 is very easy to analyze and it will be used in some of the derivations below.

Show that n σ 2 W 2 has the chi-square distribution with n degrees of freedom.

Use the result of the previous exercise to show that

  1. W 2 σ 2
  2. W 2 2 σ 4 n

Of course, these are special cases of the general results obtained in the section on Sample Variance.

Independence of the Sample Mean and Variance

Recall that the standard version of the sample variance is the statistic

S 2 1 n 1 i 1 n X i M 2

The sample variance S 2 is the usual estimator of σ 2 when μ is unknown. The next series of exercises show that the sample mean M and the sample variance S 2 are independent, a very important and useful property. First we will note a simple but interesting fact that holds for a random sample from any distribution, not just the normal.

Use basic properties of covariance to show that M and X i M are uncorrelated for any i 1 2 n .

Our analysis hinges on the sample mean M and the vector of deviations from the sample mean. Let

Y X 1 M X 2 M X n 1 M

Show that S 2 can be written as a function of Y by noting that

i 1 n X i M 0

Show that the M and the vector Y have a joint multivariate normal distribution.

Use the results of the previous exercises to show that M and Y are independent.

Finally, show that M and S 2 are independent.

The Distribution of the Sample Variance

We can now determine the distribution of a simple multiple of the sample variance S 2 . Let

V n 1 σ 2 S 2

Show that n σ 2 W 2 V Z 2 where, as usual, Z is the standard score associated with the sample mean M and where W 2 is the special version of the sample variance. Hint: In the sum on the left, add and subtract M , and expand.

Show that V has the chi-square distribution with n 1 degrees of freedom. Hint: Use the result of the previous exercise, independence, and moment generating functions.

Use the result of the previous exercise to show that

  1. S 2 σ 2
  2. S 2 2 σ 4 n 1

Of course, these are special cases of the general results obtained in the section on Sample Variance.

In the basic random variable experiment, select the chi-square distribution. Vary the degree of freedom parameter and note the shape and location of the probability density function and the mean, standard deviation bar. For selected values of the parameter, run the experiment 1000 times, updating every 10 runs, and note the apparent convergence of the empirical density and moments to the true distribution density and moments.

The T Statistic

The next sequence of exercises will derive the distribution of

T M μ S n

Note that T is similar to the standard score Z , but with the sample standard deviation S replacing the distribution standard deviation σ . The statistic T plays a critical role in constructing interval estimates for μ and performing hypothesis tests for μ .

As usual, let Z denote the standard score associated with the sample mean M and V the chi-square statistics associated with the sample variance S 2 . Show that

T Z V n 1

Use previous results to show that T has the student t distribution with n 1 degrees of freedom.

In the basic random variable experiment, select the t distribution. Vary the degree of freedom parameter and note the shape and location of the probability density function and the mean, standard deviation bar. For selected values of the parameters, run the experiment 1000 times, updating every 10 runs, and note the apparent convergence of the empirical density and moments to the true distribution density and moments.

The F Statistic

The last statistic that we will study arises in the two-sample normal model. Thus, suppose that X X 1 X 2 X m is a random sample of size m from the normal distribution with mean μ and standard deviation σ and that Y Y 1 Y 2 Y n is a random sample of size n from the normal distribution with mean ν and standard deviation τ . Finally, suppose that X and Y are independent. In the following exercises, we will use the basic notation established above, but we will indicate the dependence on the sample.

Show that the random variable given below has the F distribution with m degrees of freedom in the numerator and n degrees of freedom in the denominator:

F W 2 X σ 2 W 2 Y τ 2

Show that the random variable given below has the F distribution with m 1 degrees of freedom in the numerator and n 1 degrees of freedom in the denominator:

F S 2 X σ 2 S 2 Y τ 2

These variables are useful for estimates and tests of the ratio of the variances σ 2 τ 2 . The choice of the variables depends on whether the means μ and τ are known or unknown. Usually, of course, the means are unknown and so the statistic in Exercise 18 is used.

In the basic random variable experiment, select the F distribution. Vary the degrees of freedom parameters and note the shape and location of the probability density function and the mean, standard deviation bar. For selected values of the parameters, run the experiment 1000 times, updating every 10 runs, and note the apparent convergence of the empirical density and moments to the true distribution density and moments.