]>
In this section, we will study hypothesis tests in the two-sample normal model and in the bivariate normal model. This section parallels the section on Estimation in the Two Sample Normal Model in the chapter on Interval Estimation.
Suppose that is a random sample of size from the normal distribution with mean and standard deviation , and that is a random sample of size from the normal distribution with mean and standard deviation . Moreover, suppose that the samples and are independent.
This type of situation arises frequently when the random variables represent a measurement of interest for the objects of the population, and the samples correspond to two different treatments. For example, we might be interested in the blood pressure of a certain population of patients. The vector records the blood pressures of a control sample, while the vector records the blood pressures of the sample receiving a new drug. Similarly, we might be interested in the yield of an acre of corn. The vector records the yields of a sample receiving one type of fertilizer, while the vector records the yields of a sample receiving a different type of fertilizer.
Usually our interest is in a comparison of the parameters (either the mean or variance) for the two sampling distributions. In this section we will construct tests for the ratio of the variances and for the difference of the means. As with previous estimation problems we have studied, the procedures vary depending on what parameters are known or unknown. Also as before, key elements in the construction of the tests are the sample means and sample variances and the special properties of these statistics when the sampling distribution is normal.
We will use the following notation for a generic sample and where is a real number:
We will first consider tests for the ratio of the variances under the assumption that the means and are known. Usually, of course, this is an unrealistic assumption, but it is a good place to start because the analysis is fairly easy. Our basic test statistic is
where is a conjectured value of the ratio of the variances.
Show that has the distribution with degrees of freedom in the numerator and degrees of freedom in the denominator.
Now for and for and , let denote the quantile of order for the distribution with degrees of freedom in the numerator and degrees of freedom in the denominator. For selected values of , , and , can be computed using the quantile applet or from most statistical software packages.
Show that the following tests have significance level α:
For each of the tests in Exercise 2, show that we fail to reject at significance level if and only if is in the corresponding level confidence interval.
Next we will consider tests for the ratio of the variances under the more realistic assumption that the means and are unknown. In this case, our basic test statistic is
where once again, is a conjectured value of the ratio of the variances.
Show that has the distribution with degrees of freedom in the numerator and degrees of freedom in the denominator.
Show that the following tests have significance level α:
For each of the tests in Exercise 5, show that we fail to reject at significance level if and only if is in the corresponding level confidence interval.
Next we will consider tests for the difference in the means under the assumption that the standard deviations and are known. Once again, this is usually an unrealistic assumption, but it is a good place to start because the analysis is fairly easy. Our basic test statistic is
where is a conjectured value of the difference in the means.
Show that has the normal distribution with mean and variance 1.
As usual, for , let denote the quantile of order for the standard normal distribution. For selected values of , can be obtained from the last row of the table of the distribution, from the table of the standard normal distribution, from the quantile applet or from most statistical software packages. Recall also by symmetry that .
Show that the following tests have significance level :
For each of the tests in Exercise 8, show that we fail to reject at significance level if and only if is in the corresponding level confidence interval.
Finally we will consider tests for the difference in the means under the more realistic assumption that the standard deviations and are unknown. In this case, it is more difficult to find a suitable test statistic, but we can do the analysis in the special case that the standard deviations are the same. Thus, we will assume that , and the common value is unknown. This assumption is reasonable if there is an inherent variability in the measurement variables that does not change even when different treatments are applied to the objects in the population. Recall that the pooled estimate of the common variance is
Our basic test statistic is
Show that if has the distribution with degrees of freedom,
As usual, for and , let denote the quantile of order for the distribution with degrees of freedom. For selected values of and , values of are given in the table of the Student distribution, from the quantile applet, or from most statistical software packages. Recall also that, by symmetry,
Show that the following tests have significance level :
For each of the tests in Exercise 11, show that we fail to reject at significance level if and only if is in the corresponding level confidence interval.
In this subsection, we consider a model that is superficially similar to the two-sample normal model, but is actually much simpler. Suppose that
is a random sample of size from the bivariate normal distribution of with , , , , and .
Thus, instead of a pair of samples, we have a sample of pairs. This type of model frequently arises in before and after experiments, in which a measurement of interest is recorded for a sample of objects from the population, both before and after a treatment. For example, we could record the blood pressure of a sample of patients, before and after the administration of a certain drug.
We will use our usual notation for the sample means and variances of and . Recall also that the sample covariance of , is
Show that is a random sample of size from the distribution of , which is normal distribution with
Show that
The sample of differences fits the normal model for a single variable. The section on Tests in the Normal Model could be used to perform tests for the parameters .
A new drug is being developed to reduce a certain blood chemical. A sample of 36 patients are given a placebo while a sample of 49 patients are given the drug. The statistics (in mg) are , , , . Test the following at the 10% significance level:
A company claims that an herbal supplement improves intelligence. A sample of 25 persons are given a standard IQ test before and after taking the supplement. The before and after statistics are , , , , . At the 10% significance level, do you believe the company's claim?
In Fisher's iris data, consider the petal length variable for the samples of Versicolor and Virginica irises. Test the following at the 10% significance level:
A plant has two machines that produce a circular rod whose diameter (in cm) is critical. A sample of 100 rods from the first machine as mean 10.3 and standard deviation 1.2. A sample of 100 rods from the second machine has mean 9.8 and standard deviation 1.6. Test the following hypotheses at the 10% level.