]>
Suppose that is a random sample from the normal distribution with mean and standard deviation . In this section we will construct confidence sets for , and . These are among of the most important special cases of set estimation. A parallel section on Tests in the Normal Model is in the chapter on Hypothesis Testing.
As usual, we will construct the confidence sets by finding appropriate pivot variables. As a first step, we will standardize our outcome variables to form a basic pivot vector from which all of our other pivot variables will be constructed. Thus, for each , let
Show or recall that is a random sample of size from the standard normal distribution, and hence is a pivot vector for .
Recall that the sample mean our data vector is
Show or recall that has the normal distribution with mean and variance . Hence, the corresponding standard score has the standard normal distribution, and is a pivot variable for :
Show that the pivot variable in Exercise 2 can be written in terms of our basic pivot variables in Exercise 1 as follows:
As usual, we will let denote the standard normal probability density function and the standard normal distribution function. For , let denote the quantile of order for the standard normal distribution. That is, . For selected values of , can be obtained from the last row of the table of the distribution, from the table of the standard normal distribution, from the quantile applet, or from most statistical software packages.
Show or recall the following properties. Use the inverse function theorem of calculus for part (d):
Use the pivot variable in Exercise 2 to show that for any and any , a level confidence set for is
Show that the confidence set in Exercise 5 is a cone
in the
parameter space, with vertex at
and boundary lines of slopes
and
,
as shown in the graph below. (Note, however, that both slopes might be negative or both positive.)
Note that the confidence set is unbounded, not surprising since we are using a single real-valued pivot variable to estimate two real parameters. However, if is known, the confidence set gives a bounded, confidence interval for . Geometrically, this interval corresponds to the horizontal cross section at . The assumption that is known is usually, but not always, artificial. Exercise 35 gives an example of a setting in which the assumption may be reasonable. In principle, we could also get confidence sets for assuming that is known; these sets would correspond to vertical cross sections. However, the assumption that is known is almost always unrealistic, and moreover, the confidence sets for would usually be unbounded.
Let us study the size
of our confidence cone. One way to do this is to study
,
the length of the cross section at
.
Of course, if
is known,
is simply the length of the confidence interval for
Show that . Note in particular that the deterministic. Now show that
Exercise 7 shows again that there is a tradeoff between the confidence level and the size of the confidence set. If and are fixed, we can decrease and hence tighten our estimate, only at the expense of decreasing our confidence in the estimate. Conversely, we can increase our confidence in the estimate only at the expense of increasing the size of the set. In terms of , the best of the two-sided confidence sets (and the one that is always used) is the equal-tailed set:
If is known, this set gives the equal-tailed confidence interval for . Note that this interval is symmetric about the sample mean
Let and let in Exercise 5 to show that the following are confidence sets for
If is known, part (a) gives a confidence lower bound for and part (b) gives a confidence upper bound for .
Use the mean estimation experiment to explore the procedure. Select the normal distribution and select normal pivot. Use various parameter values, confidence levels, sample sizes, and interval types. For each configuration, run the experiment 1000 times with an update frequency of 10. As the simulation runs, note that the confidence interval successfully captures the mean if and only if the value of the pivot variable is between the quantiles. Note the size and location of the confidence intervals and note how well the proportion of successful intervals approximates the theoretical confidence level.
Suppose now that is known. Let denote the distance between the sample mean and one of the confidence bounds. That is,
where for the standard two-sided interval and for the upper or lower confidence interval. Note that is deterministic, and the length of the standard two-sided interval is . The number is sometimes called the margin of error. In many cases, the first step in the design of the experiment is to determine the sample size needed to estimate with a given margin of error and a given confidence level.
Show that the sample size needed to estimate with confidence and margin of error is
Note that varies directly with and with and inversely with . This last fact implies a law of diminishing return in reducing the margin of error. For example, if we want to reduce a given margin of error by a factor of , we must increase the sample size by a factor of 4.
Recall that the random variable
is sometimes used as a special version of the sample variance, when the distribution mean is known. On the other hand, the usual version of the sample variance is
Recall that one of the most important special properties of normal samples is that the sample mean and the sample variance are independent. Now let
Show that can be written in terms of the basic pivot variables in Exercise 1 as given below. Thus, show that this variable is a pivot variable for and has the chi-square distribution with degrees of freedom.
Show that . Thus, note that can also be written in terms of the basic pivot variables of Exercise 1 and hence is a pivot variable for and for . Finally, show that this variable has the chi-square distribution with degrees of freedom.
Thus, let denote the probability density function and the distribution function for the chi-square distribution with degrees of freedom. In addition, for , let denote the quantile of order for the distribution, so that by definition, . For selected values of and , can be obtained from the table of the chi-square distribution, from the quantile applet, or from most statistical software packages.
Show or recall the following properties. For part (c), use the inverse function theorem of calculus.
Use the pivot variable to show that for any and any , a level confidence set for is
Show that the boundary curves of the confidence set have the form given in the equation below, where is the appropriate chi-square quantile. Thus, the confidence set is the region between the positive branches of two hyperbolas, each centered at as shown in the picture.
As before, the confidence set is unbounded, again not surprising since we are estimating two real parameters with a single real pivot variable. However, if is known, the confidence set gives a bounded, confidence interval for . Geometrically, this interval corresponds to the vertical cross section at . If we take square roots of the confidence bounds, then we have a confidence interval for . The assumption that is known is almost always artificial; the confidence sets that we will construct using the pivot variable will yield confidence intervals for without requiring the assumption that is known. In principle, we can also obtain a confidence set for , assuming that is known; this set corresponds to the horizontal cross section at . Note however that in general, this set is not be an interval but rather a union of two disjoint intervals. Thus our construction in the previous subsection is better in this case.
Let and let in the confidence set in Exercise 14 to show that the following are confidence set for
If is known, then part (a) gives a confidence lower bound for and part (b) gives a confidence upper bound for
Of the confidence sets in Exercise 14, with fixed confidence level, we prefer the one with the smallest size
, because this set gives the most information about the parameters. In this case, it's not clear how to measure the size of the set. When
is known, of course, our measure would be the length of the confidence interval. Note however, that this length is random. More importantly, minimizing the length as a function of
is computationally difficult. For all of the reasons, we usually use the equal tailed confidence set corresponding to
:
Try to minimize the length of the interval in Exercise 14 as a function of for fixed
Use the pivot variable to show that for any and any , a level confidence set for is
By design, this confidence set gives no information about , but gives a bounded, confidence interval for . Again, if we take square roots of the confidence bounds, then we have a confidence interval for .
Let and let in the confidence set in Exercise 18 to show that the following are confidence set for
Part (a) gives a confidence lower bound for and part (b) gives a confidence upper bound for
Of the confidence sets in Exercise 18, with fixed confidence level, we prefer the one with the smallest size
, because this set gives the most information about the parameters. In this case, of course, we should measure the size of the confidence set by the length of the confidence interval (the vertical cross section). Note that this length is random. More importantly, minimizing the length as a function of
is computationally difficult. Thus, we usually use the equal tailed confidence set corresponding to
:
Try to minimize the length of the interval in Exercise 18 as a function of for fixed
Use variance estimation experiment to explore the procedure. Select the normal distribution. Use various parameter values, confidence levels, sample sizes, and interval types. For each configuration, run the experiment 1000 times with an update frequency of 10. As the simulation runs, note that the confidence interval successfully captures the standard deviation if and only if the value of the pivot variable is between the quantiles. Note the size and location of the confidence intervals and note how well the proportion of successful intervals approximates the theoretical confidence level.
For our last basic confidence set, we will modify the standard normal pivot that we first studied, by replacing the distribution standard deviation with the sample standard deviation . This will lead to confidence intervals for without the assumption that is known. Fortunately, this works because of some special properties of the statistics when the sampling distribution is normal. Let
Show that
Use the previous exercise to show that can be written in terms of the basic pivot variables of Exercise 1 and that this variable has the student distribution with degrees of freedom. Thus, this variable is a pivot variable for and for alone.
For , we will let denote the probability density function and the distribution function for the distribution with degrees of freedom. Additionally, for , let denote the quantile of order for this distribution, so that . For selected values of and , values of can be obtained from the table of the distribution or from the quantile applet.
Show or recall the following properties. Part (d) follows from the inverse function theorem of calculus.
Show that for any and any , a confidence interval for is
By design, this confidence set gives no information about , but gives a bounded confidence interval for . The size of the confidence set is clearly measured by , the length of the horizontal cross section, or equivalently, the length of the confidence interval for .
Show that . Note in particular that the random. Now show that
Exercise 26 shows again that there is a tradeoff between the confidence level and the size of the confidence set. If and are fixed, we can decrease and hence tighten our estimate, only at the expense of decreasing our confidence in the estimate. Conversely, we can increase our confidence in the estimate only at the expense of increasing the size of the set. In terms of , the best of the two-sided confidence sets (and the one that is always used) is the equal-tailed set:
This set gives the equal-tailed confidence interval for . Note that this interval is symmetric about the sample mean , but again the length of the interval is random.
Let and let in the confidence set in Exercise 25 to show that the following are confidence sets for
Part (a) gives a confidence lower bound for and part (b) gives a confidence upper bound for .
Use the mean estimation experiment to explore the procedure. Select the normal distribution and select student pivot. Use various parameter values, confidence levels, sample sizes, and interval types. For each configuration, run the experiment 1000 times with an update frequency of 10. As the simulation runs, note that the confidence interval successfully captures the mean if and only if the value of the pivot variable is between the quantiles. Note the size and location of the confidence intervals and note how well the proportion of successful intervals approximates the theoretical confidence level.
We can now form intersections of some of the confidence sets constructed above to obtain bounded confidence sets for . We will use the fact that the sample mean and the sample variance are independent, one of the most important special properties of a normal sample. We will also need Exercise 1 from the Introduction which is based on Bonferroni's inequality. In the following exercises, suppose that with .
Show that the sets below are conservative confidence set for :
Use independence to show that is a confidence set for . This confidence set has a wedge shape as shown in the picture below.
It is interesting to note that the confidence set in Exercise 29(b) is a product set as a subset of the parameter space, but is not a product set as a subset of the sample space. By contrast, the confidence set in Exercise 30 is not a product set as a subset of the parameter space, but is a product set as a subset of the sample space.
The main assumption that we made was that the underlying sampling distribution is normal. Of course, in real statistical problems, we are unlikely to know much about the sampling distribution, let alone whether or not it is normal. When a statistical procedure works reasonably well, even when the underlying assumptions are violated, the procedure is said to be robust. In this subsection, we will explore the robustness of the estimation procedures for and
Suppose in fact that the underlying distribution is not normal. When the sample size is relatively large, the distribution of the sample mean will still be approximately normal by the central limit theorem. Thus, our interval estimates of may still be approximately valid.
Use the simulation of the mean estimation experiment to explore the procedure. Select the gamma distribution and select student pivot. Use various parameter values, confidence levels, sample sizes, and interval types. For each configuration, run the experiment 1000 times with an update frequency of 10. Note the size and location of the confidence intervals and note how well the proportion of successful intervals approximates the theoretical confidence level.
In the mean estimation experiment, repeat the previous exercise with the uniform distribution.
How large needs to be for the interval estimation procedures of to work well depends, of course, on the underlying distribution; the more this distribution deviates from normality, the larger must be. Fortunately, convergence to normality in the central limit theorem is rapid and hence, as you observed in the exercises, we can get away with relatively small sample sizes (30 or more) in most cases.
In general, the interval estimation procedures for is not as robust as the ones for
In variance estimation experiment, select the gamma distribution. Use various parameter values, confidence levels, sample sizes, and interval types. For each configuration, run the experiment 1000 times with an update frequency of 10. Note the size and location of the confidence intervals and note how well the proportion of successful intervals approximates the theoretical confidence level.
In variance estimation experiment, select the uniform distribution. Use various parameter values, confidence levels, sample sizes, and interval types. For each configuration, run the experiment 1000 times with an update frequency of 10. Note the size and location of the confidence intervals and note how well the proportion of successful intervals approximates the theoretical confidence level.
In the following exercises, use the equal-tailed construction for two-sided confidence intervals, unless otherwise instructed.
The length of a certain machined part is supposed to be 10 centimeters but due to imperfections in the manufacturing process, the actual length is a normally distributed with mean and variance . The variance is due to inherent factors in the process, which remain fairly stable over time. From historical data, it is known that . On the other hand, may be set by adjusting various parameters in the process and hence may change to an unknown value fairly frequently. A sample of 100 parts has mean 10.2.
Suppose that the weight of a bag of potato chips (in grams) is a normally distributed random variable with mean and standard deviation , both unknown. A sample of 75 bags has mean 250 and standard deviation 10.
At a telemarketing firm, the length of a telephone solicitation (in seconds) is a normally distributed random variable with mean and standard deviation , both unknown. A sample of 50 calls has mean length 300 and standard deviation 60.
At a certain farm the weight of a peach (in ounces) at harvest time is a normally distributed random variable with standard deviation 0.5. How many peaches must be sampled to estimate the mean weight with a margin of error and with 95% confidence.
The hourly salary for a certain type of construction work is a normally distributed random variable with standard deviation $1.25 and unknown mean . How many workers must be sampled to construct a 95% confidence lower bound for with margin of error $0.25?
In Michelson's data, assume that the measured speed of light has a normal distribution with mean and standard deviation , both unknown.
truevalue of the speed of light in this interval?
In Cavendish's data, assume that the measured density of the earth has a normal distribution with mean and standard deviation , both unknown.
truevalue of the density of the earth in this interval?
In Short's data, assume that the measured parallax of the sun has a normal distribution with mean and standard deviation , both unknown.
truevalue of the parallax of the sun in this interval?
Suppose that the length of an iris petal of a given type (Setosa, Verginica, or Versicolor) is normally distributed. Use Fisher's iris data to construct 90% two-sided confidence intervals for each of the following parameters.