]>
As usual, our starting point is a random experiment with an underlying sample space and a probability measure . In the basic statistical model, we have an observable random variable taking values in a set . In general, can have quite a complicated structure. For example, if the experiment is to sample objects from a population and record various measurements of interest, then
where is the vector of measurements for the object. The most important special case occurs when are independent and identically distributed. In this case, we have a random sample of size from the common distribution.
Suppose also that the distribution of depends on a parameter taking values in a parameter space . The parameter may also be vector-valued, in which case for some and .
A confidence set is a subset of the parameter space that depends only on the data variable , and no unknown parameters. Thus, in a sense, a confidence set is a set-valued statistic. A confidence set is an estimate of in the sense that we hope that with high probability. In particular, the confidence level is the smallest probability that :
Usually, we try to construct a confidence set for with a prescribed confidence level where . Typical confidence levels are 0.9, 0.95, and 0.99. Sometimes the best we can do is to construct a confidence set whose confidence level is at least this is called a conservative confidence set for .
Note that when we run the experiment and observe the data , the computed confidence set is . The true value of the parameter is either in this set, or is not, and we will usually never know. However, by the law of large numbers, if we were to repeat the confidence experiment over and over, the proportion of sets that contain would converge to . This is the precise meaning of the term confidence.
Next, note that the quality of a confidence set, as an estimator of
,
is based on two factors: the confidence level and the size
of the set. A good estimate has small size (and hence gives tight bounds on
)
and large confidence. However, for a given
,
there is usually a tradeoff between confidence level and size--increasing the confidence level comes only at the expense of increasing the size of the set, and decreasing the size of the set comes only at the expense of decreasing the confidence level. How we measure the size
of the confidence set depends on the dimension of the parameter space and the nature of the confidence set. Moreover, the size of the set is usually random, although in some special cases it may be deterministic.
Suppose that is a level confidence set for for . Show that if then is a conservative level confidence set for . Hint: Use Bonferroni's inequality.
In many cases, we are interested in estimating a real parameter taking values in an interval parameter space . In this context our confidence set frequently has the form
where and are statistics. In this case is called a confidence interval for . If and are both random, then the confidence interval is often said to be two-sided. In the special case that and is random, is called a lower confidence bound for and the interval is called an upper confidence interval for . In the special case that and is random, is called a upper confidence bound for and the interval is called an lower confidence interval for .
Suppose that is a level confidence lower bound for and that is a level confidence upper bound for . Show that if then is a conservative level confidence interval for . Hint: Use Exercise 1
You might think that it should be very difficult to construct confidence sets for a parameter . However, in many important special cases, confidence sets can be constructed easily from certain random variables known as pivot variables.
Suppose that is a function from into a set . The random variable is a pivot variable for if its distribution does not depend on . Specifically, is constant in for each . If we know the distribution of the pivot variable, then for a given , we can try to find (that does not depend on ) such that
It then follows that a confidence set for the parameter is given by
Suppose now that our pivot variable , is real-valued, which for simplicity, we will assume has a continuous distribution. For , let denote the quantile of order for the pivot variable . By the very meaning of pivot variable, does not depend on .
Show that for any , a level confidence set for is
The confidence set in Exercise 3 corresponds to in the left tail and in the right tail, in terms of the distribution of the pivot variable . The special case is the equal-tailed case, the most common case.
Show that the confidence set in Exercise 3 is decreasing in and hence increasing in (in the sense of the subset relation) for fixed .
Specializing further, suppose that is a vector of real parameters, and that we are interested in estimating one of the coordinates of ; the other coordinates are sometimes referred to as nuisance parameters in this context. It is often the case that the real-valued pivot variable is a strictly decreasing function of for each and for all values of the other coordinates of . In this setting, we can obtain a confidence set by inverting the pivot variable with respect to .
In the setting above, show that the confidence set for in Exercise 3 can be written in the following form, where is the parameter vector with deleted:
In words, we apply the inverse transformation to obtain bounds on that depend on the data variable , the other coordinates (nuisance parameters) of , and the quantiles of the pivot variable. If the other coordinates are known, then these bounds become statistics, and we have constructed a confidence interval for .
For the confidence set in Exercise 3, we would naturally like to choose that minimizes the size of the set in some sense. However this is often a difficult problem. The equal-tailed interval, corresponding to , is the most commonly used case, and is sometimes (but not always) an optimal choice.
Pivot variables are far from unique; the challenge is to find a pivot quantity whose distribution is known and which gives tight bounds on the parameter.
Suppose that is a pivot variable for . If is a function defined on the range of and involves no unknown parameters, show that is also a pivot variable for .
In the case of location-scale families of distributions, we can easily find pivot variables. Suppose that is a real-valued random variable with a continuous distribution that has probability density function , and no unknown parameters. Let where and are parameters. Recall that the probability density function of is given by
and the corresponding family of distributions is called the location-scale family associated with the distribution of ; is the location parameter and is the scale parameter. Generally, we are assuming that these parameters are unknown.
Now suppose that is a random sample of size from the distribution of ; this is our observable outcome vector. For each , let
Show that is a random sample of size from the distribution of .
In particular, note that is a pivot variable for , since is a function of , , and , but the distribution of does not depend on and . Hence, any function of will also be a pivot variable for , (if the function does not involve the parameters). Of course, some of these pivot variables will be much more useful than others in estimating and . In the following exercises, we will explore two common and important pivot variables.
Let and denote the sample means of and , respectively. Show that is a pivot variable for since
.Let denote the quantile function of the pivot variable . Show that for any , a confidence set for is
Show that the confidence set in Exercise 9 is a cone
in the
parameter space, with vertex at
and boundary lines of slopes
and
,
as shown in the graph below. (Note, however, that both slopes might be negative or both positive.)
The fact that the confidence set is unbounded is clearly not good, but is perhaps not surprising; we are estimating two real parameters with a single real-valued pivot variable. However, if is known, the confidence set defines a confidence interval for . Geometrically, the confidence interval simply corresponds to the horizontal cross section at .
In the confidence set in Exercise 9, let and , respectively, to show that confidence sets for are
If is known, then Exercise 11(a) gives a confidence lower bound for and Exercise 11(b) gives a confidence upper bound for .
Let and denote the sample standard deviations of and , respectively. Show that is a pivot variable for and a pivot variable for since
Let denote the quantile function of . Use the pivot variable to show that for any and any , a confidence set for is
Note that the confidence set gives no information about since the random variable in Exercise 13 is a pivot variable for alone. The confidence set can also be viewed as a bounded confidence interval for
In the confidence set in Exercise 13, let and , respectively, to show that confidence sets for are
The set in part (a) gives a confidence lower bound for and the set in part (b) gives a confidence upper bound for
We can intersect the confidence sets corresponding to the two pivot variables to produce conservative, bounded confidence sets.
Suppose that with . Use Exercise 1 to show that is a conservative confidence set for
The most important location-scale family is the family of normal distributions. The problem of estimation in the normal model is considered in the next section. In the remainder of this section, we will explore another important scale family.
Suppose is a random sample of size from the exponential distribution with scale parameter . Let
Show that has the chi-square distribution with degrees of freedom, and hence is a pivot variable for .
Note that the variable in Exercise 16 is a multiple of the variable in Exercise 8 (with ). Thus, let denote the probability density function and the distribution function for the chi-square distribution with degrees of freedom. In addition, for , let denote the quantile of order for the distribution. That is, . For selected values of and , can be obtained from the table of the chi-square distribution, from the quantile applet, or from most statistical software packages.
Show or recall that
Show that for any and any , a confidence interval for is
Show that
Of the two-sided confidence intervals in Exercise 18, we would naturally prefer the one with the smallest length, because this interval gives the most information about the parameter . However, minimizing the length as a function of is computationally difficult. The two-sided confidence interval that is typically used is the equal tailed interval obtained by letting :
Try to find the that minimizes the length of the interval in Exercise 18.