]>
As usual, we start with a random experiment with probability measure on an underlying sample space. Suppose now that and are random variables for the experiment, and that takes values in while takes values in . We can think of as a random variable taking values in the product set . The purpose of this section is to study how the distribution of is related to the distributions of and individually. In this context, the distribution of is called the joint distribution, while the distributions of and of are referred to as marginal distributions. Note that and themselves may be vector valued.
The first simple, but very important point, is that the marginal distributions can be obtained from the joint distribution, but not conversely.
Show that
If and are independent, then by definition,
and as we have noted before, this completely determines the distribution on . However, if and are dependent, the joint distribution cannot be determined from the marginal distributions. Thus in general, the joint distribution contains much more information than the marginal distributions individually.
In the discrete case, note that is countable if and only if and are countable.
Suppose that has a discrete distribution with probability density function on a countable set . Show that and have density functions and , respectively, given by
For the continuous case, suppose that and so that
Suppose that has a continuous distribution on with probability density function . Show that and have continuous distributions with probability density functions and , respectively, given by
In the context of Exercises 2 and 3, is called the joint probability density function of , while and are called the marginal density functions of and of , respectively.
When the variables are independent, the joint density is the product of the marginal densities.
Suppose that and are independent, either both with discrete distributions or both with continuous distributions. Let and denote the probability density functions of and respectively. Show that has probability density function given by
The following exercise gives a converse to Exercise 4. If the joint probability density factors into a function of only and a function of only, then and are independent.
Suppose that has either a discrete or continuous distribution, with probability density function . Suppose that
where and . Show that and are independent and that there exists a nonzero constant such that is a probability density function for and is a probability density function for .
The results of this section have natural analogies in the case that has coordinates with different distribution types, as discussed in the section on mixed distributions. For example, suppose has a discrete distribution, has a continuous distribution, and has joint probability density function on . Then the results in exercises 2(a), 3(b), 4, and 5 hold.
Suppose that two standard, fair dice are rolled and the sequence of scores recorded. Let and denote the sum and difference of the scores, respectively.
Suppose that two standard, fair dice are rolled and the sequence of scores recorded. Let and denote the minimum and maximum scores, respectively.
Suppose that has probability density function for .
Suppose that has probability density function for .
Suppose that has probability density function for .
Suppose that has probability density function for .
Suppose that has probability density function for .
Suppose that has probability density function for .
Multivariate uniform distributions give a geometric interpretation of some of the concepts in this section. Recall first that the standard Lebesgue measure on is
In particular, is the length measure on , is the area measure on , and is the volume measure on .
Suppose now that takes values in , takes values in , and that is uniformly distributed on a set . Thus, by definition, the joint probability density function of is
Let and be the projections of onto and respectively, defined as follows:
Note that . Next we define the cross-sections at and at , respectively by
Show that takes values in and that the probability density function of is proportional to the cross-sectional measure:
Show that takes values in and that the probability density function of is proportional to the cross-sectional measure:
In particular, note from previous exercises that and are not in general either independent nor uniformly distributed.
Suppose that . Show that
In each of the following cases, find the joint and marginal densities, and determine if and are independent.
In the bivariate uniform experiment, run the simulation 5000 times, updating every 10 runs for each of the following cases. Watch the points in the scatter plot and the graphs of the marginal distributions. Interpret what you see in the context of the discussion above.
Suppose that is uniformly distributed on the cube .
Suppose that is uniformly distributed on .
The following exercise shows how an arbitrary continuous distribution can be obtained from a uniform distribution. This result is useful for simulating certain continuous distributions.
Suppose that is a probability density function for a continuous distribution on . Let
Show that if is uniformly distributed on , then has probability density function . A picture in the case is given below:
Suppose that a population consists of objects, and that each object is one of four types. There are type 1 objects, type 2 objects, type 3 objects and type 0 objects. The parameters , , and are nonnegative integers with . We sample objects from the population at random, and without replacement. Denote the number of type 1, 2, and 3 objects in the sample by , , and , respectively. Hence, the number of type 0 objects in the sample is . In the problems below, the variables , , and take values in the set .
Use a combinatorial argument to show that has a (multivariate) hypergeometric distribution, with probability density function:
Use both a combinatorial argument and an analytic argument to show that also has a (multivariate) hypergeometric distribution, with the probability density function given below. The essence of the combinatorial argument is that we are selecting a random sample of size from a population of objects, with objects of type 1, objects of type 2, and objects of other types.
Use both a combinatorial argument and an analytic argument to show that has an ordinary hypergeometric distribution, with the probability density function given below. The essence of the combinatorial argument is that we are selecting a random sample of size from a population of size , with objects of type 1 and objects of other types.
These results generalize in a straightforward way to a population with any number of types. In brief, if a random vector has a hypergeometric distribution, then any sub-vector also has a hypergeometric distribution. In other terms, all of the marginal distributions of a hypergeometric distribution are themselves hypergeometric. The hypergeometric distribution and the multivariate hypergeometric distribution are studied in detail in the chapter on Finite Sampling Models.
Recall that a bridge hand consists of 13 cards selected at random and without replacement from a standard deck of 52 cards. Let , , and denote the number of spades, hearts, and diamonds, respectively, in the hand. Find the density function of each of the following:
Suppose that we have a sequence of independent trials, each with 4 possible outcomes. On each trial, outcome 1 occurs with probability , outcome 2 with probability , outcome 3 with probability , and outcome 0 occurs with probability . The parameters , , and are nonnegative numbers with . Denote the number of times that outcome 1, outcome 2, and outcome 3 occurred in the trials by , , and respectively. Of course, the number of times that outcome 0 occurs is . In the problems below, the variables , , and take values in the set .
Use a probability argument (based on combinatorics and independence) to show that has a multinomial distribution, with probability density function given by
Use a probability argument and an analytic argument to show that also has a multinomial distribution, with the probability density function given below. The essence of the probability argument is that we have independent trials, and on each trial, outcome 1 occurs with probability , outcome 2 with probability , and some other outcome with probability .
Use a probability argument and an analytic argument to show that has a binomial distribution, with the probability density function given below. The essence of the probability argument is that we have independent trials, and on each trial, outcome 1 occurs with probability and some other outcome with probability
These results generalize in a completely straightforward way to multinomial trials with any number of trial outcomes. In brief, if a random vector has a multinomial distribution, then any sub-vector also has a multinomial distribution. In other terms, all of the marginal distributions of a multinomial distribution are themselves multinomial. The binomial distribution and the multinomial distribution are studied in detail in the chapter on Bernoulli Trials.
Recall that an ace-six flat die is a standard 6-sided die in which faces 1 and 6 have probability each, while faces 2, 3, 4, and 5 have probability each. Suppose that an ace-six flat die is thrown 10 times; let denote the number of times that score occurred for . Find the density function of each of the following:
Suppose that has probability the density function given below:
Suppose that has the probability density function given below:
The joint distributions in the last two exercises are examples of bivariate normal distributions. Normal distributions are widely used to model physical measurements subject to small, random errors. The bivariate normal distribution is studied in more detail in the chapter on Special Distributions.
Recall that the exponential distribution has probability density function
where is the rate parameter. The exponential distribution is widely used to model random times, and is studied in more detail in the chapter on the Poisson Process.
Suppose and have exponential distributions with parameters and , respectively, and are independent. Show that
Suppose , , and have exponential distributions with parameters , , and , respectively, and are independent. Show that
If , , and are the lifetimes of devices that act independently, then the results in the previous two exercises give probabilities of various failure orders. Results of this type are also very important in the study of continuous-time Markov processes. We will continue this discussion in the section on transformations of random variables.
Suppose takes values in the finite set , takes values in the interval , and that the joint density function is given by
Suppose that takes values in the interval , takes values in the finite set , and that has joint probability density function given by
.As we will see in the section on conditional distributions, the distribution in the last exercise models the following experiment: a random probability is selected, and then a coin with this probability of heads is tossed 3 times; is the number of heads.
For the cicada data, denotes gender and denotes species type.
For the cicada data, denotes body weight and denotes body length (in mm).
For the cicada data, denotes gender and denotes body weight (in grams).