]>
As usual, we start with a random experiment with probability measure on an underlying sample space. Suppose that we have a random variable for the experiment, taking values in , and a function . Then is a new random variable taking values in . If the distribution of is known, how do we find the distribution of ? This is a very basic and important question, and in a superficial sense, the solution is easy.
Show that for .
However, frequently the distribution of is known either through its distribution function or its density function , and we would similarly like to find the distribution function or density function of . This is a difficult problem in general, because as we will see, even simple transformations of variables with simple distributions can lead to variables with complex distributions. We will solve the problem in various special cases.
Suppose that has a discrete distribution with probability density function (and hence is countable). Show that has a discrete distribution with probability density function given by
Suppose that has a continuous distribution on a subset with probability density function , and that is countable. Show that has a discrete distribution with probability density function given by
Suppose that has a continuous distribution on a subset and that has a continuous distributions on a subset . Suppose also that has a known probability density function . In many cases, the density function of can be found by first finding the distribution function of (using basic rules of probability) and then computing the appropriate derivatives of the distribution function. This general method is referred to, appropriately enough, as the distribution function method.
When the transformation is one-to-one and smooth, there is a formula for the probability density function of directly in terms of the probability density function of . This is known as the change of variables formula.
We will explore the one-dimensional case first, where the concepts and formulas are simplest. Thus, suppose that random variable has a continuous distribution on an interval . with distribution function and density function . Suppose that where is a differentiable function from onto an interval . As usual, we will let denote the distribution function of and the density function of .
Suppose that is strictly increasing on . Show that for ,
Suppose that is strictly decreasing on . Show that for ,
The density function formulas in Exercises 4 and 5 can be combined: if is a strictly monotone on then the density function of is given by
Letting , the change of variables formula can be written more compactly as
![]() |
![]() |
The generalization this result is basically a theorem in multivariate calculus. Suppose that is a random variable taking values in , and that has a continuous distribution with probability density function . Suppose that where is a one-to-one, differentiable function form onto . The first derivative of the inverse function is the matrix of first partial derivatives:
The Jacobian (named in honor of Karl Gustav Jacobi) of the inverse function is the determinant of the first derivative matrix
With this compact notation, the multivariate change of variables formula states that the density of is given by
Linear transformations (or more technically affine transformations) are among the most common and important transformations. Moreover, this type of transformation leads to a simple application of the change of variable theorem. Suppose that is a random variable taking values in and that has a continuous distribution on with probability density function . Let where and Note that and that takes values in .
Apply the change of variables theorem to show that has probability density function
When (which is often the case in applications), this transformation is known as a location-scale transformation; is the location parameter and is the scale parameter. Location-scale transformations are studied in more detail in the chapter on Special Distributions.
The multivariate version of this result has a simple and elegant form when the linear transformation is expressed in matrix-vector form. Thus suppose that is a random variable taking values in and that has a continuous distribution on with probability density function . Let where and is an invertible matrix. Note that and that takes values in .
Show that
Apply the change of variables theorem to show that has probability density function
Simple addition of random variables is perhaps the most important of all transformations. Suppose that and are real-valued random variables for a random experiment. Our goal is to find the distribution of .
Suppose that has a discrete distribution with probability density function . Show that has a discrete distribution with probability density function given by
The sum is actually over the countable set .
Suppose that has a continuous distribution with probability density function . Show that the has a continuous distribution with probability density function given by
By far the most important special case occurs when and are independent.
Suppose that and are independent and have discrete distributions with densities and respectively. Show that has a probability density function
The sum is actually over the countable set . The probability density function defined in the previous exercise is called the discrete convolution of and .
Suppose that and are independent and have continuous distributions with densities and respectively. Show that has a probability density function given by
The integral is actually over the set . The probability density function is called the continuous convolution of and .
Show that convolution (either discrete or continuous) satisfies the properties below (where , . and are probability density functions). Give an analytic proof, based on the definition of convolution, and a probabilistic proof, based on sums of independent random variables
Thus, in part (b) we can write without ambiguity. Note that if is a sequence of independent and identically distributed random variables, with common probability density function . then
has probability density function , the -fold convolution power of . When appropriately scaled and centered, the distribution of converges to the standard normal distribution as . The precise statement of this result is the central limit theorem, one of the fundamental theorems of probability. The central limit theorem is studied in detail in the chapter on Random Samples.
Suppose that is a sequence of independent real-valued random variables. The minimum and maximum transformations are very important in a number of applications.
For example, recall that in the standard model of structural reliability, a system consists of components that operate independently. Suppose that represents the lifetime of component . Then is the lifetime of the series system which operates if and only if each component is operating. Similarly, is the lifetime of the parallel system which operates if and only if at least one component is operating.
A particularly important special case occurs when the random variables are identically distributed, in addition to being independent. In this case, the sequence of variables is a random sample of size from the common distribution. We usually think of the random variables as independent copies of an underlying random variable. The minimum and maximum variables are the extreme examples of order statistics. Order statistics are studied in detail in the chapter on Random Samples.
Let denote the distribution function of for each , and let and denote the distribution functions of and respectively.
Show that for ,
Show that for ,
From Exercise 14, note that the product of distribution functions is another distribution function. From Exercise 15, the product of right-tail distribution functions is a right-tail distribution function. In the reliability setting, where the random variables are nonnegative, the product of reliability functions is a reliability function. If has a continuous distribution with probability density function for each then and also have continuous distributions, and their probability density functions can be obtained by differentiating the distribution functions in Exercises 14 and 15.
The formulas are particularly nice when the random variables are identically distributed, in addition to being independent. Suppose that this is the case, and let be the common distribution function.
Show that for ,
In particular, it follows that a positive integer power of a distribution function is a distribution. More generally, it's easy to see that every positive power of a distribution function is a distribution function. How could we construct a non-integer power of a distribution function in a probabilistic way? Now, in addition to the independent and identically distributed assumptions, suppose that the common distribution of the variables is continuous, with density function . Let and denote the density functions of and . respectively.
Show that for ,
Suppose that has a continuous distribution on with distribution function and density function . Show that
Let denote the sign of , defined by
.Suppose that the probability density of is symmetric with respect to 0, so that for all . Show that
This subsection contains computational exercises, many of which involve special parametric families of distributions. It is always interesting when a random variable from one parametric family can be transformed into a variable from another family. It is also interesting when a parametric family is closed or invariant under some transformation on the variables in the family. Often, such properties are what make the parametric families special in the first place. Please note these properties when they occur.
Recall that a standard die is an ordinary 6-sided die. A fair die is one in which the faces are equally likely. An ace six flat die is a standard die in which faces 1 and 6 occur with probability each and the other faces with probability
Suppose that two standard dice are rolled and the sequence of scores is recorded. Find the probability density function of , the sum of the scores, in each of the following cases:
In the dice experiment, select two dice and select the sum random variable. Run the simulation 1000 times, updating every 10 runs and note the apparent convergence of the empirical density function to the probability density function for each of the following cases:
A fair die and an ace-six flat die are rolled. Find the probability density function of the sum of the scores.
Suppose that standard, fair dice are rolled. Find the probability density function of the following variables:
In the dice experiment, select fair dice and select each of the following random variables. Vary with the scroll bar and note the shape of the density function. With , run the simulation 1000 times, updating every 10 runs. Note the apparent convergence of the empirical density function to the probability density function.
Let . Find the density function of and sketch the graph in each of the following cases:
Compare the distributions in the last exercise. Note that even a simple transformation of a simple distribution can produce a complicated distribution. On the other hand, the uniform distribution is preserved under a linear transformation of the random variable.
Suppose that is uniformly distributed on . Let , where and is an invertible matrix. Show that is uniformly distributed on .
Suppose that is uniformly distributed on the square . Let .
Suppose that is a sequence of independent random variables, each uniformly distributed on . Find the distribution and density functions of the following variables. Both distributions are beta distributions.
In the order statistic experiment, select the uniform distribution.
Let denote the density function of the uniform distribution on . Compute and . Graph the three density functions on the same set of axes. Note the behavior predicted by the central limit theorem beginning to emerge.
A remarkable fact is that the uniform distribution on can be transformed into almost any other distribution on . This is particularly important for simulations, since many computer languages have an algorithm for generating random numbers, which are simulations of independent variables, each uniformly distributed on . Conversely, any continuous distribution supported on an interval of can be transformed into the uniform distribution on .
Suppose first that is a distribution function for a distribution on (which may be discrete, continuous, or mixed), and let denote the quantile function.
Suppose that is uniformly distributed on . Show that has distribution function .
Assuming that we can compute , the previous exercise shows how we can simulate a distribution with distribution function . To rephrase the result, we can simulate a variable with distribution function by simply computing a random quantile. Most of the applets in this project use this method of simulation.
![]() |
![]() |
Suppose that has a continuous distribution on an interval and that the distribution function is strictly increasing on . Show that has the uniform distribution on .
Show how to simulate the uniform distribution on the interval with a random number. Using your calculator, simulate 5 values from the uniform distribution on the interval .
Recall that a Bernoulli trials sequence is a sequence of independent, identically distributed indicator random variables. In the usual terminology of reliability theory, means failure on trial . while means success on trial . The basic parameter of the process is the probability of success . The random process is named for Jacob Bernoulli and is studied in detail in the chapter on Bernoulli trials.
Show that the common probability density function of the trial variables is .
Let denote the number of successes in the first trials. Use an argument based on combinatorics and independence to show that has the probability density function given below. This defines the binomial distribution with parameters and .
As above, let denote the number of successes in the first trials.
In particular, it follows that if and are independent variables, and that has the binomial distribution with parameters and while has the binomial distribution with parameter and . Show that has the binomial distribution with parameter and .
Find the probability density function of the difference between the number of successes and the number of failures in Bernoulli trials.
Recall that the Poisson distribution with parameter has probability density function
This distribution is named for Simeon Poisson and is widely used to model the number of random points in a region of time or space. The Poisson distribution is studied in detail in the chapter on The Poisson Process.
Suppose that and are independent variables, and that has the Poisson distribution with parameter while has the Poisson distribution with parameter . Show that has the Poisson distribution with parameter . Equivalently, show that . Hint: You will need to use the binomial theorem.
Recall that the exponential distribution with rate parameter has probability density function . These distributions are often used to model random times such as failure times and arrival times. Exponential distributions are studied in more detail in the chapter on Poisson Processes.
Show how to simulate, with a random number, the exponential distribution with rate parameter . Using your calculator, simulate 5 values from the exponential distribution with parameter .
Suppose that has the exponential distribution with rate parameter . Find the density function of the following random variables:
Note that the distributions in the previous exercise are geometric distributions on and on , respectively. In many respects, the geometric distribution is a discrete version of the exponential distribution.
Suppose that and are independent random variables, each having the exponential distribution with parameter 1. Let .
Suppose that has the exponential distribution with rate parameter , has the exponential distribution with rate parameter , and that and are independent. Find the probability density function of and sketch the graph.
Suppose that is a sequence of independent random variables, and that has the exponential distribution with rate parameter for each .
Note that the minimum in part (a) has the exponential distribution with parameter . In particular, suppose that a series system has independent components, each with an exponentially distributed lifetime. Then the lifetime of the system is also exponentially distributed, and the failure rate of the system is the sum of the component failure rates.
In the order statistic experiment, select the exponential distribution.
In the setting of Exercise 44, show that for ,
The result in the previous exercise is very important in the theory of continuous-time Markov chains.
Recall that for , the gamma distribution with shape parameter has probability density function
This distribution is widely used to model arrival times under certain basic assumptions.. The gamma distribution is studied in detail in the chapter on The Poisson Process.
Show that if has the gamma distribution with shape parameter and has the gamma distribution with shape parameter . and if and are independent, then has the gamma distribution with shape parameter . Equivalently, show that .
Recall that the Pareto distribution with shape parameter has probability density function
The distribution is named for Vilfredo Pareto. It is a heavy-tailed distribution often used for modeling income and other financial variables. The Pareto distribution is studied in more detail in the chapter on Special Distributions.
Suppose that has the Pareto distribution with shape parameter . Find the density function of . Note that has the exponential distribution with rate parameter .
Suppose that has the Pareto distribution with shape parameter . Find the probability density function of . The distribution of is the beta distribution with parameters and .
Show how to simulate, with a random number, the Pareto distribution with shape parameter . Using your calculator, simulate 5 values from the Pareto distribution with shape parameter .
Recall that the standard normal distribution has probability density function
Suppose that has the standard normal distribution, and that and . Find the density function of and sketch the graph.
Random variable has the normal distribution with location parameter and scale parameter . The normal distribution is perhaps the most important distribution in probability and mathematical statistics. It is widely used to model physical measurements of all types that are subject to small, random errors. The normal distribution is studied in detail in the chapter on Special Distributions.
Suppose that has the standard normal distribution. Find the density function of and sketch the graph.
Random variable has the chi-square distribution with 1 degree of freedom. Chi-square distributions are studied in detail in the chapter on Special Distributions.
Suppose that has the normal distribution with location parameter and scale parameter , has the normal distribution with location parameter and scale parameter . and that and are independent. Show that has the normal distribution with location parameter and scale parameter
Suppose that and are independent random variables, each with the standard normal distribution. Show that has probability density function given below, and sketch the graph.
Random variable has the Cauchy distribution, named after Augustin Cauchy; it is a member of the family of student distributions. The student distributions are studied in detail in the chapter on Special Distributions.
For , show that the probability density function of is
Of course, this is the Cauchy distribution with scale parameter .
Show that if has the Cauchy distribution with scale parameter , has the Cauchy distribution with scale parameter . and and are independent, then has the Cauchy distribution with scale parameter . Equivalently, show that
Suppose that has the probability density function . Find the probability density function of and sketch its graph.
Random variables and both have beta distributions, which are widely used to model random proportions and probabilities. The family of beta distributions is studied in detail in the chapter on Special Distributions.