]>
As usual, our starting point is a random experiment with probability measure on an underlying sample space. A generating function of a random variable is an expected value of a certain transformation of the variable. Most generating functions share four important properties:
Property 1 is most important. Often a random variable is shown to have a certain distribution by showing that the generating function has a certain form. The process of recovering the distribution from the generating function is known as inversion. Property 2 is frequently used to determine the distribution of a sum of independent variables. By contrast, recall that the probability density function of a sum of independent variables is the convolution of the individual density functions, a much more complicated operation. Property 3 is useful because often computing moments from the generating function is easier than computing the moments directly from the definition. The last property is known as the continuity theorem. Often it is easer to show the convergence of the generating functions than to prove convergence of the distributions directly.
Suppose that is a random variable taking values in . The probability generating function of is defined as follows, for all values for which the expected value exists:
Let denote the probability density function of , so that .
Show that
Thus, is a power series in , with the values of the probability density function as the coefficients. In the language of combinatorics, is the ordinary generating function of . Recall from calculus that there exists such that the series converges absolutely for and diverges for . The number is the radius of convergence of the series.
Show that and hence .
Recall from calculus that a power series can be differentiated term by term, just like a polynomial. Each derivative series has the same radius of convergence as the original series. We denote the derivative of order by .
Show that for . Thus, the probability generating function completely determines the distribution of .
Show that . This is not a particularly important result, but has a certain curiosity.
Recall that if and are nonnegative integers with , then the number of permutations of size chosen from a population of objects is
Suppose that the radius of convergence satisfies . Show that for ; these moments are called factorial moments. In particular, has finite moments of all orders.
In particular, show that
Suppose that and are independent random variables with probability generating functions and respectively. Show that the probability generating function of is
Let be a real-valued random variable. The moment generating function of is the function defined by
Note that since with probability 1, exists, as a real number or , for any .
Suppose that has a continuous distribution on with probability density function . Show that
Thus, the moment generating function of is closely related to the Laplace transform of the density function . The Laplace transform is named for Simeon Laplace, and is widely used in many areas of applied mathematics.
The basic inversion theorem for moment generating functions states that if for is some open interval about 0, then completely determines the distribution of . Thus, if two distributions on have moment generating functions that are equal (and finite) in an open interval about 0, then the distributions are the same.
Suppose that has moment generating function and that is finite in some interval about 0. Then has moments of all orders. Show formally the following result (the interchange of sum and expected value is justified by the finiteness assumption).
Show that for . Thus, the derivatives of the moment generating function at 0 determine the moments of the variable (hence the name). In the language of combinatorics, the moment generating function is the exponential generating function of the sequence of moments.
Thus, a random variable that does not have finite moments of all orders cannot have a finite moment generating function. Even when a random variable does have moments of all orders, the moment generating function may not exist. A counterexample is given below.
Suppose that is a real-valued random variable with moment generating function and that and are constants. Show that the moment generating function of is
Suppose that and are independent, real-valued random variables with moment generating functions and respectively. Show that the moment generating function of is
Suppose that is a random variable taking values in with probability generating function . Show that the moment generating function of is
Suppose that is a real-valued random variable with moment generating function . Prove the Chernoff bounds:
Hint: Show that if and if . Then use Markov's inequality.
Naturally, the best Chernoff bound (in either (a) or (b)) is obtained by finding that minimizes .
From a mathematical point of view, the nicest of the generating functions is the characteristic function which is defined for a real-valued random variable by
Note that is a complex valued function, and thus this subsection requires knowledge of complex analysis, at the undergraduate level. Note that is defined for all because the random variable in the expected value is bounded in magnitude. Indeed, for all . Many of the properties of the characteristic function are more elegant than the corresponding properties of the probability or moment generating functions, because the characteristic function always exists.
Suppose that has a continuous distribution on with probability density function . Show that
Thus, the characteristic function of is closely related to the Fourier transform of the density function . The Fourier transform is named for Joseph Fourier, and is widely used in many areas of applied mathematics.
The characteristic function completely determines the distribution. That is, random variables and have the same distribution if and only if they have the same characteristic function. Indeed, the general inversion formula is a formula for computing certain combinations of probabilities from the characteristic function: if then
The probability combinations on the right side completely determine the distribution of . Suppose that has a continuous distribution with probability density function . A special inversion formula states that at every point where is differentiable,
As with the other generating functions, the characteristic function can be used to find the moments of . Moreover, this can be done even when only some of the moments exist. If then
and therefore
Suppose that is a real-valued random variable with characteristic function and that and are constants. Show that the characteristic function of is
Suppose that and are independent, real-valued random variables with characteristic functions and respectively. Show that the characteristic function of is
The characteristic function of a random variable can be obtained from the moment generating function, under the basic existence condition that we saw earlier. Specifically, suppose that is a real-valued random variable with moment generating function that satisfies for in some interval about 0. Then the characteristic function of satisfies for .
The final important property of characteristic functions that we will discuss relates to convergence in distribution. Suppose that is a sequence of real-valued random with characteristic functions respectively. The random variables need not be defined on the same probability space.
The continuity theorem states that if the distribution of converges to the distribution of a random variable as and has characteristic function , then as for all . Conversely, if as for in some open interval about 0, and is continuous at 0, then is the characteristic function of a random variable , and the distribution of converges to the distribution of as .
The continuity theorem can be used to prove the central limit theorem, one of the fundamental theorems of probability. Also, the continuity theorem has a straightforward generalization to distributions on .
Suppose now that is a random vector for an experiment, taking values in . The (joint) characteristic function of is defined by
Once again, the most important fact is that completely determines the distribution: two random vectors taking values in have the same characteristic function if and only if they have the same distribution.
The joint moments can be obtained from the derivatives of the characteristic function. Suppose that and . If then
Now let , , and denote the characteristic functions of , , and , respectively.
Show that
Show that and are independent if and only if for all .
Naturally, the results for bivariate characteristic functions have analogies in the general multivariate case. Only the notation is more complicated.
Suppose is an indicator random variable with , where is a parameter. Show that has probability generating function for
Recall that a Bernoulli trials process is a sequence of independent, identically distributed indicator random variables. In the usual language of reliability, denotes the outcome of trial , where 1 denotes success and 0 denotes failure. The probability of success is the basic parameter of the process. The process is named for James Bernoulli. A separate chapter on the Bernoulli Trials explores this process in more detail.
The number of successes in the first trials is . Recall that this random variable has the binomial distribution with parameters and , which has density function
Show that has probability generating function in two ways:
Show that
Suppose that has probability density function where is a parameter. Thus, has the geometric distribution on with parameter , and models the trial number of the first success in a sequence of Bernoulli trials. Let denote the probability generating function of . Show that
Suppose that has the binomial distribution with parameters and , has the binomial distribution with parameters and , and that and are independent.
Recall that the Poisson distribution has density function
where
is a parameter. The Poisson distribution is named after Simeon Poisson and is widely used to model the number of random points
in a region of time or space; the parameter is proportional to the size of the region of time or space. The Poisson distribution is studied in more detail in the chapter on the Poisson Process.
Suppose that has Poisson distribution with parameter . Let denote the probability generating function of . Show that
Suppose that has the Poisson distribution with parameter , has the Poisson distribution with parameter , and that and are independent. Show that has the Poisson distribution with parameter .
Suppose that has the Poisson distribution with parameter . Use the Chernoff bounds to show that
Let denote the probability generating function of the binomial distribution with parameters and , where as (and where ). Let denote the probability generating function of the Poisson distribution with parameter . Show that as for any . Thus conclude that the binomial distribution with parameters and converges to the Poisson distribution with parameter as .
Recall that the exponential distribution is a continuous distribution with probability density function
where
is the rate parameter. This distribution is widely used to model failure times and other arrival times
. The exponential distribution is studied in more detail in the chapter on the Poisson Process.
Suppose that has the exponential distribution with rate parameter . Let denote the moment generating function of . Show that
Suppose that is a sequence of independent random variables, each having the exponential distribution with rate parameter . Find the moment generating function of . Random variable has the gamma distribution with parameters and .
Suppose that is uniformly distributed on the interval . Let denote the moment generating function of . Show that
Suppose that is uniformly distributed on the triangle .
Suppose that has probability density function
Recall that the standard normal distribution is a continuous distribution with density function . Normal distributions are widely used to model physical measurements subject to small, random errors and are studied in more detail in the chapter on Special Distributions.
Suppose that has the standard normal distribution. Let denote the moment generating function of . Show that
Suppose again that has the standard normal distribution. Recall that has the normal distribution with mean and standard deviation . Show that the moment generating function of is for .
Suppose that and are independent random variables, each with a normal distribution. Show that has a normal distribution.
Suppose that has the Pareto distribution, which is a continuous distribution with probability density function
where is a parameter. The Pareto distribution is named for Vilfredo Pareto. It is a heavy-tailed distribution that is widely used to model financial variables such as income. The Pareto distribution is studied in more detail in the chapter on Special Distributions.
Let denote the moment generating function of . Show that
Suppose that has the Cauchy distribution, a continuous distribution with probability density function
.This distribution is named for Augustin Cauchy and is a member of the family of student distributions. The distributions are studied in more detail in the chapter on Special Distributions. The graph of is known as the Witch of Agnesi, named for Maria Agnesi.
Let denote the moment generating function of . Show that
Let denote the characteristic function of . Show that for .
For the Pareto distribution, only some of the moments are finite; naturally, the moment generating function was infinite. We will now give an example of a distribution for which of the moments are finite, yet still the moment generating function is infinite. Furthermore, we will see two different distributions that have the same moments of all orders.
Suppose that Z has the standard normal distribution and let . The distribution of is known as a lognormal distribution.
Use the change of variables formula to show that has probability density function
Use the moment generating function of the standard normal distribution to show that for . Thus, has finite moments of all orders.
Show that for . Thus, the moment generating function of is infinite at any positive value of .
Now let . Show that for ,
where is normally distributed with mean and standard deviation 1. Hint: Use the change of variables and then complete the square in the exponential factor.
Use the result in the previous exercise and symmetry to conclude that for ,
Let . Use the result in the previous exercise to show that is a probability density function.
Let have probability density function . Use the result in Exercise 40 to show that has the same moments as . That is, for .
The graphs of and are shown below, in blue and red, respectively.