]>
Suppose again that our random experiment is to perform a sequence of Bernoulli trials with parameter . Recall that the number of successes in the first trials
has the binomial distribution with parameters and . In this section we will study the random variable that gives the trial number of the success:
Note that is the number of trials needed to get the first success, which we now know has the geometric distribution on with parameter .
Show that if and only if and .
Use Exercise 1, independence, and the binomial distribution to show that
The distribution defined by the density function in Exercise 2 is known as the negative binomial distribution; it has two parameters, the number of successes and the success probability .
In the negative binomial experiment, vary and with the scroll bars and note the shape of the density function. For selected values of and , run the experiment 1000 times with an update frequency of 10. Watch the apparent convergence of the relative frequency function to the density function.
Show that the binomial and negative binomial sequences are inverse to each other in the sense that
This property is implicit in the definition of given at the beginning of this section. In particular, argue that any event that can be expressed in terms of the negative binomial variables can also be expressed in terms of the binomial variables.
Show that if and only if . Thus, the density function at first increases and then decreases, reaching its maximum value at . This integer is the mode of the distribution and hence the negative binomial distribution is unimodal.
Next we will define the random variables that give the number of trials between successive successes. Let
Show that is a sequence of independent random variables, each having the geometric distribution on with parameter . Moreover,
In statistical terms, corresponds to sampling from the geometric distribution with parameter , so that for each , is a random sample of size from this distribution. The sequence of negative binomial variables is the partial sum process corresponding to the sequence . In statistical terms, is the sample mean corresponding to the random sample ; this random variable gives the average number of trials between the first successes. Partial sum processes are studied in more generality in the chapter on Random Samples.
Use the partial sum representation to prove the following properties.
Actually, any partial sum process corresponding to an independent, identically distributed sequence will have stationary, independent increments.
The mean, variance and probability generating function of now follow easily from the representation as a sum of independent, identically distributed geometrically distributed variables..
Show that .
Show that .
Show that for
In the negative binomial experiment, vary and with the scroll bars and note the location and size of the mean/standard deviation bar. For selected values of the parameters, run the experiment 1000 times with an update frequency of 10. Watch the apparent convergence of the sample mean and standard deviation to the distribution mean and standard deviation.
Verify the results of Exercise 8, Exercise 9, and Exercise 10 directly using the probability density function. Note that this method requires significantly more work.
Suppose that and are independent random variables for an experiment, and that has the negative binomial distribution with parameters and , and has the negative binomial distribution with parameters and . Show that has the negative binomial distribution with parameters and .
In the negative binomial experiment, start with various values of and . Successively increase by 1, noting the shape of the probability density function each time.
Even though you are limited to , you can still see the characteristic bell shape. This is a consequence of the central limit theorem because the negative binomial variable can be written as a sum of independent, identically distributed (geometric) random variables.
Show that the distribution of the standardized variable given below converges to the standard normal distribution as .
From a practical point of view, this result means that if
is large
, the distribution of
is approximately normal with mean and variance given in Exercise 8 and Exercise 9, respectively. Just how large
needs to be for the approximation to work well depends on
. Also, when using the normal approximation, we should remember to use the continuity correction, since the negative binomial is a discrete distribution.
Suppose that . Show that
where . Thus, given successes in the first trials, the vector of success trial numbers is uniformly distributed on . Equivalently, the vector of success trial numbers is distributed as the vector of order statistics corresponding to a sample of size chosen at random and without replacement from .
Show that
Again, this is the distribution of the order statistic when a sample of size is selected at random and without replacement from the population .
A standard, fair die is thrown until 3 aces occur. Let denote the number of throws.
A coin is tossed repeatedly. The head occurs on the toss.
A certain type of missile has failure probability 0.02. Let denote the launch number of the fourth failure.
In the negative binomial experiment, set and . Run the experiment 1000 times with an update frequency of 100. Compute and compare each of the following:
A coin is tossed until the head occurs.
Suppose that an absent-minded professor (is there any other kind?) has matches in his right pocket and matches in his left pocket. When he needs a match to light his pipe, he is equally likely to choose a match from either pocket. We want to compute the probability density function of the random variable that gives the number of matches remaining when the professor first discovers that one of the pockets is empty. This is known as the Banach match problem, named for the mathematician Stefan Banach, who evidently behaved in the manner described.
We can recast the problem in terms of the negative binomial distribution. Clearly the match choices form a sequence of Bernoulli trials with parameter . Specifically, we can consider a match from the right pocket as a win for player , and a match from the left pocket as a win for player . In a hypothetical infinite sequence of trials, let denote the number of trials necessary for to win trials, and the number of trials necessary for to win trials. Note that and each have the negative binomial distribution with parameters and .
For , show that
For , show that
Combine the results of the previous two exercises to conclude that
We can also solve the non-symmetric Banach match problem, using the same methods as above. Thus, suppose that the professor reaches for a match in his right pocket with probability and in his left pocket with probability , where . The essential change in the analysis is that has the negative binomial distribution with parameters and , while has the negative binomial distribution with parameters and .
Show that
Suppose that two teams (or individuals) and play a sequence of Bernoulli trials, where is the probability that player wins a trial. For nonnegative integers and , let denote the probability that wins points before wins points. Computing is an historically famous problem, known as the problem of points, that was solved by Pierre de Fermat and by Blaise Pascal.
Comment on the validity of the Bernoulli trial assumptions (independence of trials and constant probability of success) for games of sport that have a skill component as well as a random component.
There is an easy solution to the problem of points using the binomial distribution; this was essentially Pascal's solution. Let us pretend that trials are played, regardless of the outcomes, and let denote the number of trials that wins. By definition, has the binomial distribution with parameters and .
Show that wins trials before wins trials if and only if
Use the result of the previous exercise to show that
There is also an easy solution to the problem of points using the negative binomial distribution In a sense, this has to be the case, given the equivalence between the binomial and negative binomial processes. First, let us pretend that the games go on forever, regardless of the outcomes, and let denote the number of games needed for to win games. By definition, has the negative binomial distribution with parameters and .
Show that wins trials before wins trials if and only if
Use the result of the previous exercise to show that
Show that for fixed and , increases from 0 to 1 as increases from 0 to 1.
Show that
Show that for any , , and .
In the problem of points experiments, vary the parameters , , and , and note how the probability changes. For selected values of the parameters, run the simulation 1000 times with an update frequency of 10. Note the apparent convergence of the relative frequency to the probability.
Condition on the outcome of the first trial to derive the following recurrence relation and boundary conditions (this was essentially Fermat's solution):
Now let's study the number of trials needed for the problem of points experiment. Let denote the number of trials needed until either wins points or wins points, which occurs first. The negative binomial variables provide an easy derivation of the distribution . Again, imagine that we continue the trials indefinitely. Let denote the number of trials needed for to win points, and let denote the number of trials needed for to win points.
Show that for
The special case of the problem of points experiment with is important, because it corresponds to and playing a best of game series. That is, the first player to win games wins the series. Such series, especially when are frequently used in championship tournaments. We will let denote the probability that player wins the series. From our general results with the problem of points, we have
Suppose that . Explicitly find the probability that team wins in each of the following cases:
In the problem of points experiments, vary the parameters , , and (keeping ), and note how the probability changes. Now simulate a best of 5 series by selecting , . Run the experiment 1000 times, updating every 10 runs. Note the apparent convergence of the relative frequency to the true probability.
Show that for any and .
In the problem of points experiments, vary the parameters , , and (keeping ), and note how the probability changes. Now simulate a best 7 series by selecting , . Run the experiment 1000 times, updating every 10 runs. Note the apparent convergence of the relative frequency o the true probability.
Suppose that Show that if and only if Interpret the result.
Let denote the number of trials in the series. Use Exercise 37 to show that
Explicitly compute the probability density function, expected value, and standard deviation for the number of games in a best of 7 series with the following values of :
The problem of points originated from a question posed by Chevalier de Mere, who was interested in the fair division of stakes when a game is interrupted. Specifically, suppose that players and each put up monetary units, and then play Bernoulli trials until one of them wins a specified number of trials. The winner then takes the entire fortune.
If the game is interrupted when needs to win more trials and needs to win more trials, argue that the fortune should be divided between and , respectively, as follows: