]>
A multinomial trials process is a sequence of independent, identically distributed random variables each taking possible values. Thus, the multinomial trials process is a simple generalization of the Bernoulli trials process (which corresponds to ). For simplicity, we will denote the set of outcomes by , and we will denote the common probability density function of the trial variables by
Of course for each and . In statistical terms, the sequence is formed by sampling from the distribution.
As with our discussion of the binomial distribution, we are interested in the random variables that count the number of times each outcome occurred. Thus, let
Note that so if we know the values of of the counting variables, we can find the value of the remaining counting variable. As with any counting variable, we can express as a sum of indicator variables:
Show that .
Basic arguments using independence and combinatorics can be used to derive the joint, marginal, and conditional densities of the counting variables. In particular, recall the definition of the multinomial coefficient: for positive integers with ,
Show that for positive integers with ,
The distribution of is called the multinomial distribution with parameters and . We also say that has this distribution (recall that the values of of the counting variables determine the value of the remaining variable). Usually, it is clear from context which meaning of the term multinomial distribution is intended. Again, the ordinary binomial distribution corresponds to .
Show that has the binomial distribution with parameters and :
The multinomial distribution is preserved when the counting variables are combined. Specifically, suppose that . is a partition of the index set into nonempty subsets. For let
Show that has the multinomial distribution with parameters and .
The multinomial distribution is also preserved when some of the counting variables are observed. Specifically, suppose that is a partition of the index set into nonempty subsets. Suppose that is a sequence of nonnegative integers, indexed by such that
Let
Show that the conditional distribution of . given . is multinomial with parameters and .
Combinations of the basic results in Exercise 4 and Exercise 5 can be used to compute any marginal or conditional distributions.
We will compute the mean, variance, covariance, and correlation of the counting variables. Results from the binomial distribution and the representation in terms of indicator variables are the main tools.
Show that
Show that for distinct and ,
From Exercise 7, note that the number of times outcome occurs and the number of times outcome occurs are negatively correlated, but the correlation does not depend on or . Does this seem reasonable?
Use the result of Exercise 7 to show that if , then the number of times outcome 1 occurs and the number of times outcome 2 occurs are perfectly correlated. Does this seem reasonable?
In the dice experiment, select the number of aces. For each die distribution, start with a single die and add dice one at a time, noting the shape of the density function and the size and location of the mean/standard deviation bar. When you get to 10 dice, run the simulation with an update frequency of 10. Note the apparent convergence of the relative frequency function to the density function, and the empirical moments to the distribution moments.
Suppose that we throw 10 standard, fair dice. Find the probability of each of the following events:
Suppose that we roll 4 ace-six flat dice (faces 1 and 6 have probability each; faces 2, 3, 4, and 5 have probability each). Find the joint probability density function of the number of times each score occurs.
In the dice experiment, select 4 ace-six flats. Run the experiment 500 times, updating after each run. Compute the joint relative frequency function of the number times each score occurs. Compare the relative frequency function with the true probability density function.
Suppose that we roll 20 ace-six flat dice. Find the covariance and correlation of the number of 1's and the number of 2's.
In the dice experiment, select 20 ace-six flat dice. Run the experiment 500 times, updating after each run. Compute the empirical covariance and correlation of the number of 1's and the number of 2's. Compare the results with the theoretical results computed in Problem 13.