In this section and the next, we will introduce some basic probability density functions which are used to describe common phenomena. These will be presented in two different groups: discrete density functions, generally used in situations where we can count off the possible outcomes of an experiment, will be discussed in this section. Continuous distributions, generally used in situations where there is a continuum of possible outcomes – a collection of possible outcomes requiring at least an interval of
Probabilistic processes with discrete density functions have the feature that the probability of an event can be determined by just adding up the probabilities for the outcomes that comprise the event.
There are several discrete distributions which occur routinely, with which anyone wishing to use mathematical statistics should be familiar.
Uniform Densities
When there are a finite number of possible outcomes to an experiment, and each outcome is equally likely, the probability distribution is said to be uniform. Thus, for example, on rolling a standard die the probabilities of each outcome
Similarly, consider a standard well-shuffled deck (fifty-two cards,
These are uniform distributions, where in each case we have a finite number (call it
In the special case where there are
(i.e. the average of the minimum and maximum data values). To compute the variance we first find that
so that the variance is
Example: A fair twenty-sided die has faces numbered from through . What is the probability that a roll will yield a value greater than ?
The die being fair means that each of the possible values is equally likely. If
Bernoulli Densities
Any yes-no experiment, with two mutually exclusive possible outcomes, is called a Bernoulli experiment. Assigning the values
The mean of such a random variable
and the variance is
The moment-generating function for bernoulli densities can be explicitly computed as follows.
Moment Generating Function
If
An example of such a Bernoulli experiment is a coin-flip. Let us call this experiment
An example where the two probabilities are not the same is the roll of a standard die, where success is getting the value
Any lottery is at its core a Bernoulli experiment, where one either wins or loses. If we think of left-right handedness as a genetic lottery denoted by
Examples of Bernoulli Densities
-
As of 2020,
of registered voters identified as Democrat, while identified as Republican. Thus of registered voters identified as either Democrat or Republican. -
Approximately
of humans are left-handed. Approximately of Americans identify as homosexual. Approximately of humans are red-heads, but in Scotland the incidence of red hair is roughly .
Binomial Densities
Consider the new experiment of performing a Bernoulli experiment a bunch of times and counting the number of ‘successes’ (the number of times
For example, consider the experiment of flipping a coin twice and letting
If we know that each of the four possible outcomes on flipping a coin twice is equally likely, the probability distribution for this experiment is
To be precise, if we know that the second flip is independent of the first then, letting
Thus
This technique can be used to show that
To extract the essential features of the above experiment and put them to work, we ask that the Bernoulli trials are mutually independent, and each has the same probability of yielding a
Binomial. We seek the probability distribution for this experiment.
To this end, consider the following special case.
Special Case: Determination of the probability density for the sum of three Bernoulli trials
We now let
More generally, if we ask for the probability that
We use this to find the full probability density for
This is summarized (for
We can extract the salient features of the above example to determine the probability density function for an experiment
There are
These values are the terms in the binomial expansion of
and as such give the density function its name. We denote this by
The mean and variance of this density are determined as follows. The mean is
The variance is similarly computed using
Thus
The moment-generating function for binomial densities can be explicitly computed, and used to verify the above determinations of
Moment Generating Function
Note that this is the
From this moment generating function we have
so that
and
respectively. These are immediately seen to agree with the computations above.
Example: Studies show that the probability that a newborn is male is . If seventeen babies are born at a hospital on a specific day, what is the probability that ten are boys? What is the probability that eleven are girls?
Let
The probability that eleven are girls is precisely the probability that six are boys. This is
Binomial Densities with Spreadsheets
Binomial densities occur often enough that they are built in to spreadsheet applications. In the following, we see how to use a spreadsheet to obtain probabilities
Click here to open a copy of this so you can experiment with it. You will need to be signed in to a Google account.
Geometric Densities
If we run a sequence of mutually independent, identically distributed Bernoulli trials until we get a
The density function of such an experiment is not difficult to determine. Suppose that the underlying Bernoulli experiment (call it
Thus
Thus we have the density function,
for
The mean and variance of this density are determined as follows. The mean is
so that when
Thus
Example: Hitting Home-Runs
A star baseball player hits a home-run about once in every twenty-five at-bats. A baseball playing child idolized this star, an the parents take the child to watch the star play. What is the probability that the star will hit a home-run in his first at bat? If the star is at bat four times during the game, what is the probability that the star will hit a home-run during the game?
The probability that the star hits his first home-run in his
Thus the probability that the star hits his first home-run in his first at bat is
Hitting a home-run during the game means precisely that he hit his first home-run in one of his first four at-bats. Thus the probability that he hits a home-run during the game is
That is, there is about a
Geometric Densities with Spreadsheets
Geometric densities are part of a larger family of densities called Negative Binomial, and are built in to spreadsheet applications. Here we see how to use a spreadsheet to obtain probabilities
Click here to open a copy of this so you can experiment with it. You will need to be signed in to a Google account.
Poisson Densities
Poisson densities describe the number of events occurring in a fixed interval of time, if these events occur at a known average rate, and the occurrence of an event does not influence the waiting time for subsequent events. Poisson densities can similarly be used for other specified intervals or regions, such as distance, area or volume. Thus we might use a Poisson density to understand the number of calls received by a service center in an hour, or auto accidents on a stretch of road in a month, or hawks landing on a field in day.
It is a bit beyond what we are after here to derive an exact formula for a Poisson density, given a mathematically precise formulation of the above description. Instead, we simply give the density function and properties, and follow with an example of its use.
A Poisson density function with parameter
to integer
The mean of
Given this, Poisson density functions are often written as
(using
The variance of Poisson density functions are similarly determined. We use
Thus we have
The moment generating function for a Poisson random variable can be explicitly computed.
Moment Generating Function
Let
Poisson Densities with Spreadsheets
Binomial densities occur often enough that they are built in to spreadsheet applications. In the following, we see how to use a spreadsheet to obtain probabilities
Click here to open a copy of this so you can experiment with it. You will need to be signed in to a Google account.
Example: If the number of customers entering a shop in its first hour of business each day is modeled by , what is the probability that on a given day fewer than customers will enter in the first hour? What is the probability that at least customers will enter in the first hour?
The probability that fewer than
That is, there is about a
The probability that at least
This is an infinite series which can be summed explicitly. But we will use here a computational trick which can be accessed under many circumstances. It is worth knowing. The idea is that the condition that at least
Following the preceding paragraph, we have
The following subsection describes a use of the Poisson density to approximate the binomial density under certain reasonable conditions. As such, because the formula for the Poisson density is more simple than that of the binomial density, it can be used to get good estimates when working by hand or with a calculator. When using a spreadsheet or statistical software or a computer algebra system, this is almost never used as computers have trivialized the determination of the binomial density.
Poisson Approximation of Binomial
If
Thus, for example, if
This is not so simple to estimate by hand. We obtain
This is off by less than