Mean and Standard Deviation of Discrete Random Variables
Characteristics of a Probability Distribution
The most important characteristics of any probability distribution are the mean (or average value) and the standard deviation (a measure of how spread out the values are). The examples below illustrate how to calculate the mean and the standard deviation of a random variable. A common symbol for the mean is \(\mu\) (mu), the lowercase Greek letter m. A common symbol for the standard deviation is \(\sigma\) (sigma), the lowercase Greek letter s.
Calculating the Mean of a Probability Distribution
Recall the probability distribution of the 2-coin experiment. Calculate the mean of this distribution.
If we look at the graph of the 2-coin toss experiment (shown below), we can easily reason that the mean value is located right in the middle of the graph, namely, at \(x = 1\). This is intuitively true. Here is how we can calculate it:
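To calculate the population mean, multiply each possible outcome of the random variable \(x\) by its associated probability and then sum over all possible values of \(x\):
\[\mu = \sum_x x\,p(x) = 0\left(\tfrac{1}{4}\right) + 1\left(\tfrac{1}{2}\right) + 2\left(\tfrac{1}{4}\right) = 1\]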
Mean Value or Expected Value
The mean value, or expected value, of a discrete random variable \(x\) is given by the following equation:
\[E(x) = \mu = \sum_x x\,p(x)\]
This definition is equivalent to the simpler one you have learned before:
\[\mu = \frac{1}{n}\sum_{i=1}^{n} x_i\]
However, the simpler definition would not be usable for many of the probability distributions in statistics.
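The weighted-sum definition can be checked against the simpler average with a short Python sketch. It assumes that the random variable in the 2-coin experiment is the number of heads and that both coins are fair; the names `two_coin` and `expected_value` are illustrative only.

```python
from fractions import Fraction

# Assumed distribution: number of heads in two tosses of a fair coin.
two_coin = {0: Fraction(1, 4), 1: Fraction(1, 2), 2: Fraction(1, 4)}

def expected_value(dist):
    """Return the sum of x * p(x) over every value x in the distribution."""
    return sum(x * p for x, p in dist.items())

mu = expected_value(two_coin)

# The simpler definition: average the number of heads over the four
# equally likely outcomes HH, HT, TH, TT.
heads_per_outcome = [2, 1, 1, 0]
simple_mean = Fraction(sum(heads_per_outcome), len(heads_per_outcome))

print(mu, simple_mean)  # 1 1
```

Both approaches agree here because every outcome is equally likely. When outcomes are not equally likely, only the weighted sum applies directly.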
Calculating the Expected Value
An insurance company sells life insurance of $15,000 for a premium of $310 per year. Actuarial tables show that the probability of death in the year following the purchase of this policy is 0.1%. What is the expected gain for this type of policy?
There are two simple events here: either the customer will live this year or will die. The probability of death, as given by the problem, is 0.001, and the probability that the customer will live is \(1 - 0.001 = 0.999\). The company’s gain from this policy in the year after the purchase is the random variable \(x\), which can have the values shown in the table below.
Figure: Analysis of the possible outcomes of an insurance policy.
Remember, if the customer lives, the company gains $310 as a profit. If the customer dies, the company "gains" $310 − $15,000 = −$14,690, or in other words, it loses $14,690. Therefore, the expected profit can be calculated as follows:
\[E(x) = 310(0.999) + (-14{,}690)(0.001) = 309.69 - 14.69 = 295\]
This tells us that if the company were to sell a very large number of the 1-year $15,000 policies to many people, it would make, on average, a profit of $295 per sale.
Another approach is to calculate the expected payout, not the expected gain:
\[E(\text{payout}) = 0(0.999) + 15{,}000(0.001) = 15\]
Since the company charges $310 and expects to pay out $15, the average profit for the company is $295 per policy.
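Both approaches can be reproduced with a few lines of Python. This is a minimal sketch using only the figures stated in the problem; the variable names are illustrative.

```python
p_death = 0.001          # probability the customer dies within the year
p_live = 1 - p_death     # probability the customer lives
premium = 310            # dollars collected per policy
benefit = 15_000         # dollars paid out if the customer dies

# Approach 1: expected gain, weighting each possible gain by its probability.
expected_gain = premium * p_live + (premium - benefit) * p_death

# Approach 2: expected payout, then subtract it from the premium.
expected_payout = 0 * p_live + benefit * p_death

print(round(expected_gain, 2), round(premium - expected_payout, 2))  # 295.0 295.0
```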
Sometimes, we are interested in measuring not just the expected value of a random variable, which describes the central tendency of its probability distribution, but also the variability of that distribution. To do this, we first need to define the population variance, or \(\sigma^2\). It is the average of the squared distance of the values of the random variable from the mean value, \(\mu\). The formal definitions of variance and standard deviation are shown below.
The variance of a discrete random variable is given by the following formula:
\[\sigma^2 = \sum_x (x - \mu)^2\,p(x)\]
The Standard Deviation
The square root of the variance, or, in other words, the square root of \(\sigma^2\), is the standard deviation of a discrete random variable:
\[\sigma = \sqrt{\sigma^2}\]
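As a sketch of how these definitions translate into a calculation, the Python below computes the variance and standard deviation of the 2-coin distribution. It assumes the random variable is the number of heads in two tosses of a fair coin; the function names are illustrative.

```python
import math

def mean(dist):
    """Sum of x * p(x) over the distribution."""
    return sum(x * p for x, p in dist.items())

def variance(dist):
    """Sum of (x - mu)^2 * p(x): the probability-weighted squared distance from the mean."""
    mu = mean(dist)
    return sum((x - mu) ** 2 * p for x, p in dist.items())

def std_dev(dist):
    """Square root of the variance."""
    return math.sqrt(variance(dist))

# Assumed distribution: number of heads in two tosses of a fair coin.
two_coin = {0: 0.25, 1: 0.5, 2: 0.25}

print(variance(two_coin))            # 0.5
print(round(std_dev(two_coin), 4))   # 0.7071
```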
Finding the Mean, Standard Deviation, and Interpreting the Results
A university medical research center finds out that treatment of skin cancer by the use of chemotherapy has a success rate of 70%. Suppose five patients are treated with chemotherapy. The probability distribution of successful cures of the five patients is given in the table below:
Figure: Probability distribution of cancer cures of five patients.
1) Find \(\mu\).
To find \(\mu\), we use the following formula:
\[\mu = \sum_x x\,p(x) = 3.5\]
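2) Find \(\sigma\).
To find \(\sigma\), we first calculate the variance of \(x\):
\[\sigma^2 = \sum_x (x - \mu)^2\,p(x) \approx 1.05\]
Now we calculate the standard deviation:
\[\sigma = \sqrt{\sigma^2} \approx \sqrt{1.05} \approx 1.02\]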
3) Graph \(p(x)\) and explain how \(\mu\) and \(\sigma\) can be used to describe \(p(x)\).
The graph of \(p(x)\) is shown below:
We can use the mean, or \(\mu\), and the standard deviation, or \(\sigma\), to describe \(p(x)\) in the same way we used \(\bar{x}\) and \(s\) to describe the relative frequency distribution. Notice that \(\mu = 3.5\) is the center of the probability distribution. In other words, if the five cancer patients receive chemotherapy treatment, we expect the number of them who are cured to be near 3.5. The standard deviation, which is \(\sigma \approx 1.02\) in this case, measures the spread of the probability distribution \(p(x)\).
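The probability table for this example is not reproduced here, so the following Python sketch rebuilds a distribution under an explicit assumption: each of the five patients is cured independently with probability 0.7, which gives a binomial distribution. The values it prints may differ slightly in rounding from the original table.

```python
import math

n, p = 5, 0.7  # five patients, 70% success rate (from the example)

# Assumption: patients respond independently, so
# p(x) = C(n, x) * p^x * (1 - p)^(n - x).
dist = {x: math.comb(n, x) * p**x * (1 - p) ** (n - x) for x in range(n + 1)}

mu = sum(x * px for x, px in dist.items())
var = sum((x - mu) ** 2 * px for x, px in dist.items())
sigma = math.sqrt(var)

for x, px in dist.items():
    print(x, round(px, 3))
print(round(mu, 2), round(var, 2), round(sigma, 2))  # 3.5 1.05 1.02
```

Under this assumption the mean agrees with the value of 3.5 quoted in the example.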
A random variable has the following probability distribution:
Calculate the mean of X.
Mean of X is
Calculate the variance of X.
Calculate the standard deviation of X.
Standard deviation of X is
- Consider the following probability distribution:
Figure: The probability distribution for question 1.
a. Find the mean of the distribution.
b. Find the variance.
c. Find the standard deviation.
- An officer at a prison was studying recidivism among the prison inmates. The officer questioned each inmate to find out how many times the inmate had been convicted prior to the inmate’s current conviction. The officer came up with the following table that shows the relative frequencies of X, the number of times previously convicted:
Figure: The probability distribution for question 2.
If we regard the relative frequencies as approximate probabilities, what is the expected value of the number of previous convictions of an inmate?
- Suppose a random variable has the following distribution table:
Find the expected value.
- The possible values for a certain random variable are 1, 2, 3, and 8. Part of its distribution table is given below. Fill in the blank, and find the expected value.
- Suppose \(n\) draws are made at random with replacement from a box of numbered balls. Let \(S\) be the sum of the draws. Show that the expected value of \(S\) is equal to \(n \times\) (the average of the box).
- A die is thrown twice. Let be the number of spots on the first throw and be the number of spots on the second throw. Find .
- Find the expected value of the random variable with the following distribution table:
- The possible values for a certain random variable are 1, 3, 4, and 8. Part of its distribution table is given below. Fill in the blank, and find the expected value.
- Suppose X represents the number of children in a family. The following is the probability distribution for X for families with particular characteristics:
a. Is this a valid probability distribution? Explain.
b. What is the expected value of X? What does this mean?
- Suppose the probability that you get an A in any class is 0.4 and the probability that you get a B in the class is 0.6. To construct a grade point average, an A is worth 4.0 and a B is worth 3.0.
a. Is it possible that you will get a C in this class? Explain.
b. What is the expected value of your grade point average?
- Suppose you have to take the bus to school. The probability that you will have to wait for the bus is 0.25. If you don’t have to wait for the bus, the commute takes 20 minutes, but if you have to wait for the bus, the commute takes 30 minutes. What is the expected value of the time it takes you to commute to school?