Summary: Using the Central Limit Theorem

Key Concepts

  • Even if a distribution is non-normal, if the sample size is sufficiently large, a normal distribution can be used to calculate probabilities involving sample means and sample sums. This is even true for exponential distributions and uniform distributions.
  • As the sample size gets larger, the mean of the sample means approaches the population mean. This is due to the law of large numbers.
  • The central limit theorem (CLT) is not for calculating probabilities involving an individual value.

Glossary

central limit theorem (for means and sums): given a random variable (RV) with known mean μ and known standard deviation, σ, we are sampling with size n, and we are interested in two new RVs: the sample mean, ¯X¯¯¯¯¯X and the sample sum x. If the size (n) of the sample is sufficiently large, then ¯XN(M,σn) and XN(nμ,nσ). If the size (n) of the sample is sufficiently large, then the distribution of the sample means and the distribution of the sample sums will approximate a normal distribution regardless of the shape of the population. The mean of the sample means will equal the population mean, and the mean of the sample sums will equal n times the population mean. The standard deviation of the distribution of the sample means, σn, is called the standard error of the mean.

exponential distribution: a continuous random variable (RV) that appears when we are interested in the intervals of time between some random events, for example, the length of time between emergency arrivals at a hospital; the notation is XExp(m). The mean is μ=1m and the standard deviation is σ=1m. The probability density function is f(x)=me(mx),x0 and the cumulative distribution function is P(Xx)=1e(mx).

uniform distribution: a continuous random variable (RV) that has equally likely outcomes over the domain, [latex]a