📚Study Guide: Probability, Random Variables, and Distributions
Unit 4: Probability, Random Variables, and Distributions
Probability forms the mathematical foundation of statistical inference. This unit covers basic probability rules, conditional probability, independence, and the concept of random variables. You will study probability distributions for discrete and continuous random variables, computing expected values and standard deviations. The binomial and geometric distributions model specific types of discrete scenarios, while the normal distribution is the cornerstone continuous model. Understanding how to combine random variables and approximate binomial probabilities with normal distributions prepares you for inference. The AP exam frequently tests probability through two-way tables, tree diagrams, and contextual word problems.
Key Concepts
- Basic Probability Rules: Complement P(A^c) = 1 - P(A); addition P(A or B) = P(A) + P(B) - P(A and B); multiplication for independent events P(A and B) = P(A)*P(B).
- Conditional Probability: P(A|B) = P(A and B) / P(B). Tree diagrams and two-way tables are excellent tools.
- Independence: A and B are independent if P(A|B) = P(A), or equivalently P(A and B) = P(A)*P(B).
- Discrete Random Variables: Defined by a probability distribution table; mean (expected value) mu = sum[x*P(x)]; variance = sum[(x-mu)^2*P(x)].
- Continuous Random Variables: Described by density curves; probability is area under the curve.
- Binomial Distribution: Fixed trials n, independent, two outcomes, constant probability p. Mean = np; SD = sqrt[np(1-p)].
- Geometric Distribution: Trials until first success. Mean = 1/p.
- Normal Distribution: Symmetric, bell-shaped, defined by mean and standard deviation; use z-scores and standard normal tables.
Vocabulary
- Sample space: The set of all possible outcomes of a random process.
- Mutually exclusive: Events that cannot occur simultaneously; P(A and B) = 0.
- Random variable: A variable taking numerical values determined by the outcome of a random process.
- Probability distribution: A description of all possible values of a random variable and their probabilities.
- Expected value: The long-run average value of a random variable, equivalent to the mean of its distribution.
- Standard deviation of a random variable: A measure of the spread of its probability distribution.
Formulas
- P(A or B) = P(A) + P(B) - P(A and B)
- P(A|B) = P(A and B) / P(B)
- E(X) = mu = sum x*P(x)
- Var(X) = sum (x - mu)^2 * P(x)
- Binomial: P(X=k) = C(n,k) * p^k * (1-p)^(n-k)
- Binomial mean: mu = np; SD = sqrt[np(1-p)]
- Geometric: P(X=k) = (1-p)^(k-1) * p
- Geometric mean: mu = 1/p
- z = (x - mu) / sigma
Common Mistakes
- Treating dependent events as independent and simply multiplying their probabilities.
- Using the binomial formula when trials are not independent or probabilities change between trials.
- Finding P(X > k) for a discrete distribution and forgetting to adjust for continuity when approximating with a normal distribution.
- Confusing mutually exclusive events with independent events; mutually exclusive events with positive probability are dependent.
AP Exam Strategies
- For complex probability problems, draw a two-way table or tree diagram before writing any formulas.
- Always check the binomial conditions: binary outcomes, independent trials, fixed n, same probability p.
- When using normal approximation to binomial, verify np >= 10 and n(1-p) >= 10, then apply continuity correction.
- State the distribution and parameters (e.g., Binomial(n=20, p=0.3)) before computing probabilities.
Real-World Applications
- Insurance: Probability distributions model claim frequencies and set premium prices.
- Quality Assurance: Binomial distributions calculate the probability of defective items in batches.
- Genetics: Probability rules predict inheritance patterns and genetic trait expression likelihoods.