- Arithmetic β averages and sums
- Fractions & decimals β probability is expressed as fractions/decimals
π Statistics & Probability
Statistics is how we make sense of data in an uncertain world, from medical research to election polling. Understanding statistics helps you critically evaluate claims.
Statistics is the art of finding truth in data. But numbers mislead easily: one billionaire joining a group of average workers makes the mean salary enormous, but the median barely changes. The average is easily skewed by outliers.
Common mistake: The mean (average) is always the best measure of center.
Reality: When data has outliers, the median is more representative. Always ask: which measure best represents this data?
1. Descriptive Statistics
Descriptive statistics summarize and describe a data set. The key measures are measures of center and measures of spread.
Measures of Center
- Mean (average): \(\bar{x} = \dfrac{\sum x_i}{n}\) β sum of all values divided by the count
- Median: the middle value when data is sorted. If n is even, average the two middle values.
- Mode: the value that appears most often. A data set can have no mode, one mode, or multiple modes.
Median = 9 (4th value in sorted list)
Mode = 7 (appears twice)
Measures of Spread AP Exam
- Range: maximum β minimum
- Variance (\(\sigma^2\) or \(s^2\)): average of squared deviations from the mean
- Standard deviation (\(\sigma\) or \(s\)): square root of variance β typical distance from the mean
- IQR (Interquartile Range): Q3 β Q1 β spread of the middle 50% of data
Five-Number Summary and Box Plots
The five-number summary consists of: Minimum, Q1, Median (Q2), Q3, Maximum. A box plot displays these five values graphically.
2. Probability Fundamentals
Probability measures how likely an event is to occur. It ranges from 0 (impossible) to 1 (certain).
Complement Rule
Addition Rule AP Exam
If A and B are mutually exclusive (cannot both occur): \(P(A \cup B) = P(A) + P(B)\)
Multiplication Rule
If A and B are independent: \(P(A \cap B) = P(A) \times P(B)\)
\(P = \dfrac{4}{10} \times \dfrac{3}{9} = \dfrac{12}{90} = \dfrac{2}{15}\)
Conditional Probability
3. Counting Principles
Permutations β Order Matters
\(P(8,3) = \dfrac{8!}{5!} = 8 \times 7 \times 6 = 336\)
Combinations β Order Does Not Matter
\(C(8,3) = \dfrac{8!}{3! \cdot 5!} = \dfrac{8 \times 7 \times 6}{6} = 56\)
4. Probability Distributions
Normal Distribution
The normal distribution is bell-shaped and symmetric about the mean. Described by mean (\(\mu\)) and standard deviation (\(\sigma\)).
- About 68% of data falls within 1Ο of the mean
- About 95% of data falls within 2Ο of the mean
- About 99.7% of data falls within 3Ο of the mean
Z-Score (Standardized Score)
\(z = \dfrac{85 - 70}{10} = 1.5\) β the student is 1.5 standard deviations above the mean.
5. Practice Problems
- Data: 3, 5, 5, 8, 10, 12. Find mean, median, and mode.
- A fair die is rolled. What is the probability of rolling an even number?
- A card is drawn from a standard 52-card deck. Find P(King or Heart).
- How many 4-digit PIN codes are possible using digits 0β9 with no repetition?
- A class has 10 students. How many ways can a team of 4 be selected?
- IQ scores are normally distributed with ΞΌ = 100, Ο = 15. What percent of scores fall between 85 and 115?
- Mean = \(\dfrac{43}{6} \approx 7.17\); Median = \(\dfrac{5+8}{2} = 6.5\); Mode = 5
- \(P = \dfrac{3}{6} = \dfrac{1}{2}\)
- \(P(\text{K or H}) = \dfrac{4}{52} + \dfrac{13}{52} - \dfrac{1}{52} = \dfrac{16}{52} = \dfrac{4}{13}\)
- \(P(10,4) = 10 \times 9 \times 8 \times 7 = 5{,}040\)
- \(C(10,4) = 210\)
- 85 to 115 is within 1Ο β approximately 68%
How do you calculate probability for continuous data like height or weight?
Discrete probability uses counting, but continuous probability distributions require the area under a curve β which is exactly what integration (calculus) computes. That's the bridge from statistics to calculus.
CalculusStatistics is the foundation of data science and scientific reasoning. Mastering mean, variance, and probability now unlocks advanced topics like hypothesis testing, regression, and machine learning fundamentals.
- Variance = mean of squared deviations / Standard deviation = βvariance
- Addition rule: P(AβͺB) = P(A)+P(B)βP(Aβ©B)
- Independent events: P(Aβ©B) = P(A)ΓP(B)
- Mean is sensitive to outliers; median is robust
Review this material at increasing intervals to commit it to long-term memory.