Probability distributions are a central topic in IB Mathematics Analysis and Approaches at SL. They provide a mathematical framework for describing random phenomena — from predicting how many students pass an exam to modelling heights in a population. The two distributions you must master are the binomial distribution (for counting successes in a fixed number of trials) and the normal distribution (for continuous data that clusters around a mean).
This guide covers the theory, formulas, and — critically — the GDC (graphing display calculator) techniques you need for the exam. In Paper 2, your GDC is your best friend for probability calculations. The IB formula booklet contains the key formulas, but for most probability questions, you will use your GDC rather than compute by hand.
Understanding when to use each distribution and how to set up the problem is more important than the calculations themselves — this is where marks are won and lost.
Core Concepts
Discrete vs Continuous Random Variables
A discrete random variable takes specific, countable values (e.g., number of heads in 10 coin flips). A continuous random variable can take any value in an interval (e.g., height, temperature).
- The binomial distribution is discrete.
- The normal distribution is continuous.
The Binomial Distribution
A random variable follows a binomial distribution, written , if:
- There are a fixed number of independent trials
- Each trial has exactly two outcomes: success (probability ) or failure (probability )
- The probability of success is constant for each trial
- The trials are independent
The probability of exactly successes is:
where is the binomial coefficient.
This formula is in your IB formula booklet, but in practice you will almost always use your GDC.
Key statistics:
- Expected value (mean):
- Variance:
- Standard deviation:
The expected value is particularly important and frequently tested.
GDC Techniques for Binomial
Your GDC has two key functions for binomial calculations:
- Binomial PDF (probability density function): — the probability of exactly successes
- Binomial CDF (cumulative distribution function): — the probability of at most successes
Using these two functions, you can calculate any binomial probability:
| Probability | GDC approach |
|---|---|
| binomPDF() | |
| binomCDF() | |
| binomCDF() | |
| binomCDF() | |
| binomCDF() | |
| binomCDF() binomCDF() |
The Normal Distribution
A continuous random variable follows a normal distribution, written , where is the mean and is the standard deviation.
Properties of the normal distribution:
- The curve is bell-shaped and symmetric about
- The mean, median, and mode are all equal to
- Approximately of data falls within
- Approximately falls within
- Approximately falls within
- The total area under the curve equals 1
Standardisation and Z-scores
The standard normal distribution has and , written .
Any normal variable can be standardised using the z-score formula:
The z-score tells you how many standard deviations a value is from the mean. A z-score of 2 means the value is 2 standard deviations above the mean.
While you can use z-scores and standard normal tables, your GDC can work with any normal distribution directly — you do not need to standardise first on Paper 2.
GDC Techniques for Normal
Your GDC provides:
-
Normal CDF: for
- For : use lower bound (or equivalent)
- For : use upper bound
-
Inverse Normal: Given a probability , find the value such that
- This is used for questions like "find the value below which 90% of the data falls"
Inverse Normal Problems
Inverse normal problems give you a probability and ask you to find the corresponding -value. The key is ensuring your probability is a left-tail probability (i.e., ) before using the inverse normal function.
Example: If and , find .
Since , we have .
Using inverse normal with area , , : (3 s.f.)
Combining Both Distributions
Some IB questions require you to recognise which distribution to use. Ask yourself:
- Am I counting successes in fixed trials? → Binomial
- Am I measuring a continuous quantity? → Normal
- Does the question mention "probability of success," "independent trials"? → Binomial
- Does it mention "mean and standard deviation" of a measured quantity? → Normal
Strategy Tips
Tip 1: Check the Conditions for Binomial
Before using the binomial distribution, verify all four conditions (fixed , two outcomes, constant , independence). If any condition is violated, the binomial model may not be appropriate.
Tip 2: Sketch a Normal Curve
For normal distribution questions, always sketch a bell curve, mark the mean, and shade the required area. This helps you identify whether you need a left-tail, right-tail, or central probability, and prevents errors when using the GDC.
Tip 3: Convert Probability Statements Carefully
The most common source of error is misinterpreting inequality signs. is NOT the same as for discrete distributions. For continuous distributions, since the probability at a single point is zero.
Tip 4: Use Complement for "At Least" Problems
For binomial problems: . The complement approach is usually easier than summing individual probabilities.
Tip 5: Write Down Your GDC Input
In the exam, write exactly what you enter into your GDC (e.g., "binomCDF(10, 0.3, 4) = 0.850"). This shows your method clearly and earns marks even if you make a keying error.
Worked Example: Example 1
A fair coin is tossed 8 times. Find the probability of getting exactly 5 heads.
Let = number of heads. Then .
(3 s.f.)
On GDC: binomPDF(8, 0.5, 5) = 0.219.
Worked Example: Example 2
A factory produces light bulbs where are defective. A random sample of 20 bulbs is taken. Find: (a) the probability that exactly 2 are defective (b) the probability that at most 2 are defective (c) the expected number of defective bulbs
Let = number of defective bulbs. .
(a) binomPDF(20, 0.05, 2) (3 s.f.)
(b) binomCDF(20, 0.05, 2) (3 s.f.)
(c)
Worked Example: Example 3
The heights of adult males in a city are normally distributed with mean cm and standard deviation cm.
(a) Find the probability that a randomly selected male is taller than cm. (b) Find the height below which of males fall.
Let .
(a)
Using GDC normalCDF(185, , 175, 8) or equivalently normalCDF(, 185, 175, 8):
(3 s.f.)
Alternatively, , and .
(b) Using inverse normal with area , , :
cm (1 d.p.)
So of adult males are shorter than cm.
Worked Example: Example 4
The scores on a test are normally distributed. If of students scored above 78 and scored below 52, find the mean and standard deviation.
means , so and from inverse normal, .
, so and .
From equation 1: ... (1) From equation 2: ... (2)
Subtracting (2) from (1): , so (3 s.f.).
Substituting back: (3 s.f.).
Practice Problems
Problem 1
A multiple-choice test has 15 questions, each with 4 options. A student guesses every answer. Find the probability of getting at least 5 questions correct.
Problem 2
The lifetime of a battery is normally distributed with mean 500 hours and standard deviation 40 hours. Find the probability that a battery lasts between 450 and 550 hours.
Problem 3
In a binomial experiment, and . Find and .
Problem 4
The weights of apples are normally distributed with mean 150g and standard deviation 20g. Apples weighing less than 120g are graded "small." What percentage of apples are graded small?
Problem 5
A medical test gives a positive result for of people with a certain condition, and of people without it (false positive). If of the population has the condition, find the probability that a person who tests positive actually has the condition.
Want to check your answers and get step-by-step solutions?
Common Mistakes
-
Using the wrong distribution. Make sure you identify whether the scenario is binomial (counting successes) or normal (measuring a continuous variable). Read the question carefully.
-
Inequality errors with discrete distributions. For : . The former includes , the latter does not. Be precise with , , , .
-
Forgetting to convert for inverse normal. Inverse normal gives . If the question says , you need to use .
-
Confusing and . The notation uses the variance as the second parameter, but your GDC typically asks for (standard deviation). Mixing these up is a common and costly error.
-
Not writing GDC inputs. You must show what you entered into your GDC. Just writing the answer without method earns zero method marks.
-
Assuming normality without checking. The binomial can be approximated by the normal for large with and , but at SL this approximation is not required. Stick with the exact distribution.
Frequently Asked Questions
What formulas do I need to know for the exam?
The binomial probability formula, expected value, and variance are in the formula booklet. The normal distribution PDF is also given, but you will never need to use it directly — your GDC handles all calculations. Focus on knowing when to use each distribution.
How do I decide between using binomPDF and binomCDF?
Use binomPDF for (exactly successes). Use binomCDF for (at most successes). For "at least" or "more than" problems, use the complement with binomCDF.
Do I always need to standardise for normal distribution questions?
No. Modern GDCs can compute probabilities for any directly. Standardisation (z-scores) is still useful for understanding and for Paper 1 (no calculator), but on Paper 2 you can input and directly.
What does the 68-95-99.7 rule mean, and when should I use it?
This rule (also called the empirical rule) states that approximately 68%, 95%, and 99.7% of data in a normal distribution falls within 1, 2, and 3 standard deviations of the mean, respectively. Use it for quick estimates and to check whether your GDC answer is reasonable.
Can the binomial distribution be used when trials are not independent?
Strictly, no — independence is one of the four conditions. However, if the population is very large relative to the sample size (typically when the sample is less than 10% of the population), the dependence is negligible and the binomial is a good approximation.
Key Takeaways
The binomial distribution counts successes. Use when you have a fixed number of independent trials, each with the same probability of success.
The normal distribution models continuous data. Use for measurements that are symmetric and bell-shaped.
Your GDC is essential. Learn to use binomPDF, binomCDF, normalCDF, and inverse normal fluently. Write down your GDC inputs for full marks.
Sketch before you calculate. For normal distribution questions, draw the bell curve, mark the mean, and shade the required region.
Check conditions before applying a distribution. Verify the four binomial conditions or the appropriateness of a normal model.
Use the complement for "at least" and "more than" problems. simplifies many calculations.
