How do I revise Probability Generating Functions for A-Level?

Use MasteryMind's Probability Generating Functions revision notes alongside practice questions and past papers. Focus on key definitions, worked examples, and exam-style questions to build confidence for your OCR A-Level exam.

Is this Probability Generating Functions guide aligned to the OCR specification?

Yes. This revision guide is specifically written for the OCR A-Level specification and covers all required content for Probability Generating Functions.

Probability Generating Functions Revision Notes

Subject: Further Mathematics | Level: A-Level | Exam Board: OCR

Master one of the most powerful tools in A-Level Further Maths. This guide breaks down Probability Generating Functions (PGFs), showing you how to encode entire distributions into a single function, then differentiate to find the mean and variance. It’s your key to unlocking top marks in OCR exam questions on discrete distributions.

Revision Notes & Key Concepts

![Header image for Probability Generating Functions](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_3f0e928d-1658-402d-b473-a12162b896da/header_image.png)

## Overview

Probability Generating Functions (PGFs) are a cornerstone of advanced probability theory, covered in section 3.6 of the OCR A-Level Further Mathematics specification. A PGF is a compact way to represent a discrete probability distribution. Instead of working with a table of probabilities, we encode the entire distribution into a single polynomial or power series, G(t).

The value of PGFs lies in their ability to simplify complex calculations. By differentiating G(t) and evaluating it at t=1, candidates can swiftly calculate the mean and variance of the distribution. PGFs also provide an elegant method for finding the distribution of the sum of independent random variables using the convolution theorem.

Exam questions typically require candidates to derive PGFs for standard distributions (like Binomial, Poisson, and Geometric), use them to find moments, and apply the convolution theorem to solve problems. Mastery of PGFs demonstrates a deep understanding of the algebraic structure of probability, a skill highly rewarded by examiners.

[Podcast: Mastering Probability Generating Functions](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_3f0e928d-1658-402d-b473-a12162b896da/probability_generating_functions_podcast.mp3)

## Key Concepts

### Concept 1: The Definition and Purpose of a PGF

A Probability Generating Function, G(t), is a power series that ‘encodes’ the probability mass function (PMF) of a discrete random variable X. It is defined as:

**G(t) = E(tˣ) = Σ P(X=x) tˣ**

Here, the summation is over all possible values, x, that the random variable X can take. The variable ‘t’ is a dummy variable, a placeholder that allows us to create this function. Think of it as a clothes hanger: the hanger (t) isn't the important part; it's the clothes (the probabilities and values of X) that it holds in a structured way. The primary purpose is to transform a sequence of probabilities into a single, manageable function.

A crucial property, and a key exam check, is that **G(1) = 1**, because substituting t=1 reduces the sum to Σ P(X=x), which is the sum of all probabilities and must equal 1.

**Example**: A biased coin shows heads with probability p=1/3. Let X=1 for heads and X=0 for tails. The PMF is P(X=1)=1/3 and P(X=0)=2/3. The PGF is:

G(t) = P(X=0)t⁰ + P(X=1)t¹ = (2/3) × 1 + (1/3) × t = **(2+t)/3**.

### Concept 2: Extracting Moments (Mean and Variance)

This is the most common application of PGFs in exams. By differentiating G(t) with respect to t and evaluating at t=1, we can find the moments of the distribution.

- **The Mean (Expected Value)**: The first derivative gives the mean. **E(X) = G'(1)**
- **The Variance**: This requires the first and second derivatives. First, the second derivative gives the *second factorial moment*: **E[X(X-1)] = G''(1)**. This is a very common point of error; G''(1) is NOT E(X²). From this, we find Var(X) using the formula: **Var(X) = G''(1) + G'(1) - [G'(1)]²**

Credit is often awarded for explicitly stating this variance formula before substitution.

![Extracting Moments from a PGF](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_3f0e928d-1658-402d-b473-a12162b896da/pgf_moments_diagram.png)

### Concept 3: The Convolution Theorem

This theorem is used for finding the distribution of the sum of two or more *independent* random variables. If Z = X + Y, where X and Y are independent, the PGF of Z is simply the product of the PGFs of X and Y.

**G_Z(t) = G_X(t) × G_Y(t)**

This is a powerful shortcut. For example, if you have two independent Poisson variables, X ~ Po(λ₁) and Y ~ Po(λ₂), you can find the distribution of their sum Z = X + Y by multiplying their PGFs. The result is the PGF for a Po(λ₁ + λ₂) distribution, saving you a much more complex convolution calculation.

![The Convolution Theorem for PGFs](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_3f0e928d-1658-402d-b473-a12162b896da/pgf_convolution_diagram.png)

## Mathematical Relationships

Below are the key formulas and PGFs for standard distributions. Candidates should be able to derive these but are strongly advised to memorise them for exam efficiency. Throughout, q = 1 - p.

| Distribution | PMF: P(X=x) | PGF: G(t) | Mean E(X) | Variance Var(X) | Status |
| :--- | :--- | :--- | :--- | :--- | :--- |
| **Bernoulli(p)** | p for x=1, q for x=0 | `q + pt` | `p` | `pq` | Must memorise |
| **Binomial(n,p)** | `(nCx) pˣ qⁿ⁻ˣ` | `(q + pt)ⁿ` | `np` | `npq` | Must memorise |
| **Poisson(λ)** | `e^(-λ) λˣ / x!` | `e^(λ(t-1))` | `λ` | `λ` | Must memorise |
| **Geometric(p)** | `qˣ⁻¹ p` (for x=1,2,...) | `pt / (1-qt)` | `1/p` | `q/p²` | Must memorise |
| **Negative Binomial(r,p)** | `((x-1)C(r-1)) pʳ qˣ⁻ʳ` (for x=r,r+1,...) | `(pt / (1-qt))ʳ` | `r/p` | `rq/p²` | Given on formula sheet |

**Key Moment Formulas:**

- **E(X) = G'(1)** (Must memorise)
- **Var(X) = G''(1) + G'(1) - [G'(1)]²** (Must memorise)

**Key Transformation Formula:**

- For Z = aX + b, **G_Z(t) = tᵇ × G_X(tᵃ)** (Must memorise)

## Practical Applications

While PGFs are largely a theoretical tool in A-Level, they have significant real-world applications in fields that model discrete events, particularly where sums of variables are involved.

- **Queueing Theory**: In call centres or network traffic analysis, the number of arrivals in a given interval might be modelled by a Poisson distribution. PGFs can be used to analyse the total number of arrivals over several intervals or the properties of waiting times.
- **Genetics**: The number of offspring carrying a certain gene can be modelled as a random variable. PGFs are used in branching processes to model population growth over generations, calculating the probability of eventual extinction or survival of a genetic line.
- **Insurance Risk**: An insurance company might model the number of claims for different policy types using different distributions. PGFs allow them to combine these to find the distribution of the total number of claims, which is crucial for calculating capital reserves.
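As a rough illustration of the insurance idea, the sketch below multiplies two PGFs to get the distribution of a total claim count. The policy names, the choice of Binomial models and all parameter values are hypothetical, chosen only to keep the arithmetic exact; `binomial_pgf` and `multiply` are helper names invented for this example.

```python
from fractions import Fraction
from math import comb


def binomial_pgf(n, p):
    """Coefficients of (q + pt)^n: entry x is P(X = x) for X ~ B(n, p)."""
    q = 1 - p
    return [comb(n, x) * p**x * q**(n - x) for x in range(n + 1)]


def multiply(a, b):
    """Multiply two polynomials stored as coefficient lists [c0, c1, ...]."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


# Hypothetical claim-count models for two independent policy types.
home = binomial_pgf(2, Fraction(1, 10))   # home claims  ~ B(2, 0.1)
motor = binomial_pgf(3, Fraction(1, 5))   # motor claims ~ B(3, 0.2)

# Convolution theorem: the PGF of the total is the product of the PGFs.
total = multiply(home, motor)

mean_total = sum(x * p for x, p in enumerate(total))
print(total[0])      # P(no claims at all)
print(mean_total)    # equals 2*0.1 + 3*0.2
```

The coefficient of tˣ in the product is P(total = x), so the whole distribution of the sum is read straight from the list, even though the sum here is not itself Binomial.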

Key Terms & Definitions

Probability Generating Function (PGF)

For a discrete random variable X, the PGF is G(t) = E(tˣ) = Σ P(X=x)tˣ, where the sum is over all values x that X can take.

Moment

A quantitative measure of the shape of a probability distribution. The first moment is the mean. The second central moment is the variance.

Factorial Moment

The r-th factorial moment of X is E[X(X-1)...(X-r+1)]. The second factorial moment, E[X(X-1)], is found from G''(1).
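The link between the second factorial moment and the variance can be written out in a few lines. This is a sketch of the standard derivation, typeset assuming the `amsmath` package:

```latex
\begin{align*}
G(t)   &= \sum_x P(X=x)\,t^{x} \\
G'(t)  &= \sum_x x\,P(X=x)\,t^{x-1}
        &&\Rightarrow\quad G'(1) = E(X) \\
G''(t) &= \sum_x x(x-1)\,P(X=x)\,t^{x-2}
        &&\Rightarrow\quad G''(1) = E[X(X-1)] = E(X^2) - E(X) \\
\operatorname{Var}(X) &= E(X^2) - [E(X)]^2
        = G''(1) + G'(1) - [G'(1)]^2
\end{align*}
```

The extra G'(1) term is exactly what converts E[X(X-1)] back into E(X²).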

Convolution Theorem

If X and Y are independent random variables, the PGF of their sum Z = X + Y is the product of their individual PGFs: G_Z(t) = G_X(t) * G_Y(t).
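The reason the product rule holds is short enough to state in full. The sketch below (again assuming `amsmath`) uses only the definition G(t) = E(tˣ) and the fact that expectations of independent quantities multiply:

```latex
\begin{align*}
G_Z(t) &= E\left(t^{X+Y}\right) = E\left(t^{X}\,t^{Y}\right) \\
       &= E\left(t^{X}\right)E\left(t^{Y}\right)
          && \text{(since } X \text{ and } Y \text{ are independent)} \\
       &= G_X(t)\,G_Y(t)
\end{align*}
```

Without independence the second line fails, which is why the condition must be stated before using the theorem.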

PMF (Probability Mass Function)

A function that gives the probability that a discrete random variable is exactly equal to some value, written P(X=x).

Dummy Variable

A variable, such as 't' in G(t), that is used as a placeholder in a function and is not one of the variables being measured.
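To see the PGF definition and the role of the dummy variable in action, here is a minimal Python sketch. The helper name `pgf_from_pmf` is invented for illustration, and the PMF is the biased coin used in the notes (heads with probability 1/3).

```python
from fractions import Fraction


def pgf_from_pmf(pmf):
    """Build G(t) = sum of P(X=x) * t**x from a PMF given as {x: probability}."""
    def G(t):
        return sum(p * t**x for x, p in pmf.items())
    return G


# Biased coin: P(X=0) = 2/3 (tails), P(X=1) = 1/3 (heads).
coin_pmf = {0: Fraction(2, 3), 1: Fraction(1, 3)}
G = pgf_from_pmf(coin_pmf)

print(G(1))               # 1: the probabilities sum to one
print(G(Fraction(1, 2)))  # 5/6, matching (2 + t)/3 at t = 1/2
```

The value of t fed in carries no probabilistic meaning by itself; it is only a placeholder that keeps the probabilities attached to the right powers.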

Worked Examples

Worked Example

Question: The discrete random variable X has a Poisson distribution with mean 2. The discrete random variable Y has a Poisson distribution with mean 3. X and Y are independent. Find the probability generating function of Z = X + Y and use it to find P(Z=4). [5 marks]

Solution:

Step 1: Write down the PGFs for X and Y. For X ~ Po(2), G_X(t) = e^(2(t-1)). For Y ~ Po(3), G_Y(t) = e^(3(t-1)).

Step 2: Use the convolution theorem for the sum of independent variables. Since Z = X + Y and X, Y are independent, G_Z(t) = G_X(t) * G_Y(t), so G_Z(t) = e^(2(t-1)) * e^(3(t-1)) = e^(2(t-1) + 3(t-1)) = e^(5(t-1)).

Step 3: Identify the distribution of Z. The PGF G_Z(t) = e^(5(t-1)) is the standard PGF for a Poisson distribution with parameter 5. Therefore, Z ~ Po(5).

Step 4: Calculate P(Z=4) using the identified distribution. For Z ~ Po(5), the PMF is P(Z=z) = e⁻⁵ * 5ᶻ / z!, so P(Z=4) = e⁻⁵ * 5⁴ / 4! = (625 * e⁻⁵) / 24.

Step 5: Calculate the final numerical answer. P(Z=4) ≈ 0.175467...

Final answer: **0.175** (to 3 s.f.)
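The answer can be sanity-checked numerically without quoting the Po(5) result. The sketch below multiplies the first few PGF coefficients of Po(2) and Po(3) and compares the t⁴ coefficient with the closed form. The truncation length `N` and the helper names are choices made for this illustration.

```python
import math


def poisson_pmf(lam, n_terms):
    """First n_terms probabilities P(X = 0), ..., P(X = n_terms - 1) of Po(lam)."""
    return [math.exp(-lam) * lam**x / math.factorial(x) for x in range(n_terms)]


def multiply(a, b):
    """Multiply two power series truncated to coefficient lists."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


N = 10  # the t^4 coefficient only needs terms up to t^4, so this is ample
pz = multiply(poisson_pmf(2, N), poisson_pmf(3, N))

direct = math.exp(-5) * 5**4 / math.factorial(4)  # P(Z = 4) for Z ~ Po(5)
print(round(pz[4], 6), round(direct, 6))          # both approximately 0.175467
```

Agreement between the two numbers is a check on the identification Z ~ Po(5), not a replacement for the algebra the mark scheme expects.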

Worked Example

Question: A discrete random variable X has probability generating function G_X(t) = k(1 + 2t + 3t²). (i) Find the value of the constant k. (ii) Use the PGF to find the mean and variance of X. [6 marks]

Solution:

(i) Step 1: Use the property that G(1) = 1. G_X(1) = k(1 + 2(1) + 3(1)²) = k(1 + 2 + 3) = 6k. Since G_X(1) must equal 1, we have 6k = 1, so **k = 1/6**.

(ii) Step 2: Find the first derivative, G'(t). G_X(t) = (1/6)(1 + 2t + 3t²), so G'_X(t) = (1/6)(2 + 6t).

Step 3: Calculate the mean, E(X) = G'(1). E(X) = G'_X(1) = (1/6)(2 + 6(1)) = 8/6 = **4/3**.

Step 4: Find the second derivative, G''(t). G''_X(t) = (1/6)(6) = 1.

Step 5: Calculate G''(1). G''_X(1) = 1. This is the value of E[X(X-1)].

Step 6: State and use the variance formula. Var(X) = G''(1) + G'(1) - [G'(1)]² = 1 + (4/3) - (4/3)² = 1 + 4/3 - 16/9 = 9/9 + 12/9 - 16/9 = **5/9**.
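The same steps can be checked with exact fractions. In this sketch a polynomial PGF is stored as its list of coefficients, and `derivative` and `evaluate` are small helpers written for this example rather than library functions.

```python
from fractions import Fraction


def derivative(coeffs):
    """Differentiate a polynomial [c0, c1, c2, ...], where c_k multiplies t**k."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def evaluate(coeffs, t):
    """Evaluate the polynomial at t."""
    return sum(c * t**k for k, c in enumerate(coeffs))


unnormalised = [1, 2, 3]              # 1 + 2t + 3t^2
k = Fraction(1, sum(unnormalised))    # G(1) = 1 forces k = 1/6
G = [k * c for c in unnormalised]

G1 = evaluate(derivative(G), 1)               # G'(1)  = E(X)
G2 = evaluate(derivative(derivative(G)), 1)   # G''(1) = E[X(X-1)]

mean = G1
variance = G2 + G1 - G1**2
print(k, mean, variance)   # 1/6 4/3 5/9
```

Because the coefficients are the probabilities P(X=0), P(X=1), P(X=2), the result can also be cross-checked against E(X²) - [E(X)]² computed directly from the table.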

Worked Example

Question: The number of emails arriving at an office follows a Poisson distribution with a mean of 3 per hour. Use probability generating functions to find the probability that exactly 5 emails arrive in a two-hour period. [4 marks]

Solution:

Step 1: Define the random variables for each hour. Let X₁ be the number of emails in the first hour, X₁ ~ Po(3). Let X₂ be the number of emails in the second hour, X₂ ~ Po(3). We assume the numbers of arrivals in the two hours are independent.

Step 2: Write down the PGF for a single hour. For X₁ and X₂, the PGF is G(t) = e^(3(t-1)).

Step 3: Find the PGF for the total number of emails in two hours. Let Z = X₁ + X₂. By the convolution theorem, G_Z(t) = G_X₁(t) * G_X₂(t), so G_Z(t) = e^(3(t-1)) * e^(3(t-1)) = e^(6(t-1)).

Step 4: Identify the distribution of Z and calculate the probability. The PGF G_Z(t) shows that Z ~ Po(6). P(Z=5) = e⁻⁶ * 6⁵ / 5! = (7776 * e⁻⁶) / 120 = **0.161** (to 3 s.f.).

Practice Questions

Question: A fair four-sided die is numbered 1, 2, 3, 4. The result of a single roll is the random variable X. Find the probability generating function of X.

Answer: Each outcome has probability 1/4, so G(t) = (t + t² + t³ + t⁴)/4.

Question: A random variable X has PGF G(t) = (0.4 + 0.6t)¹⁰. Identify the distribution of X, stating its parameters, and find E(X).

Answer: Comparing with (q + pt)ⁿ gives X ~ B(10, 0.6), so E(X) = np = 6.

Question: The random variable Y has PGF G(t) = e^(4(t-1)). Use differentiation to find the variance of Y.

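Answer: G'(t) = 4e^(4(t-1)) and G''(t) = 16e^(4(t-1)), so G'(1) = 4 and G''(1) = 16. Var(Y) = G''(1) + G'(1) - [G'(1)]² = 16 + 4 - 16 = 4.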

Question: Let X be a random variable with PGF G_X(t). A second random variable is defined as Y = 2X + 3. Find the PGF of Y, G_Y(t), in terms of G_X(t).

Answer: G_Y(t) = E(t^(2X+3)) = t³ E((t²)ˣ) = t³ G_X(t²).

Question: A discrete random variable X has PGF G(t) = (1/35)(1 + 4t + 10t² + 20t³). Find the mode of X.

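Answer: The coefficients give P(X=0) = 1/35, P(X=1) = 4/35, P(X=2) = 10/35 and P(X=3) = 20/35. The largest probability is at x = 3, so the mode is 3.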
