Borel-Cantelli Lemma

Last updated: 2026-04-12

The Borel-Cantelli Lemma is a fundamental result in probability theory that provides criteria for determining whether the events in an infinite sequence occur infinitely often or only finitely often.
Theorem (Borel-Cantelli Lemma)
If $\sum_{n=1}^{\infty} P(A_n) < \infty$, then $P\{ A_n \, \text{i.o.}\} = 0$.
If the events $A_n$ are independent and $\sum_{n=1}^{\infty} P(A_n) = \infty$, then $P\{ A_n \, \text{i.o.}\} = 1$.

Motivation

The Borel-Cantelli lemma provides powerful insights into the behavior of infinite sequences of events. Let's explore this through three illuminating examples of coin tossing experiments.

Symmetric coin

Consider an infinite sequence of independent tosses of a fair coin, represented by random variables $X_1, X_2, \ldots$, where each toss has probability $P(\text{Head}) = \frac{1}{2}$. Will we see infinitely many heads, or will the sequence eventually contain only tails?
Example of Symmetric coin
While our intuition suggests we should see infinitely many heads, the Borel-Cantelli lemma provides mathematical certainty. Consider:
\begin{equation*}\sum_{k=1}^{\infty} P \left( X_k = \text{Head} \right) = \sum_{k=1}^{\infty} \frac{1}{2} = \infty\end{equation*}
Since the tosses are independent and this series diverges, the second part of the lemma confirms our intuition: with probability 1 (almost surely), we will observe infinitely many heads.
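A short Monte Carlo sketch can illustrate this (assuming nothing beyond the Python standard library; any long finite prefix of the infinite sequence should keep producing heads):

```python
import random

random.seed(0)

# Simulate a long but finite prefix of the infinite fair-coin sequence.
# The second Borel-Cantelli lemma predicts infinitely many heads almost
# surely, so heads should keep appearing arbitrarily far into the prefix.
n_tosses = 10_000
tosses = [random.random() < 0.5 for _ in range(n_tosses)]

heads = sum(tosses)
last_head = max(k for k, h in enumerate(tosses, start=1) if h)
print(f"heads observed: {heads}, position of last head: {last_head}")
```

In a typical run roughly half the tosses are heads and the last head appears very close to the end of the prefix, consistent with heads occurring infinitely often.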

Biased coin 1

Now consider a more intriguing case where the probability of heads decreases with each toss: $P(X_n = \text{Head}) = \frac{1}{n}$. Despite this decreasing probability, the Borel-Cantelli lemma reveals that:
\begin{equation*}\sum_{k=1}^{\infty} P \left( X_k = \text{Head} \right) = \sum_{k=1}^{\infty} \frac{1}{k} = \infty\end{equation*}
This is the harmonic series, which diverges. Therefore, remarkably, we will still see infinitely many heads almost surely, even though the probability of heads becomes arbitrarily small!
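This case is easy to probe numerically. The sketch below (standard library only) tosses coin $n$ with success probability $1/n$; heads become rare but, per the second Borel-Cantelli lemma, never stop entirely:

```python
import random

random.seed(1)

# Toss coin n with P(head) = 1/n.  The success probabilities sum like the
# harmonic series (divergent), so by the second Borel-Cantelli lemma heads
# occur infinitely often almost surely, despite becoming ever rarer.
n_tosses = 100_000
head_positions = [n for n in range(1, n_tosses + 1) if random.random() < 1 / n]

print(f"number of heads: {len(head_positions)}")
print(f"last head seen at toss: {head_positions[-1]}")
```

The expected number of heads in the first $N$ tosses is the harmonic number $H_N$ (about 12 for $N = 100{,}000$), so only a handful of heads appear, yet they keep turning up at ever larger positions.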

Biased coin 2

Finally, consider an even more extreme bias where $P(X_n = \text{Head}) = \frac{1}{n^2}$. Here, the Borel-Cantelli lemma reveals a fundamentally different behavior:
Example of Biased coin 2
\begin{equation*}\sum_{k=1}^{\infty} P \left( X_k = \text{Head} \right) = \sum_{k=1}^{\infty} \frac{1}{k^2} = \frac{\pi^2}{6} < \infty\end{equation*}
Since this series converges, the Borel-Cantelli lemma tells us that, almost surely, there exists some finite $N$ after which we will never see another head. The probability of heads decreases so rapidly that only finitely many heads will occur.
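The contrast with the previous case shows up immediately in simulation (again a standard-library sketch, not a proof):

```python
import random

random.seed(2)

# Toss coin n with P(head) = 1/n^2.  The success probabilities sum to the
# finite value pi^2/6, so by the first Borel-Cantelli lemma only finitely
# many heads occur almost surely: eventually the sequence is all tails.
n_tosses = 100_000
head_positions = [n for n in range(1, n_tosses + 1) if random.random() < 1 / n**2]

print(f"heads observed at positions: {head_positions}")
```

A typical run produces just one or two heads, all near the very start of the sequence, matching the prediction that the heads stop for good after some finite point.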

What is the limit of a sequence of events

Before discussing the two Borel-Cantelli Lemmas, we first need to define some technical terms related to an infinite sequence of events. Consider the sequence of events $A_1, A_2, A_3, \dots$. We say that $A_n$ happens infinitely often (i.o.) if
\begin{align*}\{ A_n, \text{i.o.} \} &= \{ \omega : \forall m \in \mathbb{N}, \, \, \exists n(\omega) \geq m \,\, \text{such that} \, \, \omega \in A_{n(\omega)} \} = \\&= \bigcap_{m=1}^{\infty} \bigcup_{n \geq m} A_n = \limsup_{n \rightarrow \infty} A_n = \lim_{m \rightarrow \infty} \bigcup_{n \geq m} A_n\end{align*}
and we say that $A_n$ happens ultimately (ult.), i.e. for all but finitely many $n$, if
\begin{align*}\{ A_n, \text{ult.} \} &= \{ \omega : \exists m(\omega) \in \mathbb{N} \,\, \text{such that} \,\, \forall n \geq m(\omega), \, \, \omega \in A_{n} \} = \\&= \bigcup_{m=1}^{\infty} \bigcap_{n \geq m} A_n = \liminf_{n \rightarrow \infty} A_n = \lim_{m \rightarrow \infty} \bigcap_{n \geq m} A_n\end{align*}
Example
The concept of the limit of an infinite sequence of events can be intricate. To better understand this, consider a simple example that fits well with the definitions of the limit superior ($\limsup$) and limit inferior ($\liminf$). Suppose we focus on two subsets within the interval $(0,1]$: $A_1 = (0, \frac{1}{2}]$ and $A_2 = (\frac{1}{2}, 1]$. We define the whole sequence $A_n$ as follows:
\begin{equation*}A_n = \begin{cases} (0, \frac{1}{2}] , & n = 2k + 1, \quad k \in \mathbb{N} \\ (\frac{1}{2}, 1], & n = 2k, \quad k \in \mathbb{N} .\end{cases}\end{equation*}
With this sequence, we can compute the upper limit $\limsup_{n \rightarrow \infty} A_n$
\begin{equation*}\{ A_n, \text{i.o.} \} = \bigcap_{m=1}^{\infty} \bigcup_{n \geq m} A_n = (0, 1]\end{equation*}
and the lower limit $\liminf_{n \rightarrow \infty} A_n$
\begin{equation*}\{ A_n, \text{ult.} \} = \bigcup_{m=1}^{\infty} \bigcap_{n \geq m} A_n = \emptyset .\end{equation*}
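The two set computations above can be verified mechanically. The sketch below discretizes $(0,1]$ into a grid and truncates the sequence after $N$ terms (the tail unions and intersections stabilize after two terms here, since every tail contains both an odd- and an even-indexed set):

```python
# Finite-resolution check of the limsup/liminf example: odd-indexed sets
# are (0, 1/2], even-indexed sets are (1/2, 1].
N = 20
grid = [k / 1000 for k in range(1, 1001)]  # sample points in (0, 1]

def A(n):
    # Odd n: (0, 1/2]; even n: (1/2, 1].
    if n % 2 == 1:
        return {x for x in grid if x <= 0.5}
    return {x for x in grid if x > 0.5}

# limsup = intersection over m of the tail unions U_{n >= m} A_n
limsup = set(grid)
for m in range(1, N):
    tail_union = set().union(*(A(n) for n in range(m, N + 1)))
    limsup &= tail_union

# liminf = union over m of the tail intersections ∩_{n >= m} A_n
liminf = set()
for m in range(1, N):
    tail_inter = set(grid)
    for n in range(m, N + 1):
        tail_inter &= A(n)
    liminf |= tail_inter

print(len(limsup) == len(grid))  # True: limsup is the whole grid, i.e. (0, 1]
print(len(liminf) == 0)          # True: liminf is empty
```

Every tail union covers the whole interval (some later set of each type always appears), while every tail intersection is empty (the two alternating sets are disjoint), exactly as computed above.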

Proof of Borel-Cantelli Lemma

Theorem (The first part of Borel-Cantelli Lemma)
If $\sum_{n=1}^{\infty} P(A_n) < \infty$ then $P\{ A_n \, \text{i.o.}\} = 0$.
\begin{proof}
\quad Proof. If we denote $G_m = \bigcup_{n \geq m} A_n$ then we can see that $G_{m+1} \subset G_m$ and $G_m \downarrow \limsup_{n \rightarrow \infty} A_n$. So, using the continuity from above of $P$:
\begin{equation*}P \{ A_n, \text{i.o.} \} = P \left( \bigcap_{m=1}^{\infty} G_m \right) = \lim_{m \rightarrow \infty} P ( G_m ) = \lim_{m \rightarrow \infty} P \left( \bigcup_{n \geq m} A_n \right) \leq \lim_{m \rightarrow \infty} \sum_{n \geq m} P (A_n)\end{equation*}
where the last inequality is due to the countable sub-additivity of $P$. When $\sum_{n=1}^{\infty} P(A_n) < \infty$, the tail sums $\sum_{n \geq m} P(A_n)$ tend to $0$ as $m \rightarrow \infty$, and we get $P\{ A_n \, \text{i.o.}\} = 0$. $\Box$
\end{proof}
The typical application of the first part of the Borel-Cantelli Lemma is to consider events $A_n$ with $\sum_{n=1}^{\infty} P(A_n) < \infty$; then from the statement $P\{ A_n \, \text{i.o.} \} = 0$ we get, by taking the complement, $P(\{ A_n \, \text{i.o.} \}^c) = 1$. The last probability can be rewritten as $P \{ \omega : \exists m(\omega) \in \mathbb{N} \,\, \text{such that} \,\, \forall n \geq m(\omega), \, \, \omega \notin A_{n} \} = 1$, which means that, almost surely, there exists $m$ such that none of the events $A_n$ with $n \geq m$ occur; in other words, only finitely many $A_n$ occur.
For example, if the probability of a head at the $n$-th coin toss equals $p_n = \frac{1}{n^2}$, then the series is convergent, and starting from some toss $m$ we observe only tails.
[Figure: plot of $1 - x$ and $\exp(-x)$, illustrating the inequality $1 - x \leq \exp(-x)$]
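The inequality $1 - x \leq \exp(-x)$ shown in the plot, which the proof below relies on, holds for all real $x$ (by convexity of $\exp$); a quick numerical spot check on a grid:

```python
import math

# Check the inequality 1 - x <= exp(-x) on a grid of x values in [-5, 5].
# A tiny tolerance absorbs floating-point rounding near the equality point x = 0.
xs = [i / 100 for i in range(-500, 501)]
assert all(1 - x <= math.exp(-x) + 1e-12 for x in xs)
print("1 - x <= exp(-x) holds on the sampled grid")
```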
Theorem (The second part of Borel-Cantelli Lemma)
If the events $A_n$ are independent and $\sum_{n=1}^{\infty} P(A_n) = \infty$, then $P\{ A_n \, \text{i.o.}\} = 1$.
\begin{proof}
\quad Proof. If $A_1, A_2, \ldots$ are independent, so are $A_1^c, A_2^c, \ldots$. Hence for $N \geq n$ we have, using $1 - x \leq \exp(-x)$:
\begin{align*}P \left( \bigcap^N_{k=n} A^c_k \right) &= \prod^N_{k=n} P(A^c_k) = \prod^N_{k=n} (1 - P(A_k)) \leq \prod^N_{k=n} \exp(-P(A_k)) \\&= \exp \left( - \sum^N_{k=n} P(A_k) \right) \rightarrow 0 \quad \text{as} \quad N \rightarrow \infty\end{align*}
Consequently
\begin{equation*}P \left( \bigcup^{\infty}_{k=n} A_k \right) = 1\end{equation*}
for all $n$, and since $\bigcup^{\infty}_{k=n} A_k \downarrow \limsup_{n \rightarrow \infty} A_n$ it follows that $P(A_n \, \text{i.o.}) = 1$. $\Box$
\end{proof}

Applications

Proof of Strong Law of Large Numbers

The following example is Theorem 2.3.5, page 59 from [Durrett2019].

Theorem
Let $X_1, X_2, \ldots$ be i.i.d. with $\mathbb{E}X = \mu$ and $\mathbb{E}X^4 < \infty$. If $S_n = X_1 + \cdots + X_n$, then $S_n/n \to \mu$ a.s.
\begin{proof}
\quad Proof. By letting $X_i' = X_i - \mu$, we can suppose without loss of generality that $\mu = 0$. Now
\begin{equation*}\mathbb{E}S_n^4 = \mathbb{E}\left( \sum_{i=1}^n X_i \right)^4 = \mathbb{E} \sum_{1 \leq i,j,k,l \leq n} X_i X_j X_k X_l\end{equation*}
Terms in the sum of the form $\mathbb{E}(X_i^3 X_j)$, $\mathbb{E}(X_i^2 X_j X_k)$, and $\mathbb{E}(X_i X_j X_k X_l)$ with distinct indices are $0$: by independence the expectation of the product is the product of the expectations, and in each case one of the factors has expectation $0$. The only terms that do not vanish are those of the form $\mathbb{E}X_i^4$ and $\mathbb{E}X_i^2 X_j^2 = (\mathbb{E}X_i^2)^2$. There are $n$ and $3n(n - 1)$ of these terms, respectively. In the second case, we can pick the two indices in $n(n - 1)/2$ ways, and with the indices fixed, the term can arise in a total of six ways, according to which pair of positions carries the first index: $\{ 1, 2\}, \{ 1, 3\}, \{ 1, 4\}, \{ 2, 3\}, \{ 2, 4\}$ and $\{ 3, 4\}$. The last observation implies
\begin{equation*}\mathbb{E}S_n^4 = n\mathbb{E}X_i^4 + 3(n^2 - n)(\mathbb{E}X_i^2)^2 \leq Cn^2\end{equation*}
where $C < \infty$. Chebyshev's inequality gives us
\begin{equation*}P\left(\frac{|S_n|}{n} > \epsilon\right) \leq \frac{\mathbb{E}(S_n^4)}{(n\epsilon)^4} \leq \frac{C}{n^2 \epsilon^4}\end{equation*}
Summing on $n$ and using the Borel-Cantelli lemma gives $P(|S_n| > n\epsilon \text{ i.o.}) = 0$, i.e. almost surely $\exists n_0$ such that $\forall n \geq n_0$, $\frac{|S_n|}{n} \leq \epsilon$. Since $\epsilon$ is arbitrary, the proof is complete. $\Box$
\end{proof}
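The conclusion of the theorem is easy to observe numerically. The sketch below (a Monte Carlo illustration, not part of the proof) uses i.i.d. Uniform$(0,1)$ draws, which have $\mu = 1/2$ and all moments finite, so the theorem applies:

```python
import random

random.seed(3)

# Strong law of large numbers, illustrated: sample means of i.i.d.
# Uniform(0, 1) draws (mean 1/2, finite fourth moment) approach mu = 0.5.
def sample_mean(n):
    return sum(random.random() for _ in range(n)) / n

for n in (10, 1_000, 100_000):
    print(n, sample_mean(n))
```

As $n$ grows the printed sample means settle ever closer to $0.5$, mirroring the almost-sure convergence $S_n/n \to \mu$.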

References

[Durrett2019] R. Durrett, Probability: Theory and Examples, 5th edition, Cambridge University Press, 2019.