The Strong Law of Large Numbers (SLLN) is a fundamental result in probability theory that describes the long-term behavior of the average of a sequence of random variables.
The SLLN states that, under certain conditions, the sample average of a sequence of independent and identically distributed (i.i.d.) random variables converges almost surely to the expected value (mean) of those variables as the sample size goes to infinity.
Mathematically, let X1,X2,… be a sequence of i.i.d. random variables with a finite first moment, i.e., E∣X1∣<∞. The SLLN asserts that the sample average of these variables converges almost surely to the expected value:
nX1+X2+⋯+Xna. s.EX1, as n→∞.
This means that as the number of observations n increases, the sample average will almost surely converge to the true mean EX1.
Motivation
Coin Flip Experiment
Consider a fair coin flip experiment where we assign:
Xi={1,0,if heads,if tails.
The expected value is clearly E[Xi]=21. But what happens when we perform this experiment repeatedly?
Fair coin flip experiment
In our animation, we observe:
The horizontal line at y=0.5 represents E[Xi]
The blue path shows nSn as n increases
Early fluctuations are larger due to the n1 scaling
As n→∞, the path stabilizes near μ=0.5
Monte-Carlo Method
One of the most important applications of the SLLN is in the Monte Carlo method, which is used to numerically estimate quantities that are difficult or impossible to calculate analytically. The accuracy of Monte Carlo method depends on the convergence of the sample averages to the true values, which is guaranteed by the SLLN.
Let's define a random variable X based on randomly chosen points (U1,U2) in a 6×6 square:
X={1,0,if U12+U22≤9otherwise
where U1,U2∼Uniform(−3,3)
Calculating π with Monte-Carlo
The expected value of X is:
E[X]=P(U12+U22≤9)=Area of SquareArea of Circle=(2r)2πr2=4π
Let Xii=1n be i.i.d. copies of X. By the Strong Law of Large Numbers:
n1∑i=1nXia.s.E[X]=4π
Therefore:
4⋅n1i=1∑nXia.s.π
Strong Law of Large Numbers with finite 4th moment
The foolowing example is Theorem 2.3.5, page 59 from [Durrett2019]
One of the earliest results in this direction is the following theorem, which can be easily proved with Chebyshev inequality and Borel-Cantelli lemma.
Theorem (Cantelli)
Let X1,X2,… be i.i.d. with EX=μ and EX4<∞. If Sn=X1+⋯+Xn, then Sn/n→μ a.s.
\begin{proof}
Proof. By letting Xi′=Xi−μ, we can suppose without loss of generality that μ=0. Now
ESn4=E(i=1∑nXi)4=E1≤i,j,k,l≤n∑XiXjXkXl
Terms in the sum of the form E(Xi3Xj), E(Xi2XjXk), and E(XiXjXkXl) are 0 (if i,j,k,l are distinct) since the expectation of the product is the product of the expectations because of the independence, and in each case one of the terms has expectation 0. The only terms that do not vanish are those of the form EXi′4 and EXi′2Xj′2=(EXi′2)2. There are n and 3n(n−1) of these terms, respectively. In the second case, we can pick the two indices in n(n−1)/2 ways, and with the indices fixed, the term can arise in a total of six ways: {1,2},{1,3},{1,4},{2,3},{2,4} and {3,4}. The last observation implies
ESn4=nEXi4+3(n2−n)(EXi2)2≤Cn2
where C<∞. Chebyshev’s inequality gives us
P(n∣Sn∣>ϵ)≤(nϵ)4E(Sn4)≤(n2ϵ4)C
Summing on n and using the Borel-Cantelli lemma gives P(∣Sn∣>nϵ i.o.)=0 or ∃n0,∀n≥n0n∣Sn∣≤ϵ. Since ϵ is arbitrary, the proof is complete. □
\end{proof}
Toeplitz and Kronecker Lemmas
In this section, we introduce the Toeplitz and Kronecker Lemmas, which play crucial roles in proving the Strong Law of Large Numbers with finite absolute first monent.
Toeplitz Lemma
The Toeplitz Lemma provides conditions under which the averages of a sequence {xn}n≥1 converge to a limit.
Lemma (Toeplitz)
Let {an} be a sequence of nonnegative numbers, bn=∑i=1nai, b1=a1>0, and bn↗∞, n→∞. Let {xn}n≥1 be a sequence of numbers converging to x. Then
bn1j=1∑najxj→x.(7)
Proof. Let ε>0, and let n0=n0(ε) be such that ∣xn−x∣≤ε/2 for all n≥n0. Choose n1>n0 such that
The foolowing proof is taken from [Shiryaev2] pages 16-17
Here we present the strongest version of the SLLN, which was proved by Kolmogorov and require the finitnes of the expectation of the absolute value.
Theorem (Strong law of large numbers, Kolmogorov)
Let X1,X2,… be i.i.d. with EX=μ and E∣X∣<∞. If Sn=X1+⋯+Xn, then Sn/n→μ a.s.
\end{theorem}
We break down the proof of the theorem in several steps. First, we consider the truncated sequence {Xn′}, noting that the convergence of its sample average is equivalent to the convergence of the sample average of {Xn}. Next, we apply Kronecker's Lemma and Kolmogorov's Two-Series Theorem to derive the final result.
Let f(x) be a continuous function defined on [0,1], with values in [0,1]. The following idea is the foundation of the statistical method of calculating
∫01f(x)dx
, called the Monte Carlo method. Let X1,X2,X3,… be a sequence of independent random variables uniformly distributed on [0,1]. Define a new sequence of i.i.d. random variables
Yn=f(Xn)
with the mathematical expectation
EYn=∫01f(x)dx∀n≥1.
By the strong law of large numbers (Theorem 3),
n1i=1∑nf(Xi)a.s.n→∞∫01f(x)dx.
Consequently, we can approximate an integral ∫01f(x)dx by taking a simulation consisting of random variables Xi,i≥1 and then calculating f(Xi),i≥1 and n1∑i=1nf(Xi).