Strong Law of Large numbers
The Strong Law of Large Numbers (SLLN) is a fundamental result in probability theory that describes the long-term behavior of the average of a sequence of random variables.
The SLLN states that, under certain conditions, the sample average of a sequence of independent and identically distributed (i.i.d.) random variables converges almost surely to the expected value (mean) of those variables as the sample size goes to infinity.
Mathematically, let be a sequence of i.i.d. random variables with a finite first moment, i.e., . The SLLN asserts that the sample average of these variables converges almost surely to the expected value:
This means that as the number of observations increases, the sample average will almost surely converge to the true mean .
Consider a fair coin flip experiment where we assign:
The expected value is clearly . But what happens when we perform this experiment repeatedly?
In our animation, we observe:
- The horizontal line at represents
- The blue path shows as increases
- Early fluctuations are larger due to the scaling
- As , the path stabilizes near
One of the most important applications of the SLLN is in the Monte Carlo method, which is used to numerically estimate quantities that are difficult or impossible to calculate analytically. The accuracy of Monte Carlo method depends on the convergence of the sample averages to the true values, which is guaranteed by the SLLN.
Let's define a random variable based on randomly chosen points in a square:
where
The expected value of is:
Let be i.i.d. copies of . By the Strong Law of Large Numbers:
The foolowing example is Theorem 2.3.5, page 59 from [Durrett2019]
One of the earliest results in this direction is the following theorem, which can be easily proved with Chebyshev inequality and Borel-Cantelli lemma.
CANTELLI
Let be i.i.d. with . If , then
Proof. By letting , we can suppose without loss of generality that . Now
Terms in the sum of the form , , and are (if are distinct) since the expectation of the product is the product of the expectations because of the independence, and in each case one of the terms has expectation 0. The only terms that do not vanish are those of the form and . There are and of these terms, respectively. In the second case, we can pick the two indices in ways, and with the indices fixed, the term can arise in a total of six ways: and . The last observation implies
where . Chebyshev’s inequality gives us
Since is arbitrary, the proof is complete.
In this section, we introduce the Toeplitz and Kronecker Lemmas, which play crucial roles in proving the Strong Law of Large Numbers with finite absolute first monent.
The Toeplitz Lemma provides conditions under which the averages of a sequence converge to a limit.
TOEPLITZ
Let be a sequence of nonnegative numbers, , , and , . Let be a sequence of numbers converging to . Then
In particular, if and then
Proof. Let , and let be such that for all . Choose such that
Then, for any ,
Then using the triangle inequality and splitting sum ut to
SERIES AND AVERAGES, KRONECKER
Let be a sequence of positive increasing numbers, as , and let be a sequence of numbers such that converges. Then
Proof. Let , , . Then from
since, if , then, by Toeplitz's lemma,
REMARK
If , and converges, then
The foolowing proof is taken from [Shiryaev2] pages 16-17
Here we present the strongest version of the SLLN, which was proved by Kolmogorov and require the finitnes of the expectation of the absolute value.
STRONG LAW OF LARGE NUMBERS, KOLMOGOROV
Let be i.i.d. with . If , then
We break down the proof of the theorem in several steps. First, we consider the truncated sequence , noting that the convergence of is equivalent to the convergence of . Next, we apply Kronecker's Lemma and Kolmogorov's Two-Series Theorem to derive the final result.
Proof. Let's suppose without a loss of generality and define the truncate random variable
Then
Hence, the Borel-Cantelli lemma yields i.o. , and so for all but finitely many a.s.
Because of for all but finitely many a.s., we have
Note that in general , but
Hence, by Toeplitz Lemma, for and ,
By Kronecker Lemma if
Define the centered truncated variable . It therefore remains to show that
We can estimate, where we use that are i.i.d and
We now interchange the order of summation, this is allowed because all terms are nonnegative: