The de Moivre–Laplace theorem, a special case of the central limit theorem, states that the normal distribution may be used as an approximation to the binomial distribution under certain conditions.
The theorem appeared in the second edition of The Doctrine of Chances by Abraham de Moivre, published in 1738. Although de Moivre did not use the term "Bernoulli trials", he wrote about the probability distribution of the number of times "heads" appears when a coin is tossed 3600 times.
Let $X_1, X_2, \ldots$ be a sequence of independent and identically distributed random variables, where
$$X_1 = \begin{cases} 1, & \text{with } P(X_1 = 1) = \tfrac{1}{2}, \\ -1, & \text{with } P(X_1 = -1) = \tfrac{1}{2}, \end{cases}$$
and let $S_n = \sum_{k=1}^{n} X_k$. In words, we are betting 1 on the flipping of a fair coin and $S_n$ is our winnings at time $n$.
Theorem (The De Moivre-Laplace Theorem)
If $a < b$ then as $n \to \infty$
$$P\left(a \le \frac{S_n}{n^{1/2}} \le b\right) \to \int_a^b (2\pi)^{-1/2} e^{-z^2/2}\,dz.$$
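The statement can be checked numerically. The following is a minimal sketch, not part of the cited proof: it computes the left-hand side exactly from binomial counts and compares it with the normal integral. The function names `normal_cdf` and `exact_probability` and the choice $a = -1$, $b = 1$ are illustrative assumptions.

```python
import math

def normal_cdf(z: float) -> float:
    """Standard normal CDF, expressed through the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def exact_probability(n: int, a: float, b: float) -> float:
    """Exact P(a <= S_n / sqrt(n) <= b) for the fair +-1 coin game.

    S_n = 2*H - n, where H (the number of +1 flips) is Binomial(n, 1/2).
    """
    root = math.sqrt(n)
    favourable = 0
    for heads in range(n + 1):
        s = 2 * heads - n
        if a * root <= s <= b * root:
            favourable += math.comb(n, heads)
    # Exact integer count of favourable outcomes divided by 2^n outcomes.
    return favourable / 2**n

a, b = -1.0, 1.0  # illustrative interval
limit = normal_cdf(b) - normal_cdf(a)
for n in (10, 100, 1000, 10000):
    print(n, exact_probability(n, a, b), limit)
```

The printed exact values approach the limit only slowly, because $S_n$ lives on a lattice and the interval endpoints may or may not land on lattice points.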
Motivation
If $n$ and $k$ are integers, then
$$P(S_{2n} = 2k) = \binom{2n}{n+k} 2^{-2n},$$
since $S_{2n} = 2k$ if and only if there are $n+k$ flips that are $+1$ and $n-k$ flips that are $-1$ in the first $2n$. The first factor gives the number of such outcomes and the second the probability of each one.
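The counting argument can be verified by brute force for small $n$. This sketch enumerates every sequence of $2n$ flips and compares the number with sum $2k$ against the binomial coefficient. The helper name `count_paths` and the value $n = 4$ are assumptions chosen for illustration.

```python
import itertools
import math

def count_paths(n: int, k: int) -> int:
    """Count +-1 sequences of length 2n whose sum equals 2k, by enumeration."""
    return sum(
        1
        for flips in itertools.product((1, -1), repeat=2 * n)
        if sum(flips) == 2 * k
    )

n = 4  # 2^(2n) = 256 sequences, small enough to enumerate
for k in range(-n, n + 1):
    brute = count_paths(n, k)
    formula = math.comb(2 * n, n + k)
    # Columns: k, enumerated count, binomial coefficient, P(S_2n = 2k)
    print(k, brute, formula, formula / 2 ** (2 * n))
```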
Sum distribution
If we plot these probabilities for successively larger $n$, a pattern emerges: the binomial probabilities form a bell-shaped curve, and the curve becomes smoother as $n$ increases.
Plot of binomial distributions
The transformation from a discrete, jumpy distribution to a smooth, symmetric bell curve represents one of probability theory's most beautiful and fundamental convergence phenomena.
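The bell shape can also be seen without a plot. As a rough sketch, the code below divides each point probability by the lattice spacing $(2/n)^{1/2}$ and measures its largest distance from the standard normal density. The names `point_probability`, `normal_density` and `max_gap` are illustrative, not taken from the source.

```python
import math

def point_probability(n: int, k: int) -> float:
    """P(S_2n = 2k) = C(2n, n+k) / 2^(2n)."""
    return math.comb(2 * n, n + k) / 4**n

def normal_density(x: float) -> float:
    """Standard normal density."""
    return math.exp(-x * x / 2.0) / math.sqrt(2.0 * math.pi)

def max_gap(n: int) -> float:
    """Largest gap between rescaled probabilities and the normal density.

    Lattice points are x_k = 2k / sqrt(2n), spaced (2/n)^(1/2) apart, so
    dividing each probability by the spacing gives a density-like height.
    """
    spacing = math.sqrt(2.0 / n)
    return max(
        abs(point_probability(n, k) / spacing
            - normal_density(2 * k / math.sqrt(2 * n)))
        for k in range(-n, n + 1)
    )

for n in (5, 20, 80, 320):
    print(n, max_gap(n))
```

The printed gap shrinks as $n$ grows, which is the numerical counterpart of the histogram settling onto the bell curve.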
Proof of De Moivre-Laplace Theorem
The following proof is taken from [Durrett2019], pp. 98-99.
The proof proceeds in two steps: first we approximate the point probability $P(S_{2n} = 2k)$, and then we sum these approximations to obtain the interval probability $P\left(a \le S_{2n}/(2n)^{1/2} \le b\right)$.
Approximation of $P(S_{2n} = 2k)$
This subsection provides a proof for the theorem presented below, which approximates the single probability $P(S_{2n} = 2k)$.
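For orientation, here is a sketch of the two displays this step relies on. It is a reconstruction of the standard argument, not a quotation from [Durrett2019], and the symbol $x$ for the limit of the rescaled lattice point is our labeling. Stirling's formula $n! \sim n^n e^{-n} \sqrt{2\pi n}$ applied to the binomial coefficient yields the first line, and summing over the even lattice points in the interval yields the second, using $(\pi n)^{-1/2} = (2\pi)^{-1/2} (2/n)^{1/2}$:

```latex
% Local limit: if 2k/(2n)^{1/2} \to x, then
P(S_{2n} = 2k) \sim (\pi n)^{-1/2} e^{-x^2/2}.

% Interval probability as a sum over even lattice points m = x (2n)^{1/2}:
P\bigl(a (2n)^{1/2} \le S_{2n} \le b (2n)^{1/2}\bigr)
  = \sum_{m \in [a(2n)^{1/2},\, b(2n)^{1/2}] \cap 2\mathbb{Z}} P(S_{2n} = m)
  \approx \sum_{x \in [a,b] \cap (2\mathbb{Z}/(2n)^{1/2})}
      (2\pi)^{-1/2} e^{-x^2/2} \cdot (2/n)^{1/2},
```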
where $2\mathbb{Z}/(2n)^{1/2} = \{2z/(2n)^{1/2} : z \in \mathbb{Z}\}$. We have multiplied and divided by $\sqrt{2}$ since the space between points in the sum is $(2/n)^{1/2}$, so if $n$ is large the sum above is
$$\approx \int_a^b (2\pi)^{-1/2} e^{-x^2/2}\,dx.$$
The integrand is the density of the (standard) normal distribution, so changing notation we can write the last quantity as $P(a \le \chi \le b)$, where $\chi$ is a random variable with that distribution. This proves the De Moivre-Laplace Theorem along even indices. To remove the restriction to even integers, observe that $S_{2n+1} = S_{2n} \pm 1$. □
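The Riemann-sum step can be illustrated numerically. The sketch below sums the normal density times the spacing $(2/n)^{1/2}$ over the lattice points $2z/(2n)^{1/2}$ in $[a, b]$ and compares the result with the integral. The function names and the interval $a = -0.5$, $b = 1.5$ are assumptions made for illustration.

```python
import math

def normal_density(x: float) -> float:
    """Standard normal density."""
    return math.exp(-x * x / 2.0) / math.sqrt(2.0 * math.pi)

def lattice_sum(n: int, a: float, b: float) -> float:
    """Sum of density * spacing over the points 2z / sqrt(2n) lying in [a, b]."""
    scale = math.sqrt(2 * n)
    spacing = 2.0 / scale  # equals (2/n)^(1/2)
    z_min = math.ceil(a * scale / 2.0)
    z_max = math.floor(b * scale / 2.0)
    return sum(
        normal_density(2 * z / scale) * spacing
        for z in range(z_min, z_max + 1)
    )

def normal_integral(a: float, b: float) -> float:
    """Integral of the standard normal density over [a, b], via math.erf."""
    return 0.5 * (math.erf(b / math.sqrt(2.0)) - math.erf(a / math.sqrt(2.0)))

a, b = -0.5, 1.5  # illustrative interval
for n in (10, 100, 1000, 10000):
    print(n, lattice_sum(n, a, b), normal_integral(a, b))
```

As $n$ grows the spacing shrinks and the sum approaches the integral, which is the approximation used in the last step of the proof.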