Central Limit Theorem

Published June 28, 2026

The central limit theorem explains why normal distributions appear so often in probability. It says that many small independent random effects, when added together and normalized, have an approximately Gaussian distribution.

CENTRAL LIMIT THEOREM

Let

X_1,X_2,\ldots

be independent, identically distributed random variables with expectation

\mathbb{E}X_1=\mu

and finite, positive variance

\operatorname{Var}(X_1)=\sigma^2

. If

S_n=X_1+\cdots+X_n

, then

\begin{equation*}\frac{S_n-n\mu}{\sigma\sqrt{n}}\xrightarrow{d}N(0,1),\qquad n\to\infty.\end{equation*}

Equivalently, for every pair of real numbers

a<b

\begin{equation*}\mathbb{P}\left(a\leq \frac{S_n-n\mu}{\sigma\sqrt{n}}\leq b\right)\to\frac{1}{\sqrt{2\pi}}\int_a^b e^{-x^2/2}\,\mathrm{d}x.\end{equation*}

The normalization subtracts the expected value

n\mu

and divides by the natural scale of fluctuations,

\sigma\sqrt{n}

.
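The interval form of the theorem can be checked numerically. The following is a minimal simulation sketch, not part of the original argument: it assumes, purely for illustration, that the summands are Exponential(1) variables (so the mean and standard deviation are both 1), and the helper names `standardized_sum`, `interval_frequency` and `normal_probability` are hypothetical.

```python
import math
import random
from statistics import NormalDist


def standardized_sum(n, mu, sigma, draw):
    """Return (S_n - n*mu) / (sigma*sqrt(n)) for one sample of n i.i.d. draws."""
    s = sum(draw() for _ in range(n))
    return (s - n * mu) / (sigma * math.sqrt(n))


def interval_frequency(a, b, n, trials, rng):
    """Empirical frequency of a <= standardized sum <= b over many trials.

    Illustrative assumption: X_i ~ Exponential(1), so mu = sigma = 1.
    """
    def draw():
        return rng.expovariate(1.0)

    hits = sum(a <= standardized_sum(n, 1.0, 1.0, draw) <= b for _ in range(trials))
    return hits / trials


def normal_probability(a, b):
    """Limiting value: the standard normal density integrated over [a, b]."""
    std = NormalDist(0.0, 1.0)
    return std.cdf(b) - std.cdf(a)


rng = random.Random(0)  # fixed seed so the run is reproducible
empirical = interval_frequency(-1.0, 1.0, n=100, trials=4000, rng=rng)
limit = normal_probability(-1.0, 1.0)
print(f"empirical: {empirical:.3f}, normal limit: {limit:.3f}")
```

The exponential distribution is strongly skewed, so agreement between the two printed numbers is a reasonable sanity check that the normalization by the mean and by the scale of fluctuations is doing what the theorem says.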

A standard example is the sum of dice rolls. Let

Y_1,\ldots,Y_n

be independent rolls of a fair six-sided die. Then

\begin{equation*}\mathbb{E}Y_1=\frac{7}{2},\qquad\operatorname{Var}(Y_1)=\frac{35}{12}.\end{equation*}

For

T_n=Y_1+\cdots+Y_n

, the central limit theorem gives

\begin{equation*}\frac{T_n-\frac{7n}{2}}{\sqrt{35n/12}}\xrightarrow{d}N(0,1).\end{equation*}
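The mean and variance of a single roll can be verified with exact rational arithmetic. This is a small sketch; the helper `standardize` is a hypothetical name for the normalization used in the display above.

```python
import math
from fractions import Fraction

faces = range(1, 7)
p = Fraction(1, 6)  # each face of a fair die has probability 1/6

mean = sum(p * k for k in faces)
variance = sum(p * (k - mean) ** 2 for k in faces)
print(mean, variance)  # prints: 7/2 35/12


def standardize(total, n):
    """Return (T_n - 7n/2) / sqrt(35n/12) for an observed sum of n dice."""
    return (total - n * float(mean)) / math.sqrt(n * float(variance))
```

For instance, `standardize(35, 10)` is 0.0, since 35 is exactly the expected sum of ten dice.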

Repeated sums of

10

dice stack into a bell-shaped histogram centered near

35

.

In the animation, each brick records one sum of ten dice. Individual rolls are discrete and bounded, but the histogram of many sums is already close to the normal curve with mean

35

and variance

10\cdot 35/12=175/6

.
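The closeness of the ten-dice histogram to the normal curve can also be checked without simulation, by computing the exact distribution of the sum through repeated convolution. The sketch below is one way to do this; the function names `dice_sum_pmf` and `normal_density` are illustrative and do not come from the original text.

```python
import math


def dice_sum_pmf(n):
    """Exact distribution of the sum of n fair dice as a dict {total: probability}."""
    pmf = {0: 1.0}
    for _ in range(n):
        nxt = {}
        for total, prob in pmf.items():
            for face in range(1, 7):
                nxt[total + face] = nxt.get(total + face, 0.0) + prob / 6.0
        pmf = nxt
    return pmf


def normal_density(x, mean, variance):
    """Density of the normal distribution with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2.0 * variance)) / math.sqrt(2.0 * math.pi * variance)


pmf10 = dice_sum_pmf(10)
mean10, var10 = 35.0, 175.0 / 6.0

# Compare the exact probabilities with the normal density at a few totals.
for total in (25, 30, 35, 40, 45):
    print(total, round(pmf10[total], 4), round(normal_density(total, mean10, var10), 4))

max_gap = max(abs(prob - normal_density(total, mean10, var10)) for total, prob in pmf10.items())
print("largest pointwise gap:", round(max_gap, 4))
```

Because the sums are integers spaced one apart, the probability of each total is directly comparable to the density value at that point.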

A Galton board gives another concrete example of the same phenomenon. Each ball makes a sequence of independent left-or-right choices, so the final bucket is determined by a binomial count. With many balls, the bucket heights begin to form the same bell-shaped profile predicted by the central limit theorem.

A Galton board turns repeated independent binary choices into a binomial histogram.

With more rows and many more balls, the same binomial mechanism produces a smoother approximation to the normal density.

A larger Galton board makes the normal approximation visually sharper.
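A Galton board is easy to imitate in a few lines. The sketch below assumes each peg sends the ball left or right with probability 1/2 (the text only says the choices are independent), and the names `galton_board`, `binomial_pmf` and `normal_density` are hypothetical. With fair pegs and `rows` rows, the bucket index is binomial with mean `rows / 2` and variance `rows / 4`.

```python
import math
import random
from collections import Counter


def galton_board(rows, balls, rng):
    """Drop balls through `rows` rows of pegs; the bucket is the number of right moves."""
    buckets = Counter()
    for _ in range(balls):
        rights = sum(rng.random() < 0.5 for _ in range(rows))
        buckets[rights] += 1
    return buckets


def binomial_pmf(rows, k):
    """Probability of exactly k right moves out of `rows` fair choices."""
    return math.comb(rows, k) / 2 ** rows


def normal_density(x, mean, variance):
    """Density of the normal distribution with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2.0 * variance)) / math.sqrt(2.0 * math.pi * variance)


rows, balls = 20, 20000
rng = random.Random(1)  # fixed seed so the run is reproducible
buckets = galton_board(rows, balls, rng)

# Columns: bucket, observed frequency, binomial probability, normal density.
for k in range(rows + 1):
    observed = buckets[k] / balls
    print(k, round(observed, 4), round(binomial_pmf(rows, k), 4),
          round(normal_density(k, rows / 2, rows / 4), 4))
```

Increasing `rows` and `balls` corresponds to the larger board: the observed frequencies track the binomial probabilities more closely, and those in turn move closer to the normal density.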

We prove the theorem using characteristic functions. Replacing

X_i

by

X_i-\mu

, it is enough to prove the result in the centered case

\mu=0

. Let

\begin{equation*}\varphi(t)=\mathbb{E}e^{itX_1}\end{equation*}

be the characteristic function of

X_1

. Since

\mathbb{E}X_1=0

and

\operatorname{Var}(X_1)=\sigma^2

, the characteristic function has the expansion

\begin{equation*}\varphi(t)=1-\frac{\sigma^2t^2}{2}+o(t^2),\qquad t\to 0.\end{equation*}

For

S_n=X_1+\cdots+X_n

, independence gives

\begin{equation*}\mathbb{E}\exp\left(it\frac{S_n}{\sigma\sqrt{n}}\right)=\left[\varphi\left(\frac{t}{\sigma\sqrt{n}}\right)\right]^n=\left(1-\frac{t^2}{2n}+o\left(\frac{1}{n}\right)\right)^n.\end{equation*}

Thus the desired limiting characteristic function should be

e^{-t^2/2}

, which is the characteristic function of the standard normal distribution. The only point needing care is that the expression above is complex-valued, so we use the following elementary extension of the familiar limit

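(1+c/n)^n\to e^c

: if complex numbers

c_n\to c

, then

\begin{equation*}\left(1+\frac{c_n}{n}\right)^n\to e^c.\end{equation*}

Applying this with

c_n=n\left(\varphi\left(\frac{t}{\sigma\sqrt{n}}\right)-1\right)\to-\frac{t^2}{2}

shows that the characteristic functions above converge to

e^{-t^2/2}

for every fixed real

t

.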

By the continuity theorem for characteristic functions, this convergence of characteristic functions implies

\begin{equation*}\frac{S_n}{\sigma\sqrt{n}}\xrightarrow{d}N(0,1)\end{equation*}

in the centered case. Applying this to the centered variables

X_i-\mu

gives the general statement.
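The convergence of characteristic functions used in the proof can be observed numerically. The sketch below takes, as an illustrative choice, a centered fair die (a roll minus 7/2, with variance 35/12) and compares the characteristic function of the normalized sum with the Gaussian limit at one fixed value of t. The function names are hypothetical.

```python
import cmath
import math

SIGMA = math.sqrt(35.0 / 12.0)  # standard deviation of one fair die roll


def phi(t):
    """Characteristic function of a centered fair die, X = Y - 7/2."""
    return sum(cmath.exp(1j * t * (k - 3.5)) for k in range(1, 7)) / 6.0


def normalized_sum_cf(t, n):
    """Characteristic function of S_n / (sigma * sqrt(n)), using independence."""
    return phi(t / (SIGMA * math.sqrt(n))) ** n


def gaussian_cf(t):
    """Characteristic function of the standard normal distribution."""
    return math.exp(-t * t / 2.0)


t = 1.5
sizes = (1, 10, 100, 1000)
errors = [abs(normalized_sum_cf(t, n) - gaussian_cf(t)) for n in sizes]
for n, err in zip(sizes, errors):
    print(n, err)
```

The centered die is symmetric, so its characteristic function happens to be real up to rounding. The code nevertheless works with complex numbers throughout, which is the general situation the proof has to handle.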

References

[Durrett] Durrett, R. Probability: Theory and Examples. Cambridge University Press, 2019.
