Polya's Criterion

Published May 27, 2026

Polya’s criterion, in the context of probability theory, is a condition used to determine whether a function is a characteristic function. Combined with the property that characteristic functions are non-negative definite, Pólya's criterion ensures that the function is non-negative definite.

POLYA'S CRITERION

Let

\varphi(t)

be real nonnegative and have

\varphi(0)=

1, \varphi(t)=\varphi(-t)

, and

\varphi

is decreasing and convex on

(0, \infty)

with

\begin{equation*}\lim _{t \downarrow 0} \varphi(t)=1, \quad \lim _{t \uparrow \infty} \varphi(t)=0\end{equation*}

Then there is a probability measure

\nu

(0, \infty)

, so that

\begin{equation}\varphi(t)=\int_0^{\infty}\left(1-\left|\frac{t}{s}\right|\right)^{+} \nu(d s)\end{equation}

and hence

\varphi

is a characteristic function.

The foolowing proof is due to [Durrett2019] pages 119

The assumption that

\lim _{t \rightarrow 0} \varphi(t)=1

is necessary because the function

\varphi(t)=

1_{\{0\}}(t)

which is 1 at 0 and 0 otherwise satisfies all the other hypotheses.

\quad

Proof. Let

\varphi^{\prime}

be the right derivative of

\phi

, i.e.,

\begin{equation*}\varphi^{\prime}(t)=\lim _{h \backslash 0} \frac{\varphi(t+h)-\varphi(t)}{h}\end{equation*}

Since

\varphi

is convex this exists and is right continuous and increasing. So we can let

\mu

be the measure on

(0, \infty)

with

\begin{equation*}\mu(a, b]=\varphi^{\prime}(b)-\varphi^{\prime}(a) \quad \text{for all } 0 \leq a<b<\infty,\end{equation*}

and let

\nu

be the measure on

(0, \infty)

with

\begin{equation*}d \nu / d \mu=s.\end{equation*}

Since

\varphi

is decreasing, we have

\varphi'(t)\le 0

for all

t>0

. Also, because

\varphi

is convex, the function

\varphi'

is increasing, so the limit

\begin{equation*}\ell:=\lim_{t\to\infty}\varphi'(t)\end{equation*}

exists and satisfies

\ell\le 0

. In fact

\ell=0

. Indeed, if

\ell<0

, then for some

\epsilon>0

we would have

\varphi'(t)\le -\epsilon

for all sufficiently large

t

, and hence

\varphi(t)

would eventually decrease at least linearly, which would force

\varphi(t)<0

for large

t

. This contradicts the assumption that

\varphi

is nonnegative. Therefore

\begin{equation*}\lim_{t\to\infty}\varphi'(t)=0.\end{equation*}

Now, by the definition of

\mu

, for every

s>0

we have

\begin{equation*}\mu((s,\infty))=\lim_{b\to\infty}\mu((s,b])=\lim_{b\to\infty}\bigl(\varphi'(b)-\varphi'(s)\bigr)=-\varphi'(s).\end{equation*}

Since

d\nu/d\mu=r

, we have

\nu(dr)=r\,\mu(dr)

, and therefore

r^{-1}\nu(dr)=\mu(dr)

. Hence

\begin{equation*}-\varphi^{\prime}(s)=\mu((s,\infty))=\int_{(s,\infty)} \mu(dr)=\int_{(s,\infty)} r^{-1} \nu(d r).\end{equation*}

Since

\lim_{u\to\infty}\varphi(u)=0

, we also have for every

t\ge 0

\begin{equation*}\varphi(t)=-\int_t^\infty \varphi'(s)\,ds.\end{equation*}

Substituting the previous formula for

-\varphi'(s)

and using Fubini's theorem, we get for

t \geq 0

\begin{equation*}\varphi(t) =\int_t^{\infty} \int_s^{\infty} r^{-1} \nu(d r) d s=\int_t^{\infty} r^{-1} \int_t^r d s \nu(d r) =\int_t^{\infty}\left(1-\frac{t}{r}\right) \nu(d r)=\int_0^{\infty}\left(1-\frac{t}{r}\right)^{+} \nu(d r)\end{equation*}

Using

\varphi(-t)=\varphi(t)

to extend the formula to

t \leq 0

we have

(1)

. Setting

t=0

(1)

shows

\nu

has total mass 1.

\varphi

is piece-wise linear,

\nu

has a finite number of atoms and the result follows from fact that weighted average of characteristic function is again characteristic function. To prove the general result, let

\nu_n

be a sequence of measures on

(0, \infty)

with a finite number of atoms that converges weakly to

\nu

and let

\begin{equation*}\varphi_n(t)=\int_0^{\infty}\left(1-\left|\frac{t}{s}\right|\right)^{+} \nu_n(d s)\end{equation*}

Since

s \rightarrow(1-|t / s|)^{+}

is bounded and continuous,

\varphi_n(t) \rightarrow \varphi(t)

and the desired result follows from Levy's Continuity Theorem.

\Box

Examples of

\varphi_1(t)

and

\varphi_2(t)

We can apply Polya's criterion to verify whether certain functions are characteristic. Simple examples which can be deduced to characteristic from the plot are

\begin{align*}\varphi_1(t) &= \exp(-|t|) \\\varphi_2(t) &= \begin{cases} 1 - |t|, & |t| \leq 1 , \\ 0, & |t|>1 .\end{cases}\end{align*}

We can see that in the case

1 < \alpha < 2

the function

\exp(-|t|^\alpha)

is not convex and Polya's criterion can be applied directly

THEOREM

\exp(-|t|^\alpha)

is a characteristic function for

0 < \alpha < 2

The case

\alpha = 1

corresponds to the Cauchy distribution and the case

\alpha = 2

corresponds to the standard normal distribution.

\quad

Proof. The key idea is to approximate the function

\exp(-|t|^\alpha)

by a sequence of characteristic functions and then apply Lévy's Continuity Theorem at the end.

The function which help us to make approximation is

\begin{equation*}\psi(t) = 1 - (1 - \cos t)^{\alpha/2}.\end{equation*}

Then for any for any

\beta

and

|x| < 1

we have the formula

\begin{equation*}(1 - x)^\beta = \sum_{n=0}^{\infty} \binom{\beta}{n} (-x)^n,\qquad\binom{\beta}{n} = \frac{\beta (\beta - 1) \cdots (\beta - n + 1)}{1 \cdot 2 \cdot \ldots \cdot n}.\end{equation*}

Now we can represent

\psi(t)

as a infinite series

\begin{equation*}\psi(t) = 1 - (1 - \cos t)^{\alpha/2} = \sum_{n=1}^{\infty} c_n (\cos t)^n, \qquad c_n = \binom{\alpha/2}{n} (-1)^{n+1}.\end{equation*}

c_n \geq 0

(we used

\alpha < 2

), and

\sum_n c_n = 1

(take

t=0

in the definition of

\psi(t)

). To confirm that

\psi(t)

is a characteristic function, first note that

\begin{equation*}\mathbb{P}(X = 1) = \mathbb{P}(X = -1) = 1/2 \quad \Longrightarrow \quad \mathbb{E}e^{itX} = \frac{e^{it} + e^{-it}}{2} = \cos t.\end{equation*}

\cos t

is a characteristic function. Moreover, if

X_1,\dots,X_n

are independent random variables with

\begin{equation*}\mathbb{P}(X_k=1)=\mathbb{P}(X_k=-1)=\tfrac12 \quad \Longrightarrow \quad \mathbb{E}e^{it(X_1+\cdots+X_n)} = \prod_{k=1}^n \mathbb{E}e^{itX_k} = (\cos t)^n.\end{equation*}

Hence

(\cos t)^n

is also a characteristic function. Since a weighted average of characteristic functions remains a characteristic function, it follows that

\psi(t)

is indeed a characteristic function.

From analysis

1 - \cos t \sim t^2/2

t \to 0

, so

\begin{equation}1 - \cos( \frac{\sqrt{2} t}{n^{1/\alpha}}) \sim \frac{t^2}{n^{2/\alpha}}.\end{equation}

Using the above-mentioned lemma, we get the pointwise limit

\begin{equation*}\lim_{n \to \infty} \{\psi(\frac{\sqrt{2} t}{n^{1/\alpha}})\}^n = \lim_{n \to \infty} \left\{ 1 - (1 - \cos( \frac{\sqrt{2} t}{n^{1/\alpha}}))^{\alpha/2} \right\}^n = \exp(-|t|^\alpha).\end{equation*}

Since each function

\{\psi(\frac{\sqrt{2} t}{n^{1/\alpha}})\}^n

is a characteristic function, Lévy's Continuity Theorem implies that

\exp(-|t|^\alpha)

is also a characteristic function.

\Box

References

[Durrett2019]Rick Durrett. Probability Theory and Examples. Fifth edition. 2019.

Polya's Criterion

Proof of Polya's criterion

Radon-Nikodym measure

Integrating over new measure

Approximating by atomic measures

Applications of Polya's criterion

Exponent with power less two

References