Polya’s criterion, in the context of probability theory, is a condition used to determine whether a function is a characteristic function.
Combined with the property that characteristic functions are non-negative definite, Pólya's criterion ensures that the function is non-negative definite.
Theorem (Polya's criterion)
Let φ(t) be real nonnegative and have φ(0)=1,φ(t)=φ(−t), and φ is decreasing and convex on (0,∞) with
t↓0limφ(t)=1,t↑∞limφ(t)=0
Then there is a probability measure ν on (0,∞), so that
φ(t)=∫0∞(1−st)+ν(ds)
and hence φ is a characteristic function.
Proof of Polya's criterion
The foolowing proof is due to [Durrett2019] pages 119
The assumption that limt→0φ(t)=1 is necessary because the function φ(t)=1{0}(t) which is 1 at 0 and 0 otherwise satisfies all the other hypotheses. We could allow limt→∞φ(t)=c>0 by having a point mass of size c at 0, but we leave this extension to the reader.
Radon-Nikodym measure
Proof. Let φ′ be the right derivative of ϕ, i.e.,
φ′(t)=h\0limhφ(t+h)−φ(t)
Since φ is convex this exists and is right continuous and increasing. So we can let μ be the measure on (0,∞) with μ(a,b]=φ′(b)−φ′(a) for all 0≤a<b<∞, and let ν be the measure on (0,∞) with dν/dμ=s.
Integrating over new measure
Now φ′(t)→0 as t→∞ (for if φ′(t)↓−ϵ we would have φ(t)≤1−ϵt for all t ), so
−φ′(s)=∫s∞r−1ν(dr)
Integrating again and using Fubini's theorem we have for t≥0
Using φ(−t)=φ(t) to extend the formula to t≤0 we have (*). Setting t=0 in (∗) shows ν has total mass 1.
Non smooth case
If φ is piece-wise linear, ν has a finite number of atoms and the result follows from fact that weighted average of characteristic function is again characteristic function. To prove the general result, let νn be a sequence of measures on (0,∞) with a finite number of atoms that converges weakly to ν and let
φn(t)=∫0∞(1−st)+νn(ds)
Since s→(1−∣t/s∣)+is bounded and continuous, φn(t)→φ(t) and the desired result follows from Levy's Continuity Theorem. □
Applications of Polya's criterion
We can apply Polya's criterion to verify whether certain functions are characteristic. Simple examples which can be deduced to characteristic from the plot are
φ1(t)φ2(t)=exp(−∣t∣)={1−∣t∣,0,∣t∣≤1,∣t∣>1.
Exponent with power less two
We need helper lemma to prove the main theorem.
Lemma
If cj→0, aj→∞, and ajcj→λ, then (1+cj)aj→eλ.
Proof. As x→0, xlog(1+x)→1, so ajlog(1+cj)→λ, and the desired result follows. □
The case α=1 correspond to the Cauchy distribution and the case α=2 correspond to a standard distribution.
Theorem
exp(−∣t∣α) is a characteristic function for 0<α<2
The case α=1 correspond to the Cauchy distribution and the case α=2 correspond to a standard distribution.
Proof. The key idea of the proof is to approximate the function exp(−∣t∣α) as a sequence of characteristic functions. By applying Lévy's Continuity Theorem, it follows that exp(−∣t∣α) is characteristic function as well.
The function which help us to make approximation is
ψ(t)=1−(1−cost)α/2.
Then for any for any β and ∣x∣<1 we have the formula
(1−x)β=n=0∑∞(nβ)(−x)n
where
(nβ)=1⋅2⋅…⋅nβ(β−1)⋯(β−n+1).
Now we can represent ψ(t) as a infinite series
ψ(t)=1−(1−cost)α/2=n=1∑∞cn(cost)n,
where
cn=(nα/2)(−1)n+1,
cn≥0(we used α<2), and ∑ncn=1 (take t=0 in the definition of ψ(t)). \\
To confirm that ψ(t) is a characteristic function, we utilize the fact that cos(t) is the characteristic function of a coin flip, i.e., P(X=1)=P(X=−1)=1/2. By using the binomial coefficients, we can show that cosn(t) is also a characteristic function. Since a weighted average of characteristic functions remains a characteristic function, it follows that ψ(t) is indeed a characteristic function. \\
From analysis 1−cost∼t2/2 as t→0, so
1−cos(n1/α2t)∼n2/αt2.
The representation of exp(−∣t∣α) as limit of characteristic functions is due to Lévy's Continuity Theorem and above-mentioned lemma,