Characteristic Functions

A tool that always exists, even when moments don't.

Characteristic functions are one of the most powerful tools in probability theory. Unlike moments, they always exist and uniquely determine the distribution. This makes them essential for studying fat-tailed distributions where moments may be infinite.

Definition and Intuition

Definition

Characteristic Function

The characteristic function of a random variable $X$ is:

$$\varphi_X(t) = \mathbb{E}\left[e^{itX}\right]$$

where $i$ is the imaginary unit and $t \in \mathbb{R}$.

Using Euler's formula $e^{itX} = \cos(tX) + i\sin(tX)$, this can be written as:

$$\varphi_X(t) = \mathbb{E}[\cos(tX)] + i\,\mathbb{E}[\sin(tX)]$$

The characteristic function is essentially the Fourier transform of the probability distribution. It encodes all information about the distribution in the frequency domain.
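
To see the definition in action, here is a minimal Monte Carlo sketch (Python with NumPy; the helper name `empirical_cf` is ours) that estimates $\varphi_X(t) = \mathbb{E}[e^{itX}]$ by averaging $e^{itx}$ over samples and checks it against the known answer $e^{-t^2/2}$ for a standard normal:

```python
import numpy as np

def empirical_cf(samples, t):
    """Monte Carlo estimate of phi_X(t) = E[exp(itX)]."""
    # Average exp(i t x) over the samples; np.outer gives one row per t.
    return np.exp(1j * np.outer(t, samples)).mean(axis=1)

rng = np.random.default_rng(0)
x = rng.normal(size=100_000)  # X ~ N(0, 1)

t = np.linspace(-3, 3, 7)
print(np.round(empirical_cf(x, t).real, 3))  # imaginary part is ~0 here
print(np.round(np.exp(-t**2 / 2), 3))        # exact CF of N(0, 1)
```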

Fundamental Properties

1. Always Exists

Key Insight

No Existence Problems

Unlike moments, the characteristic function always exists for any random variable. This is because $|e^{itX}| = 1$ for all real $t$, so the expected value is always bounded:

$$|\varphi_X(t)| = \left|\mathbb{E}\left[e^{itX}\right]\right| \leq \mathbb{E}\left[\left|e^{itX}\right|\right] = 1$$

Even when $\mathbb{E}[X]$ or $\mathbb{E}[X^2]$ is infinite, the characteristic function is perfectly well-defined.
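
A quick numerical illustration of this point, assuming NumPy: for Cauchy samples the running sample mean never settles (the mean does not exist), yet the empirical characteristic function stays bounded by 1 and converges to the exact value $e^{-|t|}$.

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.standard_cauchy(size=1_000_000)  # no mean, no variance

# The running sample mean wanders forever...
for n in (10**3, 10**4, 10**5, 10**6):
    print(f"n={n:>8}: sample mean = {x[:n].mean():8.3f}")

# ...but the empirical CF at t = 1 converges to exp(-1) ~ 0.3679.
phi_hat = np.exp(1j * 1.0 * x).mean()
print(f"empirical phi(1) = {phi_hat:.4f}, exact = {np.exp(-1.0):.4f}")
```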

2. Uniquely Determines the Distribution

Definition

Uniqueness Theorem

Two random variables have the same distribution if and only if they have the same characteristic function:

$$X \stackrel{d}{=} Y \iff \varphi_X(t) = \varphi_Y(t) \ \text{for all } t \in \mathbb{R}$$

This one-to-one correspondence means we can study distributions entirely through their characteristic functions — no information is lost.

3. Sums of Independent Variables

Definition

Convolution Property

If $X$ and $Y$ are independent, then:

$$\varphi_{X+Y}(t) = \varphi_X(t)\,\varphi_Y(t)$$

Read: The characteristic function of X plus Y equals phi X of t times phi Y of t

For independent variables, the characteristic function of the sum is the product of the individual characteristic functions

This is incredibly useful — finding the distribution of a sum (which involves convolution of PDFs, a difficult operation) becomes simple multiplication of characteristic functions.
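
A sketch of the property in action, using two independent exponentials (any independent pair would do): the empirical characteristic function of the sum matches the product of the individual ones up to Monte Carlo error.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 500_000
x = rng.exponential(scale=1.0, size=n)  # X ~ Exp(1)
y = rng.exponential(scale=2.0, size=n)  # Y independent of X

def ecf(samples, t):
    # Empirical characteristic function at a single point t.
    return np.exp(1j * t * samples).mean()

t = 0.7
lhs = ecf(x + y, t)          # phi_{X+Y}(t)
rhs = ecf(x, t) * ecf(y, t)  # phi_X(t) * phi_Y(t)
print(np.round(lhs, 4), np.round(rhs, 4))  # agree up to sampling error
```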

Important Examples

Example

Normal Distribution

For $X \sim \mathcal{N}(\mu, \sigma^2)$:

$$\varphi_X(t) = \exp\left(i\mu t - \frac{\sigma^2 t^2}{2}\right)$$

The sum of two independent normals: $\mathcal{N}(\mu_1, \sigma_1^2) + \mathcal{N}(\mu_2, \sigma_2^2) \stackrel{d}{=} \mathcal{N}(\mu_1 + \mu_2, \sigma_1^2 + \sigma_2^2)$. This follows directly from multiplying characteristic functions.
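
A small check of that multiplication, using the closed-form normal characteristic function above (the parameter values are arbitrary):

```python
import numpy as np

def normal_cf(t, mu, sigma2):
    # CF of N(mu, sigma2): exp(i mu t - sigma2 t^2 / 2)
    return np.exp(1j * mu * t - sigma2 * t**2 / 2)

t = np.linspace(-2, 2, 9)
lhs = normal_cf(t, 1.0, 4.0) * normal_cf(t, -2.0, 9.0)  # product of CFs
rhs = normal_cf(t, 1.0 + (-2.0), 4.0 + 9.0)             # CF of N(-1, 13)
print(np.allclose(lhs, rhs))  # True: the product is again a normal CF
```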

Example

Cauchy Distribution

The standard Cauchy distribution (which has no mean!) has characteristic function:

$$\varphi_X(t) = e^{-|t|}$$

Even though we cannot compute $\mathbb{E}[X]$, the characteristic function is simple and well-defined. The fact that it's non-differentiable at $t = 0$ reflects the non-existence of moments.

Example

Verifying the Convolution Property

If $X_1, X_2$ are independent standard Cauchy:

$$\varphi_{X_1 + X_2}(t) = e^{-|t|} \cdot e^{-|t|} = e^{-2|t|}$$

This is the characteristic function of a Cauchy distribution with scale 2. So the sum of two Cauchy random variables is again Cauchy — a property called stability.
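
This stability claim is easy to test numerically. A sketch assuming SciPy: compare the sum of two standard Cauchy samples against a Cauchy with scale 2 using a Kolmogorov–Smirnov test.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
n = 200_000
x1 = rng.standard_cauchy(size=n)
x2 = rng.standard_cauchy(size=n)

# Under the stability claim, X1 + X2 is exactly Cauchy(scale=2).
result = stats.kstest(x1 + x2, stats.cauchy(scale=2).cdf)
print(result.pvalue)  # not systematically small: consistent with the claim
```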

Connection to Moments

When moments exist, they can be extracted from the characteristic function via derivatives:

Definition

Moments from Characteristic Functions

If $\mathbb{E}[|X|^n] < \infty$, then:

$$\mathbb{E}[X^n] = \frac{\varphi_X^{(n)}(0)}{i^n}$$

where $\varphi_X^{(n)}(0)$ is the $n$-th derivative of $\varphi_X$ evaluated at $t = 0$.

In particular:

$$\mathbb{E}[X] = \frac{\varphi_X'(0)}{i}, \qquad \mathbb{E}[X^2] = -\varphi_X''(0)$$

The characteristic function is $n$ times differentiable at $0$ whenever the $n$-th absolute moment exists. When moments don't exist (as for the Cauchy), the corresponding derivatives at $0$ don't exist either.
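
As an illustration, the sketch below recovers $\mathbb{E}[X]$ and $\mathbb{E}[X^2]$ for a normal distribution by taking finite-difference derivatives of its exact characteristic function at $t = 0$ (the values $\mu = 1.5$, $\sigma^2 = 4$ are arbitrary):

```python
import numpy as np

def phi(t, mu=1.5, sigma2=4.0):
    # Exact CF of N(1.5, 4).
    return np.exp(1j * mu * t - sigma2 * t**2 / 2)

h = 1e-5
d1 = (phi(h) - phi(-h)) / (2 * h)              # central difference: phi'(0)
d2 = (phi(h) - 2 * phi(0.0) + phi(-h)) / h**2  # second difference: phi''(0)

print((d1 / 1j).real)     # ~1.50 = E[X]
print((d2 / 1j**2).real)  # ~6.25 = E[X^2] = sigma2 + mu^2
```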

Stable Distributions

Characteristic functions are essential for defining and studying stable distributions, which are the fat-tailed generalizations of the normal distribution.

Definition

Stable Distribution

A distribution is stable if for i.i.d. random variables $X_1, X_2$ from that distribution, there exist constants $a > 0$ and $b$ such that:

$$X_1 + X_2 \stackrel{d}{=} aX + b$$

where $X$ has the same distribution as $X_1$ and $X_2$, and $\stackrel{d}{=}$ denotes equality in distribution.

Read: X_1 plus X_2 is equal in distribution to a times X plus b

The sum of two copies looks like a scaled and shifted single copy — the shape is preserved

In terms of characteristic functions, stability means:

$$\varphi(t)^2 = \varphi(at)\,e^{ibt}$$

More generally, for $n$ copies:

$$\varphi(t)^n = \varphi(a_n t)\,e^{i b_n t}, \quad \text{with } a_n = n^{1/\alpha}$$

Key Insight

Why Stable Distributions Matter

Stable distributions are the only possible limits in generalized central limit theorems. When the variance is infinite, appropriately normalized sums don't converge to a normal; they converge to a stable distribution with parameter $\alpha < 2$.

The normal ($\alpha = 2$) and Cauchy ($\alpha = 1$) are special cases. The general stable distribution has:

$$\varphi(t) = \exp\left(i\mu t - |ct|^\alpha \left[1 - i\beta\,\mathrm{sgn}(t)\tan\frac{\pi\alpha}{2}\right]\right)$$

(for $\alpha \neq 1$; the formula is slightly different at $\alpha = 1$)
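
A recent SciPy exposes this family as `scipy.stats.levy_stable`. The sketch below draws samples at $\alpha = 1.5$, $\beta = 0$ (a symmetric case, which is strictly stable, so $b = 0$) and checks the scaling $X_1 + X_2 \stackrel{d}{=} 2^{1/\alpha} X$ by comparing quantiles:

```python
import numpy as np
from scipy import stats

alpha = 1.5
rng = np.random.default_rng(4)
dist = stats.levy_stable(alpha, 0.0)  # shape parameters: alpha, beta

x1 = dist.rvs(size=100_000, random_state=rng)
x2 = dist.rvs(size=100_000, random_state=rng)

# Strict stability predicts X1 + X2 =d 2^(1/alpha) * X.
q = [0.05, 0.25, 0.5, 0.75, 0.95]
print(np.round(np.quantile(x1 + x2, q), 3))
print(np.round(2 ** (1 / alpha) * dist.ppf(q), 3))  # should roughly match
```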

Example

The Stability Parameter α

The parameter $\alpha \in (0, 2]$ controls the tail behavior:

  • $\alpha = 2$: Normal distribution (all moments exist)
  • $\alpha = 1$: Cauchy distribution (no moments exist)
  • $1 < \alpha < 2$: Mean exists but variance is infinite
  • $\alpha < 1$: Even the mean is infinite

Smaller $\alpha$ means heavier tails. The tail probability decays like $P(|X| > x) \sim x^{-\alpha}$ for large $x$.
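
A rough numerical check of this power-law decay, again assuming SciPy's `levy_stable`: estimate the log-log slope of the empirical survival function for $\alpha = 1.5$ samples.

```python
import numpy as np
from scipy import stats

alpha = 1.5
rng = np.random.default_rng(5)
x = np.abs(stats.levy_stable(alpha, 0.0).rvs(size=500_000, random_state=rng))

# P(|X| > u) ~ C * u^(-alpha), so log P should be linear in log u
# with slope -alpha.
levels = np.array([10.0, 20.0, 40.0, 80.0])
surv = np.array([(x > u).mean() for u in levels])
slope = np.polyfit(np.log(levels), np.log(surv), 1)[0]
print(slope)  # roughly -1.5
```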

Key Takeaways

  • The characteristic function is the Fourier transform of the distribution
  • Always exists — even when moments don't (unlike moment generating functions)
  • Uniquely determines the distribution — no information is lost
  • For independent variables: $\varphi_{X+Y}(t) = \varphi_X(t)\,\varphi_Y(t)$
  • Moments (when they exist) can be recovered from derivatives at $t = 0$
  • Stable distributions are characterized by $X_1 + X_2 \stackrel{d}{=} aX + b$ — their shape is preserved under summation
  • The stability parameter $\alpha$ controls tail heaviness: smaller $\alpha$ means fatter tails

Module Conclusion

You now have the measure-theoretic vocabulary to engage with Taleb's more technical arguments. The key insights are: probability is built on measure theory; there are multiple types of convergence with different implications for limit theorems; and characteristic functions provide a universal language for studying distributions even when moments fail. With these tools, you can appreciate why fat tails pose fundamental challenges to standard statistical methods — it's not just about extreme events being more likely, but about the very foundations of what we can know from samples.