Harmonic Analysis

Fourier analysis, singular integrals, wavelets, and representation theory.


Harmonic analysis is the art and science of decomposing functions into simpler oscillatory components — sines, cosines, complex exponentials — and then reconstructing them from those pieces. Born from the study of vibrating strings and heat flow in the eighteenth and nineteenth centuries, the subject has grown into one of the most powerful and far-reaching fields in mathematics, touching partial differential equations, number theory, signal processing, and the abstract theory of groups. The central miracle is that a single function, no matter how complicated, can be faithfully encoded by the collection of its frequency components and recovered from them exactly.

Fourier Series and Convergence

The story begins with periodic functions — functions that repeat themselves after a fixed interval. For concreteness, consider functions defined on the circle T=R/Z\mathbb{T} = \mathbb{R}/\mathbb{Z}, which we can think of as functions f:[0,1)Cf: [0, 1) \to \mathbb{C} with f(0)=f(1)f(0) = f(1). The guiding insight, due to Jean-Baptiste Joseph Fourier in his 1822 Théorie analytique de la chaleur, is that any reasonable periodic function can be written as a superposition of pure oscillations of the form e2πinxe^{2\pi i n x} for integers nn.

The Fourier coefficients of a function fL1(T)f \in L^1(\mathbb{T}) are defined by

f^(n)=01f(x)e2πinxdx,nZ.\hat{f}(n) = \int_0^1 f(x)\, e^{-2\pi i n x}\, dx, \qquad n \in \mathbb{Z}.

The corresponding Fourier series is the formal expression n=f^(n)e2πinx\sum_{n=-\infty}^{\infty} \hat{f}(n)\, e^{2\pi i n x}. Whether and in what sense this series converges back to ff is a subtle question that drove a century of mathematical development. The family {e2πinx}nZ\{e^{2\pi i n x}\}_{n \in \mathbb{Z}} forms an orthonormal basis for the Hilbert space L2(T)L^2(\mathbb{T}), with inner product f,g=01f(x)g(x)dx\langle f, g \rangle = \int_0^1 f(x)\overline{g(x)}\, dx. This orthonormality immediately gives Parseval’s identity:

n=f^(n)2=fL22,\sum_{n=-\infty}^{\infty} |\hat{f}(n)|^2 = \|f\|_{L^2}^2,

which says that the total energy of a function equals the sum of the energies of its frequency components.

Pointwise convergence of Fourier series is more delicate. The Dirichlet kernel DN(x)=n=NNe2πinx=sin((2N+1)πx)sin(πx)D_N(x) = \sum_{n=-N}^{N} e^{2\pi i n x} = \frac{\sin((2N+1)\pi x)}{\sin(\pi x)} represents the partial sum operator: the NN-th partial sum of the Fourier series of ff is the convolution SNf=fDNS_N f = f * D_N. Unfortunately, DND_N oscillates wildly and its L1L^1 norm grows like logN\log N, which causes difficulties. For continuous functions, the partial sums need not converge pointwise everywhere; du Bois-Reymond constructed a continuous function whose Fourier series diverges at a point in 1876. A landmark result of Lennart Carleson in 1966 showed, however, that for every fL2(T)f \in L^2(\mathbb{T}), the Fourier series converges to ff almost everywhere — a theorem that resolved a question posed by Luzin and that contributed to Carleson’s 2006 Abel Prize.

Summability methods recover lost convergence by averaging the partial sums. The Fejér kernel KN=1Nk=0N1DkK_N = \frac{1}{N} \sum_{k=0}^{N-1} D_k is non-negative and has L1L^1 norm equal to 11, making it an approximate identity. The Fejér summation theorem states that for any continuous periodic function, the arithmetic means of the partial sums (Cesàro means) converge uniformly. This is a much cleaner result than pointwise convergence and is the cornerstone of classical approximation theory. The Riemann-Lebesgue lemma provides a fundamental constraint: f^(n)0\hat{f}(n) \to 0 as n|n| \to \infty for any fL1(T)f \in L^1(\mathbb{T}). Moreover, the smoothness of ff governs the decay rate of its Fourier coefficients — if ff is kk times continuously differentiable, then f^(n)=O(nk)|\hat{f}(n)| = O(|n|^{-k}).

The Fourier Transform

On the real line R\mathbb{R}, periodicity gives way to general functions in L1(Rn)L^1(\mathbb{R}^n), and the Fourier series is replaced by the Fourier transform. For fL1(Rn)f \in L^1(\mathbb{R}^n), the Fourier transform is

f^(ξ)=Rnf(x)e2πiξxdx,ξRn.\hat{f}(\xi) = \int_{\mathbb{R}^n} f(x)\, e^{-2\pi i \xi \cdot x}\, dx, \qquad \xi \in \mathbb{R}^n.

The variable ξ\xi is the frequency (or wave number) and the transform f^(ξ)\hat{f}(\xi) measures how much of the frequency ξ\xi is present in ff. The Fourier transform has a constellation of remarkable algebraic properties that make it indispensable across mathematics and physics. It converts differentiation into multiplication: if ff is differentiable, then jf^(ξ)=2πiξjf^(ξ)\widehat{\partial_j f}(\xi) = 2\pi i \xi_j \hat{f}(\xi). This is the key that unlocks the theory of partial differential equations — an operator like the Laplacian Δ\Delta becomes, under the Fourier transform, simply multiplication by 4π2ξ2-4\pi^2 |\xi|^2.

The transform also interacts beautifully with convolution. The convolution of two functions is defined by (fg)(x)=Rnf(xy)g(y)dy(f * g)(x) = \int_{\mathbb{R}^n} f(x-y) g(y)\, dy, and the fundamental identity fg^=f^g^\widehat{f * g} = \hat{f} \cdot \hat{g} says that convolution in physical space corresponds to pointwise multiplication in frequency space. Conversely, pointwise multiplication in physical space becomes convolution in frequency space. This duality between multiplication and convolution is one of the deepest structural facts in analysis.

The inversion formula says that under suitable conditions on ff and f^\hat{f}, one can recover ff from its transform:

f(x)=Rnf^(ξ)e2πiξxdξ.f(x) = \int_{\mathbb{R}^n} \hat{f}(\xi)\, e^{2\pi i \xi \cdot x}\, d\xi.

This is valid, for example, when both ff and f^\hat{f} belong to L1(Rn)L^1(\mathbb{R}^n). The Fourier transform was pioneered by Fourier himself, formalized rigorously by Cauchy and Dirichlet in the nineteenth century, and extended to its modern functional-analytic form by Norbert Wiener and others in the early twentieth century. It is the cornerstone of all subsequent developments in harmonic analysis.

Plancherel’s Theorem and L2 Theory

The definition of the Fourier transform as an integral formula f^(ξ)=f(x)e2πiξxdx\hat{f}(\xi) = \int f(x) e^{-2\pi i \xi \cdot x} dx requires fL1(Rn)f \in L^1(\mathbb{R}^n) to make sense — the integral must converge. But the space L2(Rn)L^2(\mathbb{R}^n) of square-integrable functions is the natural setting for energy methods and quantum mechanics, and a function in L2L^2 need not be in L1L^1. The bridge between the two is Plancherel’s theorem, proved by Michel Plancherel in 1910.

The theorem states that the Fourier transform extends uniquely from L1L2L^1 \cap L^2 to a unitary isomorphism of L2(Rn)L^2(\mathbb{R}^n) onto itself. Concretely, for every fL2(Rn)f \in L^2(\mathbb{R}^n),

f^L2=fL2,\|\hat{f}\|_{L^2} = \|f\|_{L^2},

and the inner product is also preserved: f^,g^=f,g\langle \hat{f}, \hat{g} \rangle = \langle f, g \rangle. This is Parseval’s identity in the L2L^2 setting. The extension is constructed by approximation: for fL2f \in L^2, take a sequence fnL1L2f_n \in L^1 \cap L^2 converging to ff in L2L^2, define f^\hat{f} as the L2L^2-limit of f^n\hat{f}_n, and check that the result is independent of the approximating sequence.

Plancherel’s theorem has profound consequences. It means that the Fourier transform is an invertible linear isometry of L2(Rn)L^2(\mathbb{R}^n), so it preserves all the Hilbert space structure. The fact that it is unitary — not merely an isometry, but one whose inverse is also an isometry — means the inverse Fourier transform is just the adjoint of the Fourier transform. The L2L^2 theory of the Fourier transform is therefore extremely clean and complete.

Beyond L2L^2, the Riesz-Thorin interpolation theorem (a complex interpolation method) yields the Hausdorff-Young inequality: for 1p21 \leq p \leq 2 and its conjugate exponent qq (with 1p+1q=1\frac{1}{p} + \frac{1}{q} = 1),

f^LqfLp.\|\hat{f}\|_{L^q} \leq \|f\|_{L^p}.

This interpolates between the trivial L1LL^1 \to L^\infty bound f^f1\|\hat{f}\|_\infty \leq \|f\|_1 and the Plancherel isometry L2L2L^2 \to L^2. The Hardy spaces HpH^p — spaces of functions whose Fourier transform is supported on the positive half-line (or more generally, functions that extend to analytic functions in the upper half-plane with appropriate growth bounds) — provide the right function spaces for studying the boundary behavior of analytic functions and the endpoint behavior of singular integral operators.

Schwartz Space and Distributions

Both L1(Rn)L^1(\mathbb{R}^n) and L2(Rn)L^2(\mathbb{R}^n) have shortcomings as domains for the Fourier transform. The Schwartz space S(Rn)\mathcal{S}(\mathbb{R}^n), introduced by Laurent Schwartz in his development of distribution theory (for which he won the Fields Medal in 1950), is the ideal domain. A function f:RnCf: \mathbb{R}^n \to \mathbb{C} belongs to S(Rn)\mathcal{S}(\mathbb{R}^n) if it is smooth and all its partial derivatives decay faster than any polynomial at infinity:

supxRnxαβf(x)<for all multi-indices α,β.\sup_{x \in \mathbb{R}^n} |x^\alpha \partial^\beta f(x)| < \infty \quad \text{for all multi-indices } \alpha, \beta.

The Schwartz space is invariant under the Fourier transform — the transform of a Schwartz function is again a Schwartz function. Moreover, the Fourier transform is a topological automorphism of S(Rn)\mathcal{S}(\mathbb{R}^n). The space is dense in every Lp(Rn)L^p(\mathbb{R}^n) for 1p<1 \leq p < \infty, making it a convenient dense subspace for approximation arguments.

The dual space S(Rn)\mathcal{S}'(\mathbb{R}^n) is the space of tempered distributions — continuous linear functionals on S(Rn)\mathcal{S}(\mathbb{R}^n). Every locally integrable function with at most polynomial growth defines a tempered distribution via Tf(φ)=f(x)φ(x)dxT_f(\varphi) = \int f(x)\varphi(x)\, dx. But S\mathcal{S}' also contains objects with no classical interpretation: the Dirac delta δ0(φ)=φ(0)\delta_0(\varphi) = \varphi(0) is a tempered distribution, as is the derivative of any distribution. The Fourier transform extends from S\mathcal{S} to S\mathcal{S}' by duality: T^(φ)=T(φ^)\hat{T}(\varphi) = T(\hat{\varphi}). Under this extension, δ0^=1\widehat{\delta_0} = 1 (the constant function) and 1^=δ0\hat{1} = \delta_0, a striking illustration of the duality between point concentration in space and perfect spread in frequency.

Distributions provide the natural language for the fundamental solutions of linear PDEs. A fundamental solution of a linear partial differential operator LL is a distribution EE satisfying LE=δ0LE = \delta_0. Once EE is known, the solution to Lu=fLu = f is given by convolution u=Efu = E * f. For the Laplacian in Rn\mathbb{R}^n (n3n \geq 3), the fundamental solution is E(x)=cnx2nE(x) = c_n |x|^{2-n}; the Fourier transform reduces finding EE to the algebraic equation 4π2ξ2E^(ξ)=1-4\pi^2 |\xi|^2 \hat{E}(\xi) = 1, which gives E^(ξ)=14π2ξ2\hat{E}(\xi) = -\frac{1}{4\pi^2 |\xi|^2}. The Whittaker-Shannon-Kotelnikov sampling theorem is another beautiful application: a band-limited function (one whose Fourier transform has compact support) is completely determined by its values at a discrete set of sample points, and can be reconstructed exactly from those samples — the mathematical foundation of all digital audio and signal processing.

Singular Integral Operators

Not every natural operator in analysis is bounded on L2L^2 by a simple Plancherel argument. Singular integral operators are linear operators whose kernels have a non-integrable singularity, yet which turn out to be bounded on LpL^p spaces through subtle cancellation. The systematic study of such operators was developed by Alberto Calderón and Antoni Zygmund in a landmark series of papers beginning in 1952.

The prototypical example is the Hilbert transform on R\mathbb{R}:

(Hf)(x)=p.v.1πf(y)xydy,(Hf)(x) = \text{p.v.}\, \frac{1}{\pi} \int_{-\infty}^{\infty} \frac{f(y)}{x - y}\, dy,

where “p.v.” denotes the principal value — the integral is interpreted as limε0xy>εf(y)xydy\lim_{\varepsilon \to 0} \int_{|x-y| > \varepsilon} \frac{f(y)}{x-y} dy. The Hilbert transform arises naturally in complex analysis (as the operator that maps the real part of a boundary value to its harmonic conjugate), in signal processing (where it produces the analytic signal), and throughout PDE theory. Its Fourier-side action is simply multiplication by isgn(ξ)-i\, \text{sgn}(\xi), which shows immediately that HH is bounded on L2L^2 with HL2L2=1\|H\|_{L^2 \to L^2} = 1. The deep result is that HH is also bounded on Lp(R)L^p(\mathbb{R}) for every 1<p<1 < p < \infty.

The Riesz transforms Rjf=p.v.cnxjyjxyn+1f(y)dyR_j f = \text{p.v.}\, c_n \int \frac{x_j - y_j}{|x-y|^{n+1}} f(y)\, dy are the nn-dimensional generalizations of the Hilbert transform, one for each coordinate direction. They satisfy jRj2=I\sum_j R_j^2 = -I (the negative identity), connecting them to the Laplacian: RjRkΔ=jkR_j R_k \Delta = \partial_j \partial_k. Together with the Hilbert transform, the Riesz transforms are the fundamental building blocks of the Calderón-Zygmund theory.

A Calderón-Zygmund kernel is a function K:Rn{0}CK: \mathbb{R}^n \setminus \{0\} \to \mathbb{C} satisfying: a size condition K(x)Cxn|K(x)| \leq C|x|^{-n}, a smoothness condition K(x)Cxn1|\nabla K(x)| \leq C|x|^{-n-1}, and a cancellation condition requiring that the mean of KK on spheres centered at the origin vanishes. The corresponding singular integral operator Tf=p.v.KfTf = \text{p.v.}\, K * f is bounded on Lp(Rn)L^p(\mathbb{R}^n) for all 1<p<1 < p < \infty, and satisfies a weak (1,1)(1,1) bound: {x:Tf(x)>λ}Cλ1fL1|\{x : |Tf(x)| > \lambda\}| \leq C\lambda^{-1} \|f\|_{L^1}. The proof of LpL^p boundedness uses the Calderón-Zygmund decomposition of a function into a “good” part that is bounded and a “bad” part concentrated on small cubes — a technique of extraordinary versatility that appears throughout modern analysis.

Littlewood-Paley Theory and Wavelets

Littlewood-Paley theory is a technique for analyzing functions by decomposing them in frequency space into dyadic shells and then recombining. The idea is due to John Edensor Littlewood and Raymond Paley in papers from the 1930s, and it provides a unified framework for LpL^p estimates that is far more flexible than direct Fourier analysis.

Fix a smooth function ψ:RnR\psi: \mathbb{R}^n \to \mathbb{R} supported in the annulus {1/2ξ2}\{1/2 \leq |\xi| \leq 2\} such that jZψ(2jξ)=1\sum_{j \in \mathbb{Z}} \psi(2^{-j} \xi) = 1 for ξ0\xi \neq 0. Define the dyadic frequency projections by Pjf^(ξ)=ψ(2jξ)f^(ξ)\widehat{P_j f}(\xi) = \psi(2^{-j}\xi)\hat{f}(\xi), so that PjfP_j f contains only the frequency components of ff in the shell {ξ2j}\{|\xi| \sim 2^j\}. The Littlewood-Paley square function is

Sf(x)=(jZPjf(x)2)1/2.Sf(x) = \left(\sum_{j \in \mathbb{Z}} |P_j f(x)|^2\right)^{1/2}.

The fundamental result of Littlewood-Paley theory is that for 1<p<1 < p < \infty, the LpL^p norm of ff is equivalent to the LpL^p norm of SfSf:

cpfLpSfLpCpfLp.c_p \|f\|_{L^p} \leq \|Sf\|_{L^p} \leq C_p \|f\|_{L^p}.

This equivalence is enormously useful: to prove that an operator TT is bounded on LpL^p, it often suffices to show that TT acts “well” on each dyadic piece PjfP_j f separately and then recombine using the square function. The Littlewood-Paley square function estimate is the engine behind the Mikhlin multiplier theorem and the Hörmander multiplier theorem, which give sufficient conditions on a multiplier m(ξ)m(\xi) — in terms of derivative bounds on dyadic shells — for the operator f^m(ξ)f^\hat{f} \mapsto m(\xi)\hat{f} to be bounded on LpL^p.

Wavelets are a related but distinct tool, developed starting in the 1980s by Yves Meyer, Ingrid Daubechies, Stéphane Mallat, and others. A wavelet is a function ψL2(R)\psi \in L^2(\mathbb{R}) such that the family {ψj,k(x)=2j/2ψ(2jxk)}j,kZ\{\psi_{j,k}(x) = 2^{j/2}\psi(2^j x - k)\}_{j,k \in \mathbb{Z}} forms an orthonormal basis for L2(R)L^2(\mathbb{R}). The parameter jj controls the scale (frequency) and kk controls the translation (position). Unlike Fourier analysis, which localizes perfectly in frequency but not at all in space, wavelets provide simultaneous localization in space and frequency — a property quantified by the uncertainty principle, which states that the product of a function’s spread in space and its spread in frequency is bounded below.

The construction of orthonormal wavelet bases is accomplished through multiresolution analysis (MRA), a nested sequence of closed subspaces V1V0V1L2(R)\cdots \subset V_{-1} \subset V_0 \subset V_1 \subset \cdots \subset L^2(\mathbb{R}) with jVj=L2(R)\overline{\bigcup_j V_j} = L^2(\mathbb{R}) and jVj={0}\bigcap_j V_j = \{0\}. The wavelet ψ\psi is constructed from the orthogonal complement of V0V_0 in V1V_1. Daubechies wavelets, discovered by Ingrid Daubechies in 1988, are the canonical family of compactly supported orthonormal wavelets; they have widespread application in image compression (JPEG 2000 uses wavelets rather than the Fourier-based DCT of JPEG), signal denoising, and numerical analysis. Besov spaces Bp,qsB^s_{p,q} and Triebel-Lizorkin spaces Fp,qsF^s_{p,q} provide a unified family of function spaces — encompassing Sobolev spaces, Hölder spaces, and LpL^p spaces as special cases — that can be characterized precisely in terms of the sizes of the Littlewood-Paley pieces PjfP_j f.

Harmonic Analysis on Groups

The deepest generalization of Fourier analysis replaces the real line or the circle by an arbitrary locally compact abelian group GG. The theory, developed principally by André Weil in his 1940 treatise L’intégration dans les groupes topologiques et ses applications, shows that every locally compact abelian group possesses a unique (up to scalar) translation-invariant measure — the Haar measure — and that a complete Fourier analysis can be developed on GG using this measure.

The dual group G^\hat{G} is the group of continuous homomorphisms from GG to the circle group T\mathbb{T}, called characters. The Fourier transform of fL1(G)f \in L^1(G) is defined by f^(χ)=Gf(x)χ(x)dμ(x)\hat{f}(\chi) = \int_G f(x) \overline{\chi(x)}\, d\mu(x) for χG^\chi \in \hat{G}. The Pontryagin duality theorem states that GG is canonically isomorphic to the dual of G^\hat{G} — that is, G^^G\hat{\hat{G}} \cong G. This profound result unifies and extends the classical duality between a periodic function on R/Z\mathbb{R}/\mathbb{Z} and its Fourier series (indexed by Z\mathbb{Z}), and between a function on R\mathbb{R} and its Fourier transform (also on R\mathbb{R}, since R^R\hat{\mathbb{R}} \cong \mathbb{R}). On any locally compact abelian group, Plancherel’s theorem holds: the Fourier transform extends to a unitary isomorphism from L2(G)L^2(G) to L2(G^)L^2(\hat{G}).

The standard examples illuminate the generality. When G=TG = \mathbb{T} (the circle), G^=Z\hat{G} = \mathbb{Z} and the Fourier transform is the classical Fourier series. When G=RG = \mathbb{R}, G^=R\hat{G} = \mathbb{R} and the transform is the classical Fourier integral. When G=ZG = \mathbb{Z}, G^=T\hat{G} = \mathbb{T} and the transform is the discrete-time Fourier transform of signal processing. When G=Z/NZG = \mathbb{Z}/N\mathbb{Z}, G^=Z/NZ\hat{G} = \mathbb{Z}/N\mathbb{Z} and the transform is the discrete Fourier transform (DFT), computed efficiently by the fast Fourier transform (FFT) algorithm of Cooley and Tukey (1965), one of the most important algorithms of the twentieth century.

For non-abelian locally compact groups, the Fourier transform takes a more complex form: the role of characters is played by unitary irreducible representations. The Peter-Weyl theorem for compact groups states that the matrix coefficients of all irreducible unitary representations form an orthonormal basis for L2(G)L^2(G). For the group SU(2)\mathrm{SU}(2), these are the Wigner DD-matrices, essential in quantum mechanics for describing the rotation of angular momentum states. The Plancherel formula for non-abelian groups expresses fL2(G)2\|f\|_{L^2(G)}^2 as an integral over the unitary dual G^\hat{G} (the set of equivalence classes of irreducible unitary representations) of Hilbert-Schmidt norms of “operator-valued Fourier transforms.” For Lie groups such as SL(2,R)\mathrm{SL}(2, \mathbb{R}), working out this Plancherel decomposition was a central achievement of Harish-Chandra in the 1950s and 1960s, and it is foundational to the Langlands program — one of the deepest ongoing projects in mathematics, connecting representation theory, automorphic forms, and number theory. Harmonic analysis on groups thus serves as both the beginning and the cutting edge of the subject.