Algebraic Number Theory

Number fields, rings of integers, ideals, class groups, and p-adic numbers.


Algebraic number theory grew out of nineteenth-century attempts to prove Fermat’s Last Theorem by working not with ordinary integers but with richer number systems where new factorizations become possible. What emerged was a discipline that reveals deep structural truths about how integers, primes, and polynomial equations interlock — truths invisible to the elementary approach but laid bare once you enlarge the arena from Z\mathbb{Z} to the rings of integers of arbitrary number fields. The subject stands at a crossroads: it extends the ideas of quadratic reciprocity and abstract algebra into a unified framework that today underpins the Langlands program, the proof of Fermat’s Last Theorem, and large swaths of modern cryptography.

Algebraic Numbers and Algebraic Integers

The most natural way to extend the rational numbers is to adjoin solutions of polynomial equations. A complex number α\alpha is called an algebraic number if it satisfies some non-zero polynomial with rational coefficients: there exist a0,a1,,anQa_0, a_1, \ldots, a_n \in \mathbb{Q}, not all zero, such that

anαn+an1αn1++a1α+a0=0.a_n \alpha^n + a_{n-1} \alpha^{n-1} + \cdots + a_1 \alpha + a_0 = 0.

Every rational number is trivially algebraic (it satisfies xq=0x - q = 0), but so is 2\sqrt{2} (satisfying x22=0x^2 - 2 = 0), the primitive cube root of unity ω=e2πi/3\omega = e^{2\pi i/3} (satisfying x2+x+1=0x^2 + x + 1 = 0), and every root of every polynomial with rational coefficients. Numbers that are not algebraic — such as π\pi and ee, as proved by Lindemann in 1882 and Hermite in 1873 respectively — are called transcendental.

Among algebraic numbers, a distinguished subclass plays the role that ordinary integers play among rationals. An algebraic number α\alpha is called an algebraic integer if it satisfies a monic polynomial with integer coefficients:

αn+cn1αn1++c1α+c0=0,ciZ.\alpha^n + c_{n-1}\alpha^{n-1} + \cdots + c_1 \alpha + c_0 = 0, \quad c_i \in \mathbb{Z}.

The requirement that the leading coefficient be 11 is precisely what distinguishes integers from fractions: the rational number 32\frac{3}{2} satisfies 2x3=02x - 3 = 0 (with rational coefficients) but satisfies no monic polynomial with integer coefficients, so it is not an algebraic integer. The set of all algebraic integers forms a ring — sums and products of algebraic integers are again algebraic integers — a fact that is elegant but not obvious.

The minimal polynomial of an algebraic number α\alpha over Q\mathbb{Q} is the unique monic polynomial of least degree in Q[x]\mathbb{Q}[x] that has α\alpha as a root. It is always irreducible over Q\mathbb{Q}. The degree of α\alpha is the degree of its minimal polynomial, which equals the dimension [Q(α):Q][\mathbb{Q}(\alpha) : \mathbb{Q}] of the field extension generated by α\alpha. The norm N(α)N(\alpha) and trace Tr(α)\text{Tr}(\alpha) are, respectively, the product and the sum of all Galois conjugates of α\alpha — the roots of its minimal polynomial. These quantities are always rational, and when α\alpha is an algebraic integer, they are ordinary integers. Norms and traces are the algebraic number theorist’s primary bridge between the abstract and the concrete.

The historical impetus came from Kummer’s 1844–1847 attempts to prove Fermat’s Last Theorem by factoring xp+ypx^p + y^p in the ring Z[ζp]\mathbb{Z}[\zeta_p] of integers in the cyclotomic field Q(ζp)\mathbb{Q}(\zeta_p), where ζp=e2πi/p\zeta_p = e^{2\pi i/p} is a primitive pp-th root of unity. Kummer discovered that unique factorization fails in these rings — an obstacle he circumvented by introducing ideal numbers, later formalized by Dedekind into the modern theory of ideals.

Number Fields and Rings of Integers

A number field is a finite-degree field extension of Q\mathbb{Q}. Every number field KK has the form K=Q(α)K = \mathbb{Q}(\alpha) for some algebraic number α\alpha by the primitive element theorem, and its degree is [K:Q][K : \mathbb{Q}]. The simplest examples are the quadratic fields Q(d)\mathbb{Q}(\sqrt{d}) for a squarefree integer dd, which have degree 22, and the cyclotomic fields Q(ζn)\mathbb{Q}(\zeta_n), which have degree φ(n)\varphi(n) and play a central role in the theory of reciprocity laws.

Inside any number field KK, the ring of integers OK\mathcal{O}_K consists of all elements of KK that are algebraic integers. It is always a free Z\mathbb{Z}-module of rank [K:Q][K : \mathbb{Q}], meaning there exists an integral basis {ω1,,ωn}\{\omega_1, \ldots, \omega_n\} such that every element of OK\mathcal{O}_K can be written uniquely as c1ω1++cnωnc_1 \omega_1 + \cdots + c_n \omega_n with ciZc_i \in \mathbb{Z}. For the quadratic field K=Q(d)K = \mathbb{Q}(\sqrt{d}), the ring of integers is

OK={Z ⁣[1+d2]if d1(mod4),Z[d]if d2 or 3(mod4).\mathcal{O}_K = \begin{cases} \mathbb{Z}\!\left[\tfrac{1+\sqrt{d}}{2}\right] & \text{if } d \equiv 1 \pmod{4}, \\ \mathbb{Z}[\sqrt{d}] & \text{if } d \equiv 2 \text{ or } 3 \pmod{4}. \end{cases}

The discriminant ΔK\Delta_K of KK is an integer that encodes the geometry of OK\mathcal{O}_K inside KK; it equals the determinant of the matrix (Tr(ωiωj))(\text{Tr}(\omega_i \omega_j)). A prime pp ramifies in KK if and only if pΔKp \mid \Delta_K, a fact that connects the discriminant to the branching behavior of primes.

Rings of integers are Dedekind domains: integral domains in which every non-zero ideal factors uniquely into a product of prime ideals. This is a remarkable structural theorem proved by Dedekind in 1871 in his landmark supplement to Dirichlet’s Vorlesungen über Zahlentheorie. While OK\mathcal{O}_K may fail to be a unique factorization domain (UFD) — in Z[5]\mathbb{Z}[\sqrt{-5}], for instance, 6=23=(1+5)(15)6 = 2 \cdot 3 = (1 + \sqrt{-5})(1 - \sqrt{-5}) are two distinct factorizations into irreducibles — the failure is completely captured by the ideal structure: the ideals (2)(2), (3)(3), (1+5)(1+\sqrt{-5}), and (15)(1-\sqrt{-5}) all factor into prime ideals in a way that is unique. Dedekind’s insight was that unique factorization should be sought among ideals, not elements.

Units, Class Groups, and Ideal Theory

An element uOKu \in \mathcal{O}_K is a unit if it has a multiplicative inverse also lying in OK\mathcal{O}_K, equivalently if N(u)=±1N(u) = \pm 1. The units form a group OK×\mathcal{O}_K^\times under multiplication. Dirichlet’s unit theorem, proved in 1846, gives a complete structural description of this group:

OK×μK×Zr1+r21,\mathcal{O}_K^\times \cong \mu_K \times \mathbb{Z}^{r_1 + r_2 - 1},

where μK\mu_K is the (finite cyclic) group of roots of unity in KK, r1r_1 is the number of real embeddings of KK, and r2r_2 is the number of pairs of complex conjugate embeddings, with r1+2r2=[K:Q]r_1 + 2r_2 = [K:\mathbb{Q}]. The integer r=r1+r21r = r_1 + r_2 - 1 is called the unit rank. For K=Q(2)K = \mathbb{Q}(\sqrt{2}), we have r1=2r_1 = 2, r2=0r_2 = 0, so the unit rank is 11 and the units are ±(1+2)n\pm (1+\sqrt{2})^n for nZn \in \mathbb{Z} — the fundamental unit 1+21 + \sqrt{2} generates an infinite cyclic factor. For imaginary quadratic fields like Q(1)\mathbb{Q}(\sqrt{-1}), the unit rank is 00 and the units are purely the roots of unity: {±1,±i}\{\pm 1, \pm i\} in this case.

The ideal class group Cl(K)\text{Cl}(K) measures precisely how far OK\mathcal{O}_K is from being a UFD. Two non-zero fractional ideals a\mathfrak{a} and b\mathfrak{b} of OK\mathcal{O}_K are equivalent if a=xb\mathfrak{a} = x \mathfrak{b} for some xK×x \in K^\times; the equivalence classes form a group under ideal multiplication, and this is the ideal class group. The order hK=Cl(K)h_K = |\text{Cl}(K)| is the class number of KK. The key fact is: OK\mathcal{O}_K is a UFD if and only if hK=1h_K = 1. The class number thus quantifies the failure of unique factorization.

Computing class numbers is a central algorithmic problem. Minkowski’s bound gives the decisive tool: every ideal class contains an integral ideal a\mathfrak{a} with N(a)MKN(\mathfrak{a}) \leq M_K, where the Minkowski bound is

MK=n!nn(4π)r2ΔK.M_K = \frac{n!}{n^n} \left(\frac{4}{\pi}\right)^{r_2} \sqrt{|\Delta_K|}.

Since there are only finitely many integral ideals of bounded norm, this shows that the class group is finite. Moreover, to compute the class group it suffices to determine the ideal classes of all prime ideals p\mathfrak{p} with N(p)MKN(\mathfrak{p}) \leq M_K — a finite computation. For example, in Q(5)\mathbb{Q}(\sqrt{-5}), one finds MK4.5M_K \approx 4.5 and the class group is Z/2Z\mathbb{Z}/2\mathbb{Z}, reflecting the non-unique factorization of 66 seen earlier.

The analytic class number formula, one of the crown jewels of the subject, expresses the class number through analytic means. The Dedekind zeta function of KK is

ζK(s)=a01N(a)s=p prime11N(p)s,\zeta_K(s) = \sum_{\mathfrak{a} \neq 0} \frac{1}{N(\mathfrak{a})^s} = \prod_{\mathfrak{p} \text{ prime}} \frac{1}{1 - N(\mathfrak{p})^{-s}},

and it satisfies the class number formula: the residue of ζK(s)\zeta_K(s) at s=1s = 1 equals

lims1(s1)ζK(s)=2r1(2π)r2hKRKwKΔK,\lim_{s \to 1} (s-1)\zeta_K(s) = \frac{2^{r_1}(2\pi)^{r_2} h_K R_K}{w_K \sqrt{|\Delta_K|}},

where RKR_K is the regulator (the volume of the unit lattice) and wKw_K is the number of roots of unity in KK. This formula, a generalization of Dirichlet’s 1839 class number formula for quadratic fields, links algebraic invariants of KK to the analytic behavior of its zeta function — a connection that would eventually grow into the Langlands program.

Valuations and Local Fields

A valuation on a field FF is a function :FR0|\cdot| : F \to \mathbb{R}_{\geq 0} satisfying x=0x=0|x| = 0 \Leftrightarrow x = 0, xy=xy|xy| = |x||y|, and the triangle inequality x+yx+y|x + y| \leq |x| + |y|. If the stronger ultrametric inequality x+ymax(x,y)|x + y| \leq \max(|x|, |y|) holds, the valuation is called non-archimedean. The ordinary absolute value on Q\mathbb{Q} is archimedean; the pp-adic absolute value np=pvp(n)|n|_p = p^{-v_p(n)} — where vp(n)v_p(n) is the exponent of pp in the prime factorization of nn — is non-archimedean.

Ostrowski’s theorem (1916) classifies all valuations on Q\mathbb{Q} up to equivalence: every non-trivial absolute value on Q\mathbb{Q} is equivalent either to the ordinary absolute value or to the pp-adic absolute value p|\cdot|_p for some prime pp. This is the first indication that the primes of Z\mathbb{Z} and the archimedean embedding QR\mathbb{Q} \hookrightarrow \mathbb{R} stand on equal footing — they are the places of Q\mathbb{Q}, and number theory is richest when all places are considered simultaneously.

The pp-adic numbers Qp\mathbb{Q}_p are constructed as the completion of Q\mathbb{Q} with respect to p|\cdot|_p, precisely as R\mathbb{R} is the completion with respect to the ordinary absolute value. Elements of Qp\mathbb{Q}_p have a canonical pp-adic expansion:

x=k=makpk,ak{0,1,,p1},x = \sum_{k=m}^{\infty} a_k p^k, \quad a_k \in \{0, 1, \ldots, p-1\},

which converges in the pp-adic topology. The subring Zp={xQp:xp1}\mathbb{Z}_p = \{x \in \mathbb{Q}_p : |x|_p \leq 1\} of pp-adic integers consists of those elements with non-negative pp-adic valuation. Within Zp\mathbb{Z}_p, the unique maximal ideal is pZpp\mathbb{Z}_p, and the quotient Zp/pZpFp\mathbb{Z}_p / p\mathbb{Z}_p \cong \mathbb{F}_p is the finite field with pp elements.

A local field is a locally compact, non-discrete topological field. Over characteristic zero, these are exactly R\mathbb{R}, C\mathbb{C}, and the finite extensions of Qp\mathbb{Q}_p. The extension theory of local fields is much simpler than that of global fields: every finite extension L/QpL/\mathbb{Q}_p is completely described by three invariants — the ramification index ee, the residue degree ff, and the relation [L:Qp]=ef[L:\mathbb{Q}_p] = ef. The prime pp factors in OL\mathcal{O}_L as pOL=pep\mathcal{O}_L = \mathfrak{p}^e, so e>1e > 1 means pp is ramified in LL.

Hensel’s lemma is the fundamental lifting tool for pp-adic arithmetic. In one formulation: if f(x)Zp[x]f(x) \in \mathbb{Z}_p[x] and aZpa \in \mathbb{Z}_p satisfies f(a)0(modp)f(a) \equiv 0 \pmod{p} and f(a)≢0(modp)f'(a) \not\equiv 0 \pmod{p}, then there exists a unique a^Zp\hat{a} \in \mathbb{Z}_p with f(a^)=0f(\hat{a}) = 0 and a^a(modp)\hat{a} \equiv a \pmod{p}. This is a pp-adic analogue of Newton’s method, and it means that a simple root modulo pp lifts uniquely to an exact root in Zp\mathbb{Z}_p. Hensel’s lemma implies, for instance, that every integer that is a square modulo pp (with pp odd) is a square in Zp\mathbb{Z}_p.

The study of local fields is called local number theory, and it is indispensable for understanding global number fields. The passage from local to global is formalized through the adele ring AQ=R×pQp\mathbb{A}_\mathbb{Q} = \mathbb{R} \times \prod'_p \mathbb{Q}_p (a restricted product), which packages all completions simultaneously and enables powerful analytic techniques such as the Poisson summation formula on the adeles.

Ramification and Decomposition

When a prime pZp \in \mathbb{Z} is extended to a prime ideal p\mathfrak{p} of OK\mathcal{O}_K, the prime pp may behave in one of three fundamentally different ways. The ideal pOKp\mathcal{O}_K factors into prime ideals of OK\mathcal{O}_K:

pOK=p1e1p2e2pgeg.p\mathcal{O}_K = \mathfrak{p}_1^{e_1} \mathfrak{p}_2^{e_2} \cdots \mathfrak{p}_g^{e_g}.

Each prime pi\mathfrak{p}_i has a residue degree fi=[OK/pi:Fp]f_i = [\mathcal{O}_K/\mathfrak{p}_i : \mathbb{F}_p], and the fundamental identity i=1geifi=n=[K:Q]\sum_{i=1}^g e_i f_i = n = [K:\mathbb{Q}] holds. The prime pp is ramified in KK if any ei>1e_i > 1, split if g>1g > 1 and all ei=fi=1e_i = f_i = 1, and inert if g=1g = 1 and f1=nf_1 = n. Ramification is the most delicate case: it signals that pp divides the discriminant ΔK\Delta_K, and only finitely many primes ramify in any given number field.

For Galois extensions K/QK/\mathbb{Q} — those for which the Galois group Gal(K/Q)\text{Gal}(K/\mathbb{Q}) acts transitively on the prime ideals above pp — the situation simplifies: all ei=ee_i = e and all fi=ff_i = f are equal, so efg=nefg = n. The decomposition group DpGal(K/Q)D_\mathfrak{p} \leq \text{Gal}(K/\mathbb{Q}) is the stabilizer of p\mathfrak{p}, and the inertia group IpDpI_\mathfrak{p} \unlhd D_\mathfrak{p} consists of those automorphisms that act trivially on OK/p\mathcal{O}_K/\mathfrak{p}. We have Dp=ef|D_\mathfrak{p}| = ef and Ip=e|I_\mathfrak{p}| = e; pp is unramified exactly when IpI_\mathfrak{p} is trivial.

For unramified primes in a Galois extension, each decomposition group DpGal((OK/p)/Fp)D_\mathfrak{p} \cong \text{Gal}((\mathcal{O}_K/\mathfrak{p})/\mathbb{F}_p) is cyclic, generated by a canonical element called the Frobenius element Frobp\text{Frob}_\mathfrak{p}, characterized by the congruence

Frobp(x)xp(modp)for all xOK.\text{Frob}_\mathfrak{p}(x) \equiv x^p \pmod{\mathfrak{p}} \quad \text{for all } x \in \mathcal{O}_K.

The Frobenius element is a cornerstone of modern number theory. Different primes p\mathfrak{p} lying over the same rational prime pp have conjugate Frobenius elements, so in an abelian Galois extension the Frobenius depends only on pp, yielding a well-defined Artin symbol (K/Q,p)Gal(K/Q)(K/\mathbb{Q}, p) \in \text{Gal}(K/\mathbb{Q}). The map p(K/Q,p)p \mapsto (K/\mathbb{Q}, p) is the Artin map of class field theory, and the fact that it is a group homomorphism from the group of fractional ideals to Gal(K/Q)\text{Gal}(K/\mathbb{Q}) — the Artin reciprocity law — is the crowning theorem of class field theory, the branch of algebraic number theory that classifies all abelian extensions of a number field.

The distribution of Frobenius elements is governed by the Chebotarev density theorem (1922), which asserts that for any conjugacy class CGal(K/Q)C \subseteq \text{Gal}(K/\mathbb{Q}), the proportion of primes pp with FrobpC\text{Frob}_p \in C equals C/Gal(K/Q)|C|/|\text{Gal}(K/\mathbb{Q})|. This is a vast generalization of Dirichlet’s theorem on primes in arithmetic progressions — which is the special case where KK is a cyclotomic field and the Galois group is abelian. The Chebotarev density theorem is proved using the analytic properties of the Dedekind zeta function and its factorization into Artin LL-functions, forging yet another link between the algebraic structure of number fields and the analytic behavior of their associated LL-functions.

The theory of ramification and decomposition connects seamlessly to the study of quadratic reciprocity (as treated in that topic): the Legendre symbol (pq)\left(\frac{p}{q}\right) records how the prime qq splits in Q(p)\mathbb{Q}(\sqrt{p^*}), and the quadratic reciprocity law is a special case of the Artin reciprocity law for quadratic fields. The abstract algebra of Galois groups, normal subgroups, and field extensions (as treated in abstract algebra) provides the language in which all of these ideas are most clearly expressed. Together, these strands weave into algebraic number theory’s central ambition: to understand the absolute Galois group Gal(Q/Q)\text{Gal}(\overline{\mathbb{Q}}/\mathbb{Q}), the symmetry group of all algebraic numbers, through its representations and its action on arithmetic objects — a program that remains one of the deepest open problems in mathematics.