The ZFC Axioms

The Zermelo-Fraenkel axioms with Choice — the standard foundation of mathematics.


The Zermelo-Fraenkel axioms with Choice, collectively known as ZFC, form the standard axiomatic foundation for nearly all of modern mathematics. Born out of the crisis triggered by Russell’s paradox, these axioms replaced the informal notion of “a set is any collection” with a precise first-order theory that carefully controls which sets exist and how new sets can be formed. In this article we present the axioms, explore the cumulative hierarchy they give rise to, show how the natural numbers and basic arithmetic are constructed within the theory, and sketch the path from there to the integers, rationals, and real numbers — demonstrating that ZFC is powerful enough to serve as a universal foundation.

The Zermelo-Fraenkel Axioms

The story of ZFC begins with a catastrophe. In 1901, Bertrand Russell showed that the naive comprehension principle — the assumption that any property determines a set — leads to contradiction. The set R={x:xx}R = \{x : x \notin x\} can neither contain itself nor fail to contain itself. This paradox struck at the heart of Gottlob Frege’s recently published Grundgesetze der Arithmetik and made it clear that set theory needed to be rebuilt on firmer ground.

Ernst Zermelo published the first axiomatization in 1908, driven both by Russell’s paradox and by the need to place his own 1904 proof of the well-ordering theorem on a rigorous footing. His system introduced the key idea of restricting set formation: instead of forming sets from arbitrary properties, one can only separate elements from an already-existing set. In 1922, Abraham Fraenkel and, independently, Thoralf Skolem identified a gap in Zermelo’s system — the need for the axiom schema of replacement — and proposed refinements. The resulting theory, known as ZF, is formulated in first-order logic with a single binary relation symbol \in (membership). When the Axiom of Choice is added, we obtain ZFC.

The axioms of ZF are as follows:

AxiomInformal Statement
ExtensionalityTwo sets are equal if and only if they have the same elements.
Empty SetThere exists a set with no elements (derivable from Separation, but often stated explicitly).
PairingFor any two sets aa and bb, the set {a,b}\{a, b\} exists.
UnionFor any set aa, the union a\bigcup a of all elements of elements of aa exists.
Power SetFor any set aa, the collection of all subsets of aa forms a set P(a)\mathcal{P}(a).
InfinityThere exists a set containing \emptyset and closed under the successor operation xx{x}x \mapsto x \cup \{x\}.
Separation SchemaFor any set aa and any first-order property φ(z)\varphi(z), the collection {za:φ(z)}\{z \in a : \varphi(z)\} is a set.
Replacement SchemaIf φ(x,y)\varphi(x, y) defines a function on a set aa, then the image {y:xaφ(x,y)}\{y : \exists x \in a\, \varphi(x, y)\} is a set.
Foundation (Regularity)Every nonempty set has an \in-minimal element.

The Extensionality axiom says that a set is completely determined by its elements — there are no two distinct sets with exactly the same members. Formally: xy[z(zxzy)x=y]\forall x\,\forall y\,[\forall z\,(z \in x \leftrightarrow z \in y) \to x = y]. This is the identity criterion for sets and underlies the entire theory.

Pairing, Union, and Power Set provide the basic closure operations. From any two sets you can form their pair; from any collection of sets you can take the union of their elements; and from any set you can form the collection of all its subsets. Together with the empty set, these axioms let you build up an enormous variety of finite and infinite structures.

The Axiom of Infinity is what lifts set theory beyond the finite. It asserts the existence of at least one infinite set — specifically, a set ω\omega satisfying ω\emptyset \in \omega and x(xωx{x}ω)\forall x\,(x \in \omega \to x \cup \{x\} \in \omega). Without this axiom, all the other axioms would be satisfied by the hereditarily finite sets alone.

Separation (also called Comprehension or Aussonderung) is the safe replacement for naive comprehension. It allows you to carve out subsets of an existing set using a first-order property, but it cannot conjure a set from thin air. It is a schema — one axiom for each first-order formula φ\varphi. Replacement is strictly stronger: it says that the image of a set under any definable function is again a set. Replacement implies Separation, but it is kept as a distinct schema because of its additional power in constructing sets of high rank, such as Vω+ωV_{\omega + \omega}.

The Axiom of Choice (AC) stands apart and is traditionally listed separately, which is why we speak of ZF versus ZFC. It asserts that for every family of nonempty sets, there exists a function selecting exactly one element from each. Formally: for every set aa of nonempty sets, there is a function f:aaf : a \to \bigcup a such that f(x)xf(x) \in x for each xax \in a. The Axiom of Choice is equivalent to Zorn’s Lemma (every partially ordered set in which every chain has an upper bound contains a maximal element) and to the Well-Ordering Theorem (every set can be well-ordered). Weaker variants include Countable Choice and the Axiom of Dependent Choice, which suffice for most of analysis. Choice was historically controversial — it asserts existence without providing a construction — but it is now accepted as a standard part of working mathematics.

The Cumulative Hierarchy and Foundation

The axioms of ZFC do not merely list rules for forming sets; they implicitly describe a layered picture of the entire set-theoretic universe. This picture is the cumulative hierarchy, developed by John von Neumann in the 1920s, and it provides the conceptual and technical backbone of modern set theory.

The hierarchy is built by transfinite recursion on the ordinals. We begin with nothing and iteratively apply the power set operation:

V0=V_0 = \emptyset

Vα+1=P(Vα)V_{\alpha + 1} = \mathcal{P}(V_\alpha)

Vλ=α<λVαfor limit ordinals λV_\lambda = \bigcup_{\alpha < \lambda} V_\alpha \quad \text{for limit ordinals } \lambda

At the base, V0V_0 is empty. At the first successor stage, V1=P()={}V_1 = \mathcal{P}(\emptyset) = \{\emptyset\}, a set with one element. Then V2={,{}}V_2 = \{\emptyset, \{\emptyset\}\} with two elements, and V3V_3 has four elements — and so on, with each finite stage VnV_n having 2Vn12^{|V_{n-1}|} elements. The first limit stage Vω=n<ωVnV_\omega = \bigcup_{n < \omega} V_n collects all the hereditarily finite sets — every set whose members, and members of members, and so on, are all finite. Beyond VωV_\omega, the hierarchy continues through all the transfinite ordinals: Vω+1=P(Vω)V_{\omega + 1} = \mathcal{P}(V_\omega) is already uncountable, and the levels grow with breathtaking speed. The universe of all sets is defined as:

V=αOrdVαV = \bigcup_{\alpha \in \mathrm{Ord}} V_\alpha

where Ord\mathrm{Ord} denotes the proper class of all ordinals. Every set in ZFC appears at some level of this hierarchy, and the rank of a set xx is the least ordinal α\alpha such that xVα+1x \in V_{\alpha + 1}. Rank provides a measure of complexity: the natural numbers have finite rank, the set of real numbers has rank ω+1\omega + 1, and the objects of ordinary mathematics live at relatively low levels of the hierarchy.

The Axiom of Foundation (also called Regularity) is what makes this picture work. It states that every nonempty set xx contains an element yy such that xy=x \cap y = \emptyset — in other words, yy is \in-minimal in xx. Formally:

x[xy(yxyx=)]\forall x\,[x \neq \emptyset \to \exists y\,(y \in x \land y \cap x = \emptyset)]

An equivalent formulation is that there is no infinite descending chain x2x1x0\cdots \in x_2 \in x_1 \in x_0 — the membership relation is well-founded. Foundation rules out pathological sets: there is no set xx with xxx \in x, no pair of sets with aba \in b and bab \in a, and no circular membership of any kind. Foundation guarantees that every set can be assigned a rank and thus appears somewhere in the cumulative hierarchy.

This is sometimes called the iterative conception of sets: sets are formed in stages, and at each stage you can only collect sets that were already available at earlier stages. The axioms of ZFC can be seen as formalizing this intuitive picture. The iterative conception provides a philosophical justification for the axioms — each axiom corresponds to a natural step in the stage-by-stage construction — and it explains why ZFC avoids the paradoxes: Russell’s set R={x:xx}R = \{x : x \notin x\} would have to contain sets of every rank and therefore could not appear at any particular stage.

Natural Numbers and Peano Arithmetic

One of the first accomplishments of ZFC is the construction of the natural numbers as specific sets. The standard encoding, due to von Neumann, identifies each natural number with the set of all smaller natural numbers:

0=,1={},2={,{}},3={,{},{,{}}},0 = \emptyset, \quad 1 = \{\emptyset\}, \quad 2 = \{\emptyset, \{\emptyset\}\}, \quad 3 = \{\emptyset, \{\emptyset\}, \{\emptyset, \{\emptyset\}\}\}, \quad \ldots

In general, the successor of nn is defined as S(n)=n{n}S(n) = n \cup \{n\}. Under this encoding, each natural number nn is a set with exactly nn elements, and m<nm < n if and only if mnm \in n. The elegance of the von Neumann construction is that it simultaneously defines the natural numbers and the ordering relation on them.

The Axiom of Infinity guarantees the existence of a set that contains 0=0 = \emptyset and is closed under the successor operation. By applying the Separation schema, we extract the smallest such set — the intersection of all sets containing \emptyset and closed under successor. This minimal set is the set of natural numbers ω\omega (also written N\mathbb{N}).

The set ω\omega satisfies the Peano axioms as formulated within ZFC:

  1. 0ω0 \in \omega (zero is a natural number).
  2. If nωn \in \omega, then S(n)ωS(n) \in \omega (the successor of a natural number is a natural number).
  3. There is no nωn \in \omega with S(n)=0S(n) = 0 (zero is not a successor).
  4. If S(m)=S(n)S(m) = S(n), then m=nm = n (the successor function is injective).
  5. Mathematical induction: if a set AωA \subseteq \omega contains 00 and is closed under successor, then A=ωA = \omega.

The induction principle follows from the definition of ω\omega as the smallest inductive set. Since AA satisfies the closure conditions, it is an inductive set containing ω\omega as a subset; but ω\omega is contained in every inductive set, so A=ωA = \omega.

With ω\omega in hand, we define addition by recursion on the second argument:

m+0=m,m+S(n)=S(m+n)m + 0 = m, \qquad m + S(n) = S(m + n)

and multiplication similarly:

m0=0,mS(n)=mn+mm \cdot 0 = 0, \qquad m \cdot S(n) = m \cdot n + m

These recursive definitions are justified by the recursion theorem, which itself follows from the properties of well-ordered sets in ZFC. The familiar laws of arithmetic — commutativity, associativity, distributivity — are then proved by induction. In this way, the entire structure of elementary number theory is derived from the axioms of ZFC, with no additional assumptions.

Formal Constructions within ZFC

The power of ZFC as a foundation for mathematics becomes visible when we extend beyond the natural numbers to construct the full number systems of analysis. Each step uses a standard set-theoretic technique — typically equivalence classes or subsets — applied to structures already built.

Before proceeding, we need the notion of an ordered pair. The Kuratowski definition encodes the ordered pair (a,b)(a, b) as the set {{a},{a,b}}\{\{a\}, \{a, b\}\}. This seemingly arbitrary definition has exactly the right property: (a,b)=(c,d)(a, b) = (c, d) if and only if a=ca = c and b=db = d. With ordered pairs in hand, we can define Cartesian products, relations, and functions as sets, making them first-class citizens of ZFC.

The integers Z\mathbb{Z} are constructed from the natural numbers as equivalence classes of pairs. Consider the set ω×ω\omega \times \omega of all pairs (a,b)(a, b) of natural numbers, and define an equivalence relation by (a,b)(c,d)(a, b) \sim (c, d) if and only if a+d=b+ca + d = b + c. The intuition is that the pair (a,b)(a, b) represents the integer aba - b. Each equivalence class is an integer: the class of (3,0)(3, 0) is the integer +3+3, the class of (0,2)(0, 2) is 2-2, and the class of (5,5)(5, 5) is 00. Addition and multiplication are defined on representatives and shown to be well-defined (independent of the choice of representative).

The rationals Q\mathbb{Q} are built from the integers by the same technique. Consider pairs (p,q)(p, q) with pZp \in \mathbb{Z} and qZ{0}q \in \mathbb{Z} \setminus \{0\}, and define (p,q)(r,s)(p, q) \sim (r, s) if and only if ps=qrp \cdot s = q \cdot r. Each equivalence class is a rational number, representing the fraction p/qp/q. The field operations extend naturally, and one verifies that Q\mathbb{Q} is an ordered field.

The real numbers R\mathbb{R} require a more substantial construction. There are two classical approaches, both carried out entirely within ZFC. In the Dedekind cut construction, a real number is defined as a partition of Q\mathbb{Q} into two nonempty sets LL and RR such that every element of LL is less than every element of RR and LL has no greatest element. The real number 2\sqrt{2}, for instance, is the cut where L={qQ:q<0 or q2<2}L = \{q \in \mathbb{Q} : q < 0 \text{ or } q^2 < 2\}. The alternative approach uses Cauchy sequences: a real number is an equivalence class of Cauchy sequences of rationals, where two sequences are equivalent if their difference converges to zero. Both constructions yield a complete ordered field, and one proves that any two complete ordered fields are isomorphic — so the real numbers are uniquely determined up to isomorphism.

This chain of constructions — from \emptyset to ω\omega to Z\mathbb{Z} to Q\mathbb{Q} to R\mathbb{R} — illustrates the foundational thesis of ZFC: essentially all of mathematics can be formalized as statements about sets built from the empty set using the ZFC axioms. Functions are sets of ordered pairs, topological spaces are sets equipped with distinguished collections of subsets, groups are sets with binary operations satisfying certain properties, and so on. This universality is both ZFC’s great strength and its raison d’etre. It does not mean that working mathematicians must think in terms of sets at every moment, but it does mean that there is a single coherent framework in which the consistency of all standard mathematical reasoning can, in principle, be verified.