Axiomatic Set Theory
The formal foundation of mathematics — ZFC axioms, ordinals, cardinals, and the continuum hypothesis.
Axiomatic set theory is the formal framework in which virtually all of modern mathematics can be expressed and proved. Where naive set theory relies on intuitive notions of “collection” and runs headlong into paradoxes, the axiomatic approach replaces intuition with a precise list of first-order axioms — the Zermelo-Fraenkel axioms with Choice (ZFC) — that govern what sets exist and how they behave. This topic examines the logical structure of ZFC, the models and hierarchies it gives rise to, and the remarkable independence phenomena that reveal the inherent limits of any single axiom system for mathematics.
Set-Theoretic Axioms and Foundations
The need to axiomatize set theory became urgent in 1901, when Bertrand Russell discovered a devastating contradiction in Frege’s system. Consider the set . If , then by definition ; if , then . This is Russell’s paradox, and it showed that unrestricted set comprehension — the principle that any property determines a set — is inconsistent. The resolution required abandoning the naive “a set is any collection” and replacing it with carefully controlled axioms that specify which sets can be formed and how.
Ernst Zermelo published the first axiomatization in 1908, motivated both by Russell’s paradox and by the desire to place his 1904 proof of the well-ordering theorem on rigorous foundations. Abraham Fraenkel (1922) and Thoralf Skolem (1922) independently identified the need for the axiom schema of replacement, and John von Neumann contributed the axiom of foundation and the modern treatment of ordinals. The resulting system, ZFC, consists of the following axioms, all expressed in the first-order language of set theory with a single binary relation symbol :
-
Extensionality. Two sets are equal if and only if they have the same elements:
-
Foundation (Regularity). Every nonempty set contains an element disjoint from : This prevents pathological structures like or infinite descending -chains.
-
Comprehension (Separation) Schema. For any formula with parameters, and any set , the collection is a set: This is the safe replacement for unrestricted comprehension: you can only separate elements from an already-existing set, not conjure a set from an arbitrary property.
-
Pairing. For any two sets and , the pair exists:
-
Union. For any set , the union exists.
-
Power Set. For any set , the power set exists.
-
Infinity. There exists an infinite set. Specifically, there exists a set containing and closed under the successor operation :
-
Replacement Schema. If is a functional formula (for every in a set there is a unique with ), then the image is a set. This is strictly stronger than separation and is needed to construct sets of high rank, such as .
-
Choice (AC). For every family of nonempty sets, there exists a function selecting one element from each:
Each axiom addresses a specific need. Extensionality ensures that sets are determined by their elements, not by how they are described. Foundation gives the universe of sets its well-founded, hierarchical character. Comprehension and replacement control which new sets can be formed. Pairing, union, and power set provide basic closure operations. Infinity guarantees the existence of the natural numbers. Choice, the most controversial axiom historically, is equivalent to Zorn’s lemma and the well-ordering theorem, and is indispensable in algebra, analysis, and topology.
The axioms of ZFC organize the set-theoretic universe into the cumulative hierarchy. Define by transfinite recursion:
The universe of all sets is then , where denotes the class of all ordinals. Foundation guarantees that every set appears at some level: for every set , there is a least ordinal such that , called the rank of . The cumulative hierarchy is not just a visualization tool — it is the structural backbone of ZFC and the starting point for understanding inner models and forcing extensions.
An important metatheoretic tool is the reflection principle: for any finite list of first-order sentences true in , there exists an ordinal such that satisfies those same sentences. Reflection provides a way to “localize” arguments about the entire universe to arguments about set-sized initial segments, and it underlies many constructions in large cardinal theory.
Constructible Universe and Inner Models
In 1938, Kurt Godel achieved one of the landmark results of twentieth-century mathematics by constructing a model of set theory in which both the Axiom of Choice and the Generalized Continuum Hypothesis hold. His tool was the constructible universe , a “thin” inner model of in which every set is explicitly definable from those that came before.
The constructible hierarchy is defined by transfinite recursion, paralleling the cumulative hierarchy but replacing the power set with the definable power set:
Here denotes the set of all subsets of that are first-order definable over with parameters from . The constructible universe is then . Since , we always have and hence . The axiom asserts that every set is constructible — that the universe is as “thin” as possible.
The key structural result about is the condensation lemma: if is an elementary substructure of some (written ), then the Mostowski collapse of is isomorphic to some with . This lemma is the engine behind most of the important properties of . It implies, for instance, that satisfies the Generalized Continuum Hypothesis (GCH): for all ordinals . The argument proceeds by showing that every subset of that appears in is definable at a countable level, so there are at most subsets of in .
A central concept in the study of is absoluteness. A formula is absolute between two transitive models if for all parameters from , holds in if and only if it holds in . The formulas — those that are both and in the Levy hierarchy — are absolute between transitive models that satisfy enough of ZFC. Absoluteness is what allows results proved in to be transferred to : if is absolute and provable in , then it holds in as well.
Godel’s achievement was a relative consistency result: if ZFC is consistent, then so is ZFC + , and hence ZFC + GCH + AC. The proof works by showing that is a model of ZFC that additionally satisfies . Since implies both AC and GCH, the consistency of these principles follows. This was the first half of the independence of CH; the second half — showing that CH is also consistent — would have to wait twenty-five years for Cohen’s forcing.
Inner model theory extends Godel’s ideas by constructing models between and that accommodate large cardinal axioms. The model , built from a normal measure on a measurable cardinal, was studied by Kenneth Kunen in the 1970s. More sophisticated constructions — the core model developed by Anthony Dodd and Ronald Jensen, and later extended by John Steel and others — provide canonical inner models for increasingly large cardinals. The covering lemma of Jensen states that if does not exist (a condition related to the non-existence of certain large cardinals), then is “close to” in a precise sense: every uncountable set of ordinals in can be covered by a constructible set of the same cardinality.
Descriptive Set Theory
Descriptive set theory studies the structure of “definable” subsets of Polish spaces — complete, separable metric spaces such as , the Cantor space , and the Baire space . The central question is: which regularity properties (Lebesgue measurability, the Baire property, the perfect set property) do definable sets possess, and how does the answer depend on the complexity of the definition?
The starting point is the Borel hierarchy. A subset of a Polish space is Borel if it belongs to the -algebra generated by the open sets. The Borel sets are stratified into levels indexed by countable ordinals. The open sets form the class and the closed sets form . At the next level, consists of countable unions of closed sets ( sets) and consists of countable intersections of open sets ( sets). In general, for a countable ordinal :
The hierarchy is proper: at each level , there exist sets in that are not in and vice versa. All Borel sets are Lebesgue measurable, have the Baire property, and satisfy the perfect set property (every uncountable Borel set contains a perfect subset and hence has cardinality ).
Beyond the Borel sets lies the projective hierarchy. A set is analytic (or ) if it is the continuous image of a Borel set, equivalently the projection of a Borel subset of . The coanalytic sets () are the complements of analytic sets. The projective hierarchy continues: sets are projections of sets, and sets are their complements. Every Borel set is both and . The classical theorem of Mikhail Suslin (1917) establishes that the Borel sets are exactly the sets that are simultaneously analytic and coanalytic: .
Analytic sets retain many regularity properties: they are Lebesgue measurable, have the Baire property, and satisfy the perfect set property (a result due to Suslin). However, the situation at the level and beyond becomes sensitive to the axioms. In Godel’s , there exist sets without the perfect set property, and there exist sets that are not Lebesgue measurable. These results show that ZFC alone cannot decide the regularity of projective sets — resolving the question requires either large cardinal axioms or determinacy hypotheses.
Effective descriptive set theory refines the boldface hierarchy by restricting to computable or arithmetically definable operations. The lightface classes and correspond to the arithmetic hierarchy from computability theory: a set is (lightface) if it is computably enumerable, and so on. At the analytic level, the lightface sets are exactly the sets definable by a formula without parameters, and these coincide with the hyperarithmetic sets (via the Suslin-Kleene theorem). The effective theory provides a bridge between descriptive set theory and computability theory, connecting the structure of definable sets of reals to the recursion-theoretic hierarchy.
Determinacy and Large Cardinals in Logic
Consider a two-player infinite game on natural numbers associated to a set : Players I and II alternately choose natural numbers , producing a sequence . Player I wins if ; otherwise Player II wins. The set is determined if one of the two players has a winning strategy — a function that tells that player how to move at each turn, guaranteeing victory regardless of the opponent’s play.
The Axiom of Determinacy (AD) asserts that every subset of is determined. AD is incompatible with the full Axiom of Choice (using AC, one can construct non-determined sets via a well-ordering of the reals), but it has remarkably strong consequences for the regularity of all sets of reals: under AD, every set of reals is Lebesgue measurable, has the Baire property, and satisfies the perfect set property. AD thus provides an alternative foundational framework in which all sets of reals are “well-behaved.”
The first breakthrough connecting determinacy to ZFC was Donald Martin’s proof (1975) that Borel determinacy is a theorem of ZFC. This is a difficult result — Martin showed that the proof requires the power set axiom applied times for Borel sets of rank , and Harvey Friedman proved that Borel determinacy cannot be established in Zermelo set theory (ZFC without replacement), demonstrating that the replacement schema is genuinely needed. Borel determinacy is thus a natural mathematical theorem whose proof requires the full strength of ZFC.
Beyond the Borel level, determinacy requires large cardinal hypotheses. Large cardinal axioms assert the existence of cardinals with strong closure, reflection, or embedding properties that cannot be proved to exist in ZFC alone. The hierarchy includes:
- Inaccessible cardinals: is inaccessible if it is uncountable, regular, and a strong limit ( for all ). The existence of an inaccessible cardinal implies the consistency of ZFC, so by the second incompleteness theorem, ZFC cannot prove that inaccessible cardinals exist.
- Measurable cardinals: is measurable if there exists a non-principal -complete ultrafilter on , equivalently if there is a non-trivial elementary embedding with critical point . Measurable cardinals imply -determinacy.
- Woodin cardinals: is Woodin if for every function , there exists a cardinal and an elementary embedding with critical point such that . Woodin cardinals are the key to projective determinacy.
- Supercompact cardinals: is supercompact if for every , there exists an elementary embedding with critical point such that is closed under -sequences.
The deep connection between large cardinals and determinacy is one of the central themes of modern set theory. Woodin’s theorem (building on work of Martin and Steel) establishes that if there exist infinitely many Woodin cardinals with a measurable cardinal above them all, then Projective Determinacy (PD) holds — every projective set is determined. PD in turn implies that all projective sets are Lebesgue measurable and have the perfect set property, resolving questions that ZFC alone leaves open. The large cardinal hierarchy thus provides a natural calibration of the logical strength needed to establish determinacy at each level of the projective hierarchy: -determinacy follows from the existence of a measurable cardinal, -determinacy from a Woodin cardinal, and full PD from infinitely many Woodin cardinals.
Independence Results
The independence of the Continuum Hypothesis from ZFC is one of the most significant achievements in the history of mathematics, resolved through two complementary results separated by twenty-five years.
The first half, as discussed above, was Godel’s 1938 result: if ZFC is consistent, then ZFC + CH is consistent. The proof proceeds by showing that the constructible universe is a model of ZFC in which GCH holds. Since every model of ZFC contains as an inner model, the consistency of ZFC implies the consistency of ZFC + GCH. Godel himself believed that CH was false and that new axioms would eventually settle the question, but his consistency proof showed that any such refutation could not be carried out within ZFC.
The second half came in 1963, when Paul Cohen invented the method of forcing and used it to construct a model of ZFC in which CH fails. Cohen’s work earned him the Fields Medal in 1966 — the only Fields Medal ever awarded for a result in mathematical logic. The forcing method is arguably the most powerful technique in all of set theory, and its invention transformed the field.
The basic idea of forcing is to extend a countable transitive model of ZFC (a “ground model”) by adjoining a new set — called a generic filter — to obtain a larger model that still satisfies ZFC but may satisfy new statements that does not. The construction proceeds as follows. Choose a partial order in , whose elements are called forcing conditions. A filter is -generic if meets every dense set that belongs to . A set is dense in if every condition has an extension in . The Rasiowa-Sikorski lemma guarantees that -generic filters exist whenever is countable.
The forcing relation (” forces ”) is defined by recursion on the complexity of and has the crucial property that a sentence holds in if and only if some condition forces . The forcing relation is definable within , even though the generic filter is not an element of . This allows properties of the extension to be analyzed entirely within the ground model.
To prove the independence of CH, Cohen used Cohen forcing, where consists of finite partial functions from to , ordered by reverse inclusion. Each generic filter determines new subsets of (new reals), and the resulting model satisfies , which contradicts CH. A careful argument (using the countable chain condition of Cohen forcing) shows that the construction preserves all cardinals, so in the extension is the same as in the ground model.
An elegant reformulation of forcing uses Boolean-valued models. Instead of working with a partial order and a generic filter, one replaces the two-valued truth values with a complete Boolean algebra and constructs a model in which every sentence receives a truth value in . The sentence holds in the forcing extension if and only if its Boolean value is nonzero. This approach, developed by Dana Scott and Robert Solovay shortly after Cohen’s work, eliminates the need to work with generic filters and makes the algebraic structure of forcing more transparent.
The forcing method has yielded a vast landscape of independence results beyond CH. Suslin’s hypothesis (that every complete dense linear order without endpoints satisfying the countable chain condition is isomorphic to ) was shown to be independent of ZFC by Solovay and Stanley Tennenbaum (1971). Martin’s axiom (MA), a combinatorial principle that holds in models obtained by certain iterated forcings, has become a standard tool; it implies many consequences of CH for sets of reals without implying CH itself. The current frontier includes the study of forcing axioms such as the Proper Forcing Axiom (PFA) and Martin’s Maximum (MM), introduced by Saharon Shelah and by Matthew Foreman, Menachem Magidor, and Shelah in the 1980s. These axioms, which assert that generic filters exist for broad classes of forcing notions, have strong structural consequences — for instance, MM implies — and they represent candidates for natural extensions of ZFC that might eventually settle open problems about the combinatorics of the continuum.
The independence phenomena revealed by Godel and Cohen fundamentally reshaped the philosophy of mathematics. They showed that the ZFC axioms, while sufficient to formalize virtually all of ordinary mathematics, leave deep questions about infinite sets genuinely undetermined. The search for new axioms — whether large cardinals, determinacy principles, or forcing axioms — that might resolve these questions in a natural and convincing way remains one of the central programs of contemporary set theory and mathematical logic.