Naive Set Theory
Paradoxes, Cantor's original theory, and the need for axiomatization.
Naive set theory is the original, intuitive approach to the mathematics of collections — the idea that any clearly defined property determines a set of objects satisfying that property. Developed by Georg Cantor in the 1870s, it provided the first rigorous language for talking about infinity, functions, and mathematical structures in a unified way. Yet this very generality harbors contradictions, most famously Russell’s paradox, which revealed that unrestricted set formation leads to logical inconsistency and spurred the development of the axiomatic set theories explored later in this branch.
Sets, Elements, and Membership
The concept of a set is among the most fundamental in all of mathematics. In his 1895 Beiträge zur Begründung der transfiniten Mengenlehre, Georg Cantor defined a set as “a gathering together into a whole of definite, distinct objects of our perception or our thought.” Those objects are called the elements or members of the set. The relationship between an element and a set is called membership, and it is written with the symbol : if is a member of the set , we write , and if is not a member, we write . Membership is the only primitive relation in set theory — everything else is built from it.
There are two standard ways to specify which elements belong to a set. Roster notation (also called enumeration) simply lists the elements inside curly braces:
This works well for small or finite sets, but it becomes impractical — or impossible — for large or infinite collections. The more powerful alternative is set-builder notation, which defines a set by a property that its members must satisfy:
The colon (or vertical bar) is read “such that,” so is “the set of all such that is a prime number.” Set-builder notation is where the real expressive power lies, and also where the danger lurks — as we shall see shortly.
Two special sets deserve immediate mention. The empty set, denoted or , is the set with no elements at all. It is unique: there is only one set with no members, a fact that follows from the extensionality principle discussed in the next section. At the other extreme, naive set theory entertains the notion of a universal set , the set of all objects under discussion in a given context. Venn diagrams, the overlapping-circle illustrations introduced by John Venn in 1880, typically depict sets as regions inside a rectangle representing . While Venn diagrams are invaluable as an intuitive aid, they should be treated as illustrations rather than proofs — the visual metaphor can mislead when sets have complex or infinite structure.
Cantor’s original vision — that every well-defined property determines a set — is sometimes called the unrestricted comprehension principle: for any property , there exists a set . This principle is wonderfully intuitive and, for most everyday mathematical purposes, entirely safe. But in 1901, Bertrand Russell demonstrated that it is logically self-contradictory. Consider the property “does not contain itself,” and form the set
Now ask: is a member of itself? If , then by the defining property, — a contradiction. If , then satisfies the defining property, so — again a contradiction. This is Russell’s paradox, and it shows that unrestricted comprehension is inconsistent. The paradox is not an exotic curiosity; it strikes at the very foundation of the system. Cantor himself had encountered related difficulties, including the paradox of the “set of all sets” (if such a set exists, its power set must be both a subset of and strictly larger than — a contradiction by Cantor’s own theorem). These paradoxes are precisely what motivated the development of axiomatic set theories such as Zermelo-Fraenkel set theory (ZFC), which replace unrestricted comprehension with carefully controlled axioms that avoid self-referential constructions while preserving the vast majority of useful set-theoretic reasoning.
Despite its logical flaws, naive set theory remains the working language of most mathematicians. The intuition it provides is indispensable, and its constructions are sound whenever we restrict attention to sets built from well-understood starting materials — which is exactly what the axioms of ZFC formalize, as explored in Axiomatic Set Theory (ZFC).
Subsets, Power Sets, and Equality
Given two sets and , we say that is a subset of , written , if every element of is also an element of :
If and — that is, contains at least one element not in — we say is a proper subset of and write (though some authors use to avoid ambiguity). The empty set is a subset of every set: since has no elements, the condition “every element of is in ” is vacuously true for any . Every set is also a subset of itself, so the subset relation is reflexive.
The concept of set equality rests on the axiom of extensionality, one of the most important principles in all of set theory. It states that two sets are equal if and only if they have exactly the same elements:
Equivalently, if and only if and . This “double containment” argument is the standard technique for proving two sets are equal: show that each is a subset of the other. Extensionality also tells us that a set is determined entirely by its members, not by how it is described. The sets , , and are all the same set, because they contain exactly the same elements. Order and repetition in roster notation do not matter.
The power set of a set , denoted , is the set of all subsets of :
For a concrete example, if , then . The power set always includes both (which is a subset of every set) and itself. For a finite set with elements, the power set has exactly elements:
The reason is combinatorial: each element of is either included or excluded from a given subset, giving two independent choices per element. This formula also explains the common notation for the power set.
The power set construction is far more than a bookkeeping device — it is the engine behind some of the deepest results in set theory. Cantor’s theorem, which we will encounter in Functions and Cardinality, states that for any set (finite or infinite), there is no surjection from onto . In other words, the power set is always strictly larger than the original set. For finite sets, this is the obvious fact that . For infinite sets, it is a profound result: it means there is no single “largest” infinity, but rather an endless hierarchy of ever-larger infinite cardinalities. This hierarchy, discovered by Cantor, was one of the most revolutionary ideas in the history of mathematics and remains at the heart of modern set theory.
Set Operations and Algebraic Laws
Just as arithmetic provides operations for combining numbers, set theory provides operations for combining sets. The union of two sets and is the set of all elements belonging to at least one of them:
The intersection of and is the set of elements belonging to both:
When — the two sets share no elements — we say and are disjoint. The complement of a set relative to a universal set is the set of all elements in that are not in :
More generally, the relative complement (or set difference) of in is defined without reference to a universal set:
Finally, the symmetric difference of and collects those elements belonging to exactly one of the two sets:
These operations obey a rich collection of algebraic laws that mirror — and historically inspired — the laws of Boolean algebra from propositional logic. The parallel is not accidental: union corresponds to disjunction (), intersection to conjunction (), and complementation to negation (). The key identities are summarized below.
| Law | Identity |
|---|---|
| Commutativity | and |
| Associativity | and likewise for |
| Distributivity | and dually |
| Idempotence | and |
| Identity | and |
| Annihilation | and |
| Complement | and |
| Double complement | |
| De Morgan’s laws | and |
De Morgan’s laws for sets, named after Augustus De Morgan who stated the logical versions in the 1840s, are particularly important. They express a fundamental duality: complementing a union yields the intersection of the complements, and vice versa. These laws generalize to arbitrary (even infinite) unions and intersections:
The algebraic structure formed by a collection of sets under , , and complementation (with and as identity elements) is a Boolean algebra — the very same structure that Boole’s propositional calculus gives rise to. This correspondence between logic and set theory is one of the most elegant features of the foundations of mathematics: the algebra of truth values and the algebra of collections are two manifestations of the same abstract structure. The connection runs even deeper through Stone’s representation theorem, which shows that every Boolean algebra is isomorphic to a field of sets — a topic explored further in the Lattices and Boolean Algebras section of axiomatic set theory.
Ordered Pairs and Relations
The sets we have discussed so far are unordered — the set is the same as . But much of mathematics depends on order: coordinates in the plane, the input-output behavior of functions, and the structure of relations all require us to distinguish from . The challenge is to define the notion of an ordered pair purely in terms of sets, so that the entire apparatus of set theory can be brought to bear.
The standard solution, due to Kazimierz Kuratowski in 1921, defines the ordered pair as:
This encoding looks peculiar, but it has the one property that matters: if and only if and . The proof is a short exercise in extensionality — one verifies by case analysis that equality of the outer sets forces equality of the components. Other definitions of ordered pair have been proposed (notably by Norbert Wiener in 1914), but Kuratowski’s is the one adopted as standard because of its simplicity.
With ordered pairs in hand, we define the Cartesian product of two sets and as the set of all ordered pairs whose first component comes from and whose second comes from :
The name honors Rene Descartes, whose coordinate geometry identified points in the plane with pairs of real numbers — the plane being . For finite sets, , which is why the operation is called a “product.” The construction extends to -tuples: is the set of all ordered -tuples with each .
A relation from to is any subset . When , we say that is related to by , often written . The domain of is the set of all first components, , and the range (or image) is the set of all second components, . A relation on a single set — that is, a subset of — is where the most interesting structural properties emerge.
Three properties of a relation on are especially important. is reflexive if for every — every element is related to itself. is symmetric if implies — the relation goes both ways. is transitive if and together imply — the relation can be chained. A relation that is reflexive, symmetric, and transitive is called an equivalence relation, and equivalence relations are ubiquitous in mathematics. Equality itself is the simplest example, but the concept extends to congruence modulo in number theory, similarity of geometric figures, isomorphism of algebraic structures, and countless other settings.
Every equivalence relation on a set partitions into disjoint equivalence classes: for each element , its equivalence class is , and the collection of all equivalence classes forms a partition of — a family of nonempty, pairwise disjoint subsets whose union is all of . Conversely, every partition of determines an equivalence relation (declare two elements equivalent if they belong to the same block). This bijection between equivalence relations and partitions is one of the most useful structural results in naive set theory.
When a relation is reflexive and transitive but replaces symmetry with antisymmetry — the condition that and together imply — the result is a partial order. Partial orders formalize the idea of ranking or hierarchy: the subset relation on the power set of any set is a partial order, as is the divisibility relation on positive integers. If, in addition, every two elements are comparable (for any , either or ), the partial order is a total order (or linear order) — the usual ordering on the real numbers is the prototypical example.
Finally, a function from to is a special kind of relation: it is a subset such that for every , there is exactly one with . This definition — a function as a set of ordered pairs satisfying a uniqueness condition — is one of the great conceptual achievements of the set-theoretic approach to mathematics. It frees the notion of function from any dependence on formulas, rules, or algorithms, and allows us to reason about functions with the full power of set theory. The deep study of functions, their properties, and their role in comparing the sizes of infinite sets is the subject of Functions and Cardinality.