In the foundations of mathematics, von Neumann–Bernays–Gödel set theory (NBG) is an axiomatic set theory that is a conservative extension of Zermelo–Fraenkel–choice set theory (ZFC). NBG introduces the notion of class, which is a collection of sets defined by a formula whose quantifiers range only over sets. NBG can define classes that are larger than sets, such as the class of all sets and the class of all ordinals. Morse–Kelley set theory (MK) allows classes to be defined by formulas whose quantifiers range over classes. NBG is finitely axiomatizable, while ZFC and MK are not.
A key theorem of NBG is the class existence theorem, which states that for every formula whose quantifiers range only over sets, there is a class consisting of the sets satisfying the formula. This class is built by mirroring the step-by-step construction of the formula with classes. Since all set-theoretic formulas are constructed from two kinds of atomic formulas (membership and equality) and finitely many logical symbols, only finitely many axioms are needed to build the classes satisfying them. This is why NBG is finitely axiomatizable. Classes are also used for other constructions, for handling the set-theoretic paradoxes, and for stating the axiom of global choice, which is stronger than ZFC's axiom of choice.
John von Neumann introduced classes into set theory in 1925. The primitive notions of his theory were function and argument. Using these notions, he defined class and set.[1] Paul Bernays reformulated von Neumann's theory by taking class and set as primitive notions.[2] Kurt Gödel simplified Bernays' theory for his relative consistency proof of the axiom of choice and the generalized continuum hypothesis.[3]
Classes have several uses in NBG:
Once classes are added to the language of ZFC, it is easy to transform ZFC into a set theory with classes. First, the axiom schema of class comprehension is added. This axiom schema states: For every formula [math]\displaystyle{ \phi(x_1, \ldots, x_n) }[/math] that quantifies only over sets, there exists a class [math]\displaystyle{ A }[/math] consisting of the [math]\displaystyle{ n }[/math]-tuples satisfying the formula—that is, [math]\displaystyle{ \forall x_1 \cdots \,\forall x_n [(x_1, \ldots , x_n) \in A \iff \phi(x_1, \ldots, x_n)]. }[/math] Then the axiom schema of replacement is replaced by a single axiom that uses a class. Finally, ZFC's axiom of extensionality is modified to handle classes: If two classes have the same elements, then they are identical. The other axioms of ZFC are not modified.[8]
This theory is not finitely axiomatized. ZFC's replacement schema has been replaced by a single axiom, but the axiom schema of class comprehension has been introduced.
To produce a theory with finitely many axioms, the axiom schema of class comprehension is first replaced with finitely many class existence axioms. Then these axioms are used to prove the class existence theorem, which implies every instance of the axiom schema.[8] The proof of this theorem requires only seven class existence axioms, which are used to convert the construction of a formula into the construction of a class satisfying the formula.
NBG has two types of objects: classes and sets. Intuitively, every set is also a class. There are two ways to axiomatize this. Bernays used many-sorted logic with two sorts: classes and sets.[2] Gödel avoided sorts by introducing primitive predicates: [math]\displaystyle{ \mathfrak{Cls}(A) }[/math] for "[math]\displaystyle{ A }[/math] is a class" and [math]\displaystyle{ \mathfrak{M}(A) }[/math] for "[math]\displaystyle{ A }[/math] is a set" (in German, "set" is Menge). He also introduced axioms stating that every set is a class and that if class [math]\displaystyle{ A }[/math] is a member of a class, then [math]\displaystyle{ A }[/math] is a set.[9] Using predicates is the standard way to eliminate sorts. Elliott Mendelson modified Gödel's approach by having everything be a class and defining the set predicate [math]\displaystyle{ M(A) }[/math] as [math]\displaystyle{ \exists C(A \in C). }[/math][10] This modification eliminates Gödel's class predicate and his two axioms.
Bernays' two-sorted approach may appear more natural at first, but it creates a more complex theory.[lower-alpha 2] In Bernays' theory, every set has two representations: one as a set and the other as a class. Also, there are two membership relations: the first, denoted by "∈", is between two sets; the second, denoted by "η", is between a set and a class.[2] This redundancy is required by many-sorted logic because variables of different sorts range over disjoint subdomains of the domain of discourse.
The differences between these two approaches do not affect what can be proved, but they do affect how statements are written. In Gödel's approach, [math]\displaystyle{ A \in C }[/math] where [math]\displaystyle{ A }[/math] and [math]\displaystyle{ C }[/math] are classes is a valid statement. In Bernays' approach this statement has no meaning. However, if [math]\displaystyle{ A }[/math] is a set, there is an equivalent statement: Define "set [math]\displaystyle{ a }[/math] represents class [math]\displaystyle{ A }[/math]" if they have the same sets as members—that is, [math]\displaystyle{ \forall x(x \in a \iff x \;\eta \;A). }[/math] The statement [math]\displaystyle{ a \;\eta \;C }[/math] where set [math]\displaystyle{ a }[/math] represents class [math]\displaystyle{ A }[/math] is equivalent to Gödel's [math]\displaystyle{ A \in C. }[/math][2]
The approach adopted in this article is that of Gödel with Mendelson's modification. This means that NBG is an axiomatic system in first-order predicate logic with equality, and its only primitive notions are class and the membership relation.
A set is a class that belongs to at least one class: [math]\displaystyle{ A }[/math] is a set if and only if [math]\displaystyle{ \exist C(A \in C) }[/math]. A class that is not a set is called a proper class: [math]\displaystyle{ A }[/math] is a proper class if and only if [math]\displaystyle{ \forall C(A \notin C) }[/math].[12] Therefore, every class is either a set or a proper class, and no class is both.
Gödel introduced the convention that uppercase variables range over classes, while lowercase variables range over sets.[9] Gödel also used names that begin with an uppercase letter to denote particular classes, including functions and relations defined on the class of all sets. Gödel's convention is used in this article. It allows us to write:
The following axioms and definitions are needed for the proof of the class existence theorem.
Axiom of extensionality. If two classes have the same elements, then they are identical.
This axiom generalizes ZFC's axiom of extensionality to classes.
Axiom of pairing. If [math]\displaystyle{ x }[/math] and [math]\displaystyle{ y }[/math] are sets, then there exists a set [math]\displaystyle{ p }[/math] whose only members are [math]\displaystyle{ x }[/math] and [math]\displaystyle{ y }[/math].
As in ZFC, the axiom of extensionality implies the uniqueness of the set [math]\displaystyle{ p }[/math], which allows us to introduce the notation [math]\displaystyle{ \{x, y\}. }[/math]
Ordered pairs are defined by:
Tuples are defined inductively using ordered pairs:
Class existence axioms will be used to prove the class existence theorem: For every formula in [math]\displaystyle{ n }[/math] free set variables that quantifies only over sets, there exists a class of [math]\displaystyle{ n }[/math]-tuples that satisfy it. The following example starts with two classes that are functions and builds a composite function. This example illustrates the techniques that are needed to prove the class existence theorem, which lead to the class existence axioms that are needed.
Example 1: If the classes [math]\displaystyle{ F }[/math] and [math]\displaystyle{ G }[/math] are functions, then the composite function [math]\displaystyle{ G \circ F }[/math] is defined by the formula: [math]\displaystyle{ \exists t[(x, t) \in F \,\land\, (t, y) \in G]. }[/math] Since this formula has two free set variables, [math]\displaystyle{ x }[/math] and [math]\displaystyle{ y, }[/math] the class existence theorem constructs the class of ordered pairs: [math]\displaystyle{ G \circ F\,=\,\{(x, y): \exists t[(x, t) \in F \,\land\, (t, y) \in G]\}. }[/math]
Because this formula is built from simpler formulas using conjunction [math]\displaystyle{ \land }[/math] and existential quantification [math]\displaystyle{ \exists }[/math], class operations are needed that take classes representing the simpler formulas and produce classes representing the formulas with [math]\displaystyle{ \land }[/math] and [math]\displaystyle{ \exists }[/math]. To produce a class representing a formula with [math]\displaystyle{ \land }[/math], intersection used since [math]\displaystyle{ x \in A \cap B \iff x \in A \land x \in B. }[/math] To produce a class representing a formula with [math]\displaystyle{ \exists }[/math], the domain is used since [math]\displaystyle{ x \in Dom(A) \iff \exists t[(x,t) \in A]. }[/math] Before taking the intersection, the tuples in [math]\displaystyle{ F }[/math] and [math]\displaystyle{ G }[/math] need an extra component so they have the same variables. The component [math]\displaystyle{ y }[/math] is added to the tuples of [math]\displaystyle{ F }[/math] and [math]\displaystyle{ x }[/math] is added to the tuples of [math]\displaystyle{ G }[/math]: [math]\displaystyle{ F' = \{(x, t, y): (x, t) \in F\}\, }[/math] and [math]\displaystyle{ \,G' = \{(t, y, x): (t, y) \in G\} }[/math] In the definition of [math]\displaystyle{ F', }[/math] the variable [math]\displaystyle{ y }[/math] is not restricted by the statement [math]\displaystyle{ (x, t) \in F, }[/math] so [math]\displaystyle{ y }[/math] ranges over the class [math]\displaystyle{ V }[/math] of all sets. Similarly, in the definition of [math]\displaystyle{ G', }[/math] the variable [math]\displaystyle{ x }[/math] ranges over [math]\displaystyle{ V. }[/math] So an axiom is needed that adds an extra component (whose values range over [math]\displaystyle{ V }[/math]) to the tuples of a given class. Next, the variables are put in the same order to prepare for the intersection: [math]\displaystyle{ F'' = \{(x, y, t): (x, t) \in F\}\, }[/math] and [math]\displaystyle{ \,G'' = \{(x, y, t): (t, y) \in G\} }[/math] To go from [math]\displaystyle{ F' }[/math] to [math]\displaystyle{ F'' }[/math] and from [math]\displaystyle{ G' }[/math] to [math]\displaystyle{ G'' }[/math] requires two different permutations, so axioms that support permutations of tuple components are needed. The intersection of [math]\displaystyle{ F'' }[/math] and [math]\displaystyle{ G'' }[/math] handles [math]\displaystyle{ \land }[/math]: [math]\displaystyle{ F'' \cap G'' = \{(x, y, t): (x, t) \in F \,\land\, (t, y) \in G\} }[/math] Since [math]\displaystyle{ (x, y, t) }[/math] is defined as [math]\displaystyle{ ((x, y), t) }[/math], taking the domain of [math]\displaystyle{ F'' \cap G'' }[/math] handles [math]\displaystyle{ \exists t }[/math] and produces the composite function: [math]\displaystyle{ G \circ F = Dom(F'' \cap G'') = \{(x, y): \exists t ((x, t) \in F \,\land\, (t, y) \in G)\} }[/math] So axioms of intersection and domain are needed. |
The class existence axioms are divided into two groups: axioms handling language primitives and axioms handling tuples. There are four axioms in the first group and three axioms in the second group.[lower-alpha 4]
Axioms for handling language primitives:
Membership. There exists a class [math]\displaystyle{ E }[/math] containing all the ordered pairs whose first component is a member of the second component.
Intersection (conjunction). For any two classes [math]\displaystyle{ A }[/math] and [math]\displaystyle{ B }[/math], there is a class [math]\displaystyle{ C }[/math] consisting precisely of the sets that belong to both [math]\displaystyle{ A }[/math] and [math]\displaystyle{ B }[/math].
Complement (negation). For any class [math]\displaystyle{ A }[/math], there is a class [math]\displaystyle{ B }[/math] consisting precisely of the sets not belonging to [math]\displaystyle{ A }[/math].
Domain (existential quantifier). For any class [math]\displaystyle{ A }[/math], there is a class [math]\displaystyle{ B }[/math] consisting precisely of the first components of the ordered pairs of [math]\displaystyle{ A }[/math].
By the axiom of extensionality, class [math]\displaystyle{ C }[/math] in the intersection axiom and class [math]\displaystyle{ B }[/math] in the complement and domain axioms are unique. They will be denoted by: [math]\displaystyle{ A \cap B, }[/math] [math]\displaystyle{ \complement A, }[/math] and [math]\displaystyle{ Dom(A), }[/math] respectively.[lower-alpha 5] On the other hand, extensionality is not applicable to [math]\displaystyle{ E }[/math] in the membership axiom since it specifies only those sets in [math]\displaystyle{ E }[/math] that are ordered pairs.[clarification needed]
The first three axioms imply the existence of the empty class and the class of all sets: The membership axiom implies the existence of a class [math]\displaystyle{ E. }[/math] The intersection and complement axioms imply the existence of [math]\displaystyle{ E \cap \complement E }[/math], which is empty. By the axiom of extensionality, this class is unique; it is denoted by [math]\displaystyle{ \empty. }[/math] The complement of [math]\displaystyle{ \empty }[/math] is the class [math]\displaystyle{ V }[/math] of all sets, which is also unique by extensionality. The set predicate [math]\displaystyle{ M(A) }[/math], which was defined as [math]\displaystyle{ \exists C(A \in C) }[/math], is now redefined as [math]\displaystyle{ A \in V }[/math] to avoid quantifying over classes.
Axioms for handling tuples:
Product by [math]\displaystyle{ V }[/math]. For any class [math]\displaystyle{ A }[/math], there is a class [math]\displaystyle{ B }[/math] consisting of the ordered pairs whose first component belongs to [math]\displaystyle{ A }[/math].
Circular permutation. For any class [math]\displaystyle{ A }[/math], there is a class [math]\displaystyle{ B }[/math] whose 3‑tuples are obtained by applying the circular permutation [math]\displaystyle{ (y,z,x) \mapsto (x,y,z) }[/math] to the 3‑tuples of [math]\displaystyle{ A }[/math].
Transposition. For any class [math]\displaystyle{ A }[/math], there is a class [math]\displaystyle{ B }[/math] whose 3‑tuples are obtained by transposing the last two components of the 3‑tuples of [math]\displaystyle{ A }[/math].
By extensionality, the product by [math]\displaystyle{ V }[/math] axiom implies the existence of a unique class, which is denoted by [math]\displaystyle{ A \times V. }[/math] This axiom is used to define the class [math]\displaystyle{ V^n }[/math] of all [math]\displaystyle{ n }[/math]-tuples: [math]\displaystyle{ V^1 = V }[/math] and [math]\displaystyle{ V^{n+1} = V^n \times V.\, }[/math] If [math]\displaystyle{ A }[/math] is a class, extensionality implies that [math]\displaystyle{ A \cap V^n }[/math] is the unique class consisting of the [math]\displaystyle{ n }[/math]-tuples of [math]\displaystyle{ A. }[/math] For example, the membership axiom produces a class [math]\displaystyle{ E }[/math] that may contain elements that are not ordered pairs, while the intersection [math]\displaystyle{ E \cap V^2 }[/math] contains only the ordered pairs of [math]\displaystyle{ E }[/math].
The circular permutation and transposition axioms do not imply the existence of unique classes because they specify only the 3‑tuples of class [math]\displaystyle{ B. }[/math] By specifying the 3‑tuples, these axioms also specify the [math]\displaystyle{ n }[/math]-tuples for [math]\displaystyle{ n \ge 4 }[/math] since: [math]\displaystyle{ (x_1, \ldots, x_{n-2}, x_{n-1}, x_n) = ((x_1, \ldots, x_{n-2}), x_{n-1}, x_n). }[/math] The axioms for handling tuples and the domain axiom imply the following lemma, which is used in the proof of the class existence theorem.
Tuple lemma —
One more axiom is needed to prove the class existence theorem: the axiom of regularity. Since the existence of the empty class has been proved, the usual statement of this axiom is given.[lower-alpha 6]
Axiom of regularity. Every nonempty set has at least one element with which it has no element in common. [math]\displaystyle{ \forall a\,[a \neq \empty \implies \exists u(u \in a \land u \cap a = \empty)]. }[/math]
This axiom implies that a set cannot belong to itself: Assume that [math]\displaystyle{ x \in x }[/math] and let [math]\displaystyle{ a = \{x\}. }[/math] Then [math]\displaystyle{ x \cap a \ne \empty }[/math] since [math]\displaystyle{ x \in x \cap a. }[/math] This contradicts the axiom of regularity because [math]\displaystyle{ x }[/math] is the only element in [math]\displaystyle{ a. }[/math] Therefore, [math]\displaystyle{ x \notin x. }[/math] The axiom of regularity also prohibits infinite descending membership sequences of sets: [math]\displaystyle{ \cdots \in x_{n+1} \in x_n \in \cdots \in x_1 \in x_0. }[/math]
Gödel stated regularity for classes rather than for sets in his 1940 monograph, which was based on lectures given in 1938.[26] In 1939, he proved that regularity for sets implies regularity for classes.[27]
Class existence theorem — Let [math]\displaystyle{ \phi(x_1, \dots, x_n, Y_1, \dots, Y_m) }[/math] be a formula that quantifies only over sets and contains no free variables other than [math]\displaystyle{ x_1, \dots, x_n, Y_1, \dots, Y_m }[/math] (not necessarily all of these). Then for all [math]\displaystyle{ Y_1, \dots, Y_m }[/math], there exists a unique class [math]\displaystyle{ A }[/math] of [math]\displaystyle{ n }[/math]-tuples such that: [math]\displaystyle{ \forall x_1 \cdots \,\forall x_n [(x_1, \dots , x_n) \in A \iff \phi(x_1, \dots, x_n, Y_1, \dots, Y_m)]. }[/math] The class [math]\displaystyle{ A }[/math] is denoted by [math]\displaystyle{ \{(x_1, \dots , x_n): \phi(x_1, \dots, x_n, Y_1, \dots, Y_m)\}. }[/math][lower-alpha 7]
The theorem's proof will be done in two steps:
Transformation rules. In rules 1 and 2 below, [math]\displaystyle{ \Delta }[/math] and [math]\displaystyle{ \Gamma }[/math] denote set or class variables. These two rules eliminate all occurrences of class variables before an [math]\displaystyle{ \in }[/math] and all occurrences of equality. Each time rule 1 or 2 is applied to a subformula, [math]\displaystyle{ i }[/math] is chosen so that [math]\displaystyle{ z_i }[/math] differs from the other variables in the current formula. The three rules are repeated until there are no subformulas to which they can be applied. This produces a formula that is built only with [math]\displaystyle{ \neg }[/math], [math]\displaystyle{ \land }[/math], [math]\displaystyle{ \exists }[/math], [math]\displaystyle{ \in }[/math], set variables, and class variables [math]\displaystyle{ Y_k }[/math] where [math]\displaystyle{ Y_k }[/math] does not appear before an [math]\displaystyle{ \in }[/math].
Transformation rules: bound variables. Consider the composite function formula of example 1 with its free set variables replaced by [math]\displaystyle{ x_1 }[/math] and [math]\displaystyle{ x_2 }[/math]: [math]\displaystyle{ \exists t[(x_1, t) \in F \,\land\, (t, x_2) \in G]. }[/math] The inductive proof will remove [math]\displaystyle{ \exists t }[/math], which produces the formula [math]\displaystyle{ (x_1, t) \in F \land (t, x_2) \in G. }[/math] However, since the class existence theorem is stated for subscripted variables, this formula does not have the form expected by the induction hypothesis. This problem is solved by replacing the variable [math]\displaystyle{ t }[/math] with [math]\displaystyle{ x_3. }[/math] Bound variables within nested quantifiers are handled by increasing the subscript by one for each successive quantifier. This leads to rule 4, which must be applied after the other rules since rules 1 and 2 produce quantified variables.
Example 2: Rule 4 is applied to the formula [math]\displaystyle{ \phi(x_1) }[/math] that defines the class consisting of all sets of the form [math]\displaystyle{ \{\empty, \{\empty, \dots\}, \dots\}. }[/math] That is, sets that contain at least [math]\displaystyle{ \empty }[/math] and a set containing [math]\displaystyle{ \empty }[/math] — for example, [math]\displaystyle{ \{\empty, \{\empty, a, b, c\}, d, e\} }[/math] where [math]\displaystyle{ a, b, c, d, }[/math] and [math]\displaystyle{ e }[/math] are sets.
[math]\displaystyle{ \begin{align} \phi(x_1) \,&=\, \exists u\;\,[\,u \in x_1 \,\land\, \neg\exists v\;\,(\;v\, \in \,u\,)] \,\land\, \,\exists w\;\bigl(w \in x_1 \,\land\, \exists y\;\,[(\;y\, \in w \;\land\; \neg\exists z\;\,(\;z\, \in \,y\,)]\bigr) \\ \phi_r(x_1) \, &=\, \exists x_2[x_2 \!\in\! x_1 \,\land\, \neg\exists x_3(x_3 \!\in\! x_2)] \,\land\, \,\exists x_2\bigl(x_2 \!\in\! x_1 \,\land\, \exists x_3[(x_3 \!\in\! x_2 \,\land\, \neg\exists x_4(x_4 \!\in\! x_3)]\bigr) \end{align} }[/math] Since [math]\displaystyle{ x_1 }[/math] is the only free variable, [math]\displaystyle{ n = 1. }[/math] The quantified variable [math]\displaystyle{ x_3 }[/math] appears twice in [math]\displaystyle{ x_3 \in x_2 }[/math] at nesting depth 2. Its subscript is 3 because [math]\displaystyle{ n+q = 1+2 = 3. }[/math] If two quantifier scopes are at the same nesting depth, they are either identical or disjoint. The two occurrences of [math]\displaystyle{ x_3 }[/math] are in disjoint quantifier scopes, so they do not interact with each other. |
Proof of the class existence theorem. The proof starts by applying the transformation rules to the given formula to produce a transformed formula. Since this formula is equivalent to the given formula, the proof is completed by proving the class existence theorem for transformed formulas.
The following lemma is used in the proof.
Expansion lemma — Let [math]\displaystyle{ 1 \le i \lt j \le n, }[/math] and let [math]\displaystyle{ P }[/math] be a class containing all the ordered pairs [math]\displaystyle{ (x_i, x_j) }[/math] satisfying [math]\displaystyle{ R(x_i, x_j). }[/math] That is, [math]\displaystyle{ P \supseteq \{(x_i, x_j): R(x_i, x_j)\}. }[/math] Then [math]\displaystyle{ P }[/math] can be expanded into the unique class [math]\displaystyle{ Q }[/math] of [math]\displaystyle{ n }[/math]-tuples satisfying [math]\displaystyle{ R(x_i, x_j) }[/math]. That is, [math]\displaystyle{ Q = \{(x_1, \ldots , x_n): R(x_i, x_j)\}. }[/math]
Proof:
Class existence theorem for transformed formulas — Let [math]\displaystyle{ \phi(x_1, \ldots, x_n, Y_1, \ldots, Y_m) }[/math] be a formula that:
Then for all [math]\displaystyle{ Y_1, \dots, Y_m }[/math], there exists a unique class [math]\displaystyle{ A }[/math] of [math]\displaystyle{ n }[/math]-tuples such that: [math]\displaystyle{ \forall x_1 \cdots \,\forall x_n [(x_1, \ldots , x_n) \in A \iff \phi(x_1, \ldots, x_n, Y_1, \ldots, Y_m)]. }[/math]
Proof: Basis step: [math]\displaystyle{ \phi }[/math] has 0 logical symbols. The theorem's hypothesis implies that [math]\displaystyle{ \phi }[/math] is an atomic formula of the form [math]\displaystyle{ x_i \in x_j }[/math] or [math]\displaystyle{ x_i \in Y_k. }[/math]
Case 1: If [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in x_j }[/math], we build the class [math]\displaystyle{ E_{i,j,n} = \{(x_1, \ldots, x_{n}): x_i \in x_j\}, }[/math] the unique class of [math]\displaystyle{ n }[/math]-tuples satisfying [math]\displaystyle{ x_i \in x_j. }[/math]
Case a: [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in x_j }[/math] where [math]\displaystyle{ i \lt j. }[/math] The axiom of membership produces a class [math]\displaystyle{ P }[/math] containing all the ordered pairs [math]\displaystyle{ (x_i, x_j) }[/math] satisfying [math]\displaystyle{ x_i \in x_j. }[/math] Apply the expansion lemma to [math]\displaystyle{ P }[/math] to obtain [math]\displaystyle{ E_{i,j,n} = \{(x_1, \ldots, x_{n}): x_i \in x_j\}. }[/math]
Case b: [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in x_j }[/math] where [math]\displaystyle{ i \gt j. }[/math] The axiom of membership produces a class [math]\displaystyle{ P }[/math] containing all the ordered pairs [math]\displaystyle{ (x_i, x_j) }[/math] satisfying [math]\displaystyle{ x_i \in x_j. }[/math] Apply the tuple lemma's statement 4 to [math]\displaystyle{ P }[/math] to obtain [math]\displaystyle{ P' }[/math] containing all the ordered pairs [math]\displaystyle{ (x_j, x_i) }[/math] satisfying [math]\displaystyle{ x_i \in x_j. }[/math] Apply the expansion lemma to [math]\displaystyle{ P' }[/math] to obtain [math]\displaystyle{ E_{i,j,n} = \{(x_1, \ldots, x_{n}): x_i \in x_j\}. }[/math]
Case c: [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in x_j }[/math] where [math]\displaystyle{ i = j. }[/math] Since this formula is false by the axiom of regularity, no [math]\displaystyle{ n }[/math]-tuples satisfy it, so [math]\displaystyle{ E_{i,j,n} = \empty. }[/math]
Case 2: If [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in Y_k }[/math], we build the class [math]\displaystyle{ E_{i,Y_k,n} = \{(x_1, \ldots, x_{n}): x_i \in Y_k\}, }[/math] the unique class of [math]\displaystyle{ n }[/math]-tuples satisfying [math]\displaystyle{ x_i \in Y_k. }[/math]
Case a: [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in Y_k }[/math] where [math]\displaystyle{ i \lt n. }[/math] Apply the axiom of product by [math]\displaystyle{ V }[/math] to [math]\displaystyle{ Y_k }[/math] to produce the class [math]\displaystyle{ P = Y_k \times V = \{(x_i, x_{i+1}): x_i \in Y_k\}. }[/math] Apply the expansion lemma to [math]\displaystyle{ P }[/math] to obtain [math]\displaystyle{ E_{i,Y_k,n} = \{(x_1, \ldots, x_{n}): x_i \in Y_k\}. }[/math]
Case b: [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in Y_k }[/math] where [math]\displaystyle{ i = n \gt 1. }[/math] Apply the axiom of product by [math]\displaystyle{ V }[/math] to [math]\displaystyle{ Y_k }[/math] to produce the class [math]\displaystyle{ P = Y_k \times V = \{(x_i, x_{i-1}): x_i \in Y_k\}. }[/math] Apply the tuple lemma's statement 4 to [math]\displaystyle{ P }[/math] to obtain [math]\displaystyle{ P' = V \times Y_k = \{(x_{i-1}, x_i): x_i \in Y_k\}. }[/math] Apply the expansion lemma to [math]\displaystyle{ P' }[/math] to obtain [math]\displaystyle{ E_{i,Y_k,n} = \{(x_1, \ldots, x_{n}): x_i \in Y_k\}. }[/math]
Case c: [math]\displaystyle{ \phi }[/math] is [math]\displaystyle{ x_i \in Y_k }[/math] where [math]\displaystyle{ i = n = 1. }[/math] Then [math]\displaystyle{ E_{i,Y_k,n} = Y_k. }[/math]
Inductive step: [math]\displaystyle{ \phi }[/math] has [math]\displaystyle{ k }[/math] logical symbols where [math]\displaystyle{ k\gt 0 }[/math]. Assume the induction hypothesis that the theorem is true for all [math]\displaystyle{ \psi }[/math] with less than [math]\displaystyle{ k }[/math] logical symbols. We now prove the theorem for [math]\displaystyle{ \phi }[/math] with [math]\displaystyle{ k }[/math] logical symbols. In this proof, the list of class variables [math]\displaystyle{ Y_1, \dots, Y_m }[/math] is abbreviated by [math]\displaystyle{ \vec{Y} }[/math], so a formula—such as [math]\displaystyle{ \phi(x_1, \dots, x_n, Y_1, \dots, Y_m) }[/math]—can be written as [math]\displaystyle{ \phi(x_1, \dots, x_n, \vec{Y}). }[/math]
Case 1: [math]\displaystyle{ \phi(x_1, \ldots, x_n, \vec{Y}) = \neg \psi(x_1, \ldots, x_n, \vec{Y}). }[/math] Since [math]\displaystyle{ \psi }[/math] has [math]\displaystyle{ k-1 }[/math] logical symbols, the induction hypothesis implies that there is a unique class [math]\displaystyle{ A }[/math] of [math]\displaystyle{ n }[/math]-tuples such that: [math]\displaystyle{ \quad(x_1, \ldots , x_n) \in A \iff \psi(x_1, \ldots, x_n, \vec{Y}). }[/math] By the complement axiom, there is a class [math]\displaystyle{ \complement A }[/math] such that [math]\displaystyle{ \forall u\,[u\in \complement A \iff \neg(u \in A)]. }[/math] However, [math]\displaystyle{ \complement A }[/math] contains elements other than [math]\displaystyle{ n }[/math]-tuples if [math]\displaystyle{ n \gt 1. }[/math] To eliminate these elements, use [math]\displaystyle{ \complement_{V^n} A =\, }[/math][math]\displaystyle{ \complement A \cap V^n =\, }[/math][math]\displaystyle{ V^n \setminus A, }[/math] which is the complement relative to the class [math]\displaystyle{ V^n }[/math] of all [math]\displaystyle{ n }[/math]-tuples.[lower-alpha 5] Then, by extensionality, [math]\displaystyle{ \complement_{V^n} A }[/math] is the unique class of [math]\displaystyle{ n }[/math]-tuples such that: [math]\displaystyle{ \begin{alignat}{2} \quad&(x_1, \ldots, x_n) \in \complement_{V^n} A &&\iff \neg [(x_1, \ldots , x_n) \in A] \\ & &&\iff \neg \psi(x_1, \ldots, x_n, \vec{Y}) \\ & &&\iff \phi(x_1, \ldots, x_n, \vec{Y}). \end{alignat} }[/math]
Case 2: [math]\displaystyle{ \phi(x_1, \ldots, x_n, \vec{Y}) = \psi_1(x_1, \ldots, x_n, \vec{Y}) \land \psi_2(x_1, \ldots, x_n, \vec{Y}). }[/math] Since both [math]\displaystyle{ \psi_1 }[/math] and [math]\displaystyle{ \psi_2 }[/math] have less than [math]\displaystyle{ k }[/math] logical symbols, the induction hypothesis implies that there are unique classes of [math]\displaystyle{ n }[/math]-tuples, [math]\displaystyle{ A_1 }[/math] and [math]\displaystyle{ A_2 }[/math], such that:
By the axioms of intersection and extensionality, [math]\displaystyle{ A_1 \cap A_2 }[/math] is the unique class of [math]\displaystyle{ n }[/math]-tuples such that: [math]\displaystyle{ \begin{alignat}{2} \quad&(x_1, \ldots, x_n) \in A_1 \cap A_2 &&\iff (x_1, \ldots, x_n) \in A_1 \land (x_1, \ldots, x_n) \in A_2 \\ & &&\iff \psi_1(x_1, \ldots, x_n, \vec{Y}) \land \psi_2(x_1, \ldots, x_n, \vec{Y}) \\ & &&\iff \phi(x_1, \ldots, x_n, \vec{Y}). \end{alignat} }[/math]
Case 3: [math]\displaystyle{ \phi(x_1, \ldots, x_n, \vec{Y}) = \exists x_{n+1} \psi(x_1, \ldots, x_n, x_{n+1}, \vec{Y}). }[/math] The quantifier nesting depth of [math]\displaystyle{ \psi }[/math] is one more than that of [math]\displaystyle{ \phi }[/math] and the additional free variable is [math]\displaystyle{ x_{n+1}. }[/math] Since [math]\displaystyle{ \psi }[/math] has [math]\displaystyle{ k-1 }[/math] logical symbols, the induction hypothesis implies that there is a unique class [math]\displaystyle{ A }[/math] of [math]\displaystyle{ (n+1) }[/math]-tuples such that: [math]\displaystyle{ \quad(x_1, \ldots, x_n, x_{n+1}) \in A \iff \psi(x_1, \ldots, x_n, x_{n+1}, \vec{Y}). }[/math] By the axioms of domain and extensionality, [math]\displaystyle{ Dom(A) }[/math] is the unique class of [math]\displaystyle{ n }[/math]-tuples such that:[lower-alpha 8] [math]\displaystyle{ \begin{alignat}{2} \quad&(x_1, \ldots, x_n) \in Dom(A) &&\iff \exists x_{n+1}[((x_1, \ldots, x_n), x_{n+1}) \in A] \\ & &&\iff \exists x_{n+1}[(x_1, \ldots, x_n, x_{n+1}) \in A] \\ & &&\iff \exists x_{n+1}\,\psi(x_1, \ldots, x_n, x_{n+1}, \vec{Y}) \\ & &&\iff \phi(x_1, \ldots, x_n, \vec{Y}). \end{alignat} }[/math]
Gödel pointed out that the class existence theorem "is a metatheorem, that is, a theorem about the system [NBG], not in the system …"[30] It is a theorem about NBG because it is proved in the metatheory by induction on NBG formulas. Also, its proof—instead of invoking finitely many NBG axioms—inductively describes how to use NBG axioms to construct a class satisfying a given formula. For every formula, this description can be turned into a constructive existence proof that is in NBG. Therefore, this metatheorem can generate the NBG proofs that replace uses of NBG's class existence theorem.
A recursive computer program succinctly captures the construction of a class from a given formula. The definition of this program does not depend on the proof of the class existence theorem. However, the proof is needed to prove that the class constructed by the program satisfies the given formula and is built using the axioms. This program is written in pseudocode that uses a Pascal-style case statement.[lower-alpha 9] [math]\displaystyle{ \begin{array}{l} \mathbf{function} \;\text{Class}(\phi, \,n) \\ \quad\begin{array}{rl} \mathbf{input}\!: \;\,&\phi \text{ is a transformed formula of the form } \phi(x_1, \ldots, x_n, Y_1, \ldots, Y_m); \\ &n \text{ specifies that a class of } n\text{-tuples is returned.} \\ \;\;\;\;\mathbf{output}\!: \;\,&\text{class } A \text{ of } n\text{-tuples satisfying } \\ &\,\forall x_1 \cdots \,\forall x_n [(x_1, \ldots , x_n) \in A \iff \phi(x_1, \ldots, x_n, Y_1, \ldots, Y_m)]. \end{array} \\ \mathbf{begin} \\ \quad \mathbf{case} \;\phi \;\mathbf{of} \\ \qquad \begin{alignat}{2} x_i \in x_j: \;\;&\mathbf{return} \;\,E_{i,j,n}; &&\text{// } E_{i,j,n} \;\,= \{(x_1, \dots, x_n): x_i \in x_j\} \\ x_i \in Y_k: \;\;&\mathbf{return} \;\,E_{i,Y_k,n}; &&\text{// } E_{i,Y_k,n} = \{(x_1, \dots, x_{n}): x_i \in Y_k\} \\ \neg\psi: \;\;&\mathbf{return} \;\,\complement_{V^n}\text{Class}(\psi, \,n); &&\text{// } \complement_{V^n}\text{Class}(\psi, \,n) = V^n \setminus \text{Class}(\psi, \,n) \\ \psi_1 \land \psi_2: \;\;&\mathbf{return} \;\,\text{Class}(\psi_1, \,n) \cap \text{Class}(\psi_2, \,n);&& \\ \;\;\;\;\,\exists x_{n+1}(\psi): \;\;&\mathbf{return} \;\,Dom(\text{Class}(\psi, \,n+1)); &&\text{// } x_{n+1} \text{ is free in } \psi; \text{ Class}(\psi, \,n+1) \\ &\ &&\text{// returns a class of } (n+1)\text{-tuples} \end{alignat} \\ \quad \mathbf{end} \\ \mathbf{end} \end{array} }[/math]
Let [math]\displaystyle{ \phi }[/math] be the formula of example 2. The function call [math]\displaystyle{ A = Class(\phi, 1) }[/math] generates the class [math]\displaystyle{ A, }[/math] which is compared below with [math]\displaystyle{ \phi. }[/math] This shows that the construction of the class [math]\displaystyle{ A }[/math] mirrors the construction of its defining formula [math]\displaystyle{ \phi. }[/math]
[math]\displaystyle{ \begin{alignat}{2} &\phi \;&&= \;\;\exists x_2\,(x_2 \!\in\! x_1 \land \;\;\neg\;\;\;\;\exists x_3\;(x_3 \!\in\! x_2)) \,\land \;\;\,\exists x_2\,(x_2 \!\in\! x_1 \land \;\;\,\exists x_3\,(x_3 \!\in\! x_2 \,\land\;\;\neg\;\;\;\;\exists x_4\;(x_4 \!\in\! x_3))) \\ &A \;&&= Dom\,(\;E_{2,1,2}\; \cap \;\complement_{V^2}\,Dom\,(\;E_{3,2,3}\;)) \,\cap\, Dom\,(\;E_{2,1,2}\;\cap \, Dom\,(\;\,E_{3,2,3}\; \cap \;\complement_{V^3}\,Dom\,(\;E_{4,3,4}\;))) \end{alignat} }[/math]
Gödel extended the class existence theorem to formulas [math]\displaystyle{ \phi }[/math] containing relations over classes (such as [math]\displaystyle{ Y_1 \subseteq Y_2 }[/math] and the unary relation [math]\displaystyle{ M(Y_1) }[/math]), special classes (such as [math]\displaystyle{ Ord }[/math] ), and operations (such as [math]\displaystyle{ (x_1, x_2) }[/math] and [math]\displaystyle{ x_1 \cap Y_1 }[/math]).[32] To extend the class existence theorem, the formulas defining relations, special classes, and operations must quantify only over sets. Then [math]\displaystyle{ \phi }[/math] can be transformed into an equivalent formula satisfying the hypothesis of the class existence theorem.
The following definitions specify how formulas define relations, special classes, and operations:
A term is defined by:
The following transformation rules eliminate relations, special classes, and operations. Each time rule 2b, 3b, or 4 is applied to a subformula, [math]\displaystyle{ i }[/math] is chosen so that [math]\displaystyle{ z_i }[/math] differs from the other variables in the current formula. The rules are repeated until there are no subformulas to which they can be applied. [math]\displaystyle{ \, \Gamma_1, \dots, \Gamma_k, \Gamma, }[/math] and [math]\displaystyle{ \Delta }[/math] denote terms.
Example 3: Transforming [math]\displaystyle{ Y_1 \subseteq Y_2. }[/math]
[math]\displaystyle{ Y_1 \subseteq Y_2 \iff \forall z_1(z_1 \in Y_1 \implies z_1 \in Y_2) \quad\text{(rule 1)} }[/math] |
Example 4: Transforming [math]\displaystyle{ x_1 \cap Y_1 \in x_2. }[/math]
[math]\displaystyle{ \begin{alignat}{2} x_1 \cap Y_1 \in x_2 &\iff \exists z_1[z_1 = x_1 \cap Y_1 \,\land\, z_1 \in x_2] &&\text{(rule 3b)} \\ &\iff \exists z_1[\forall z_2(z_2 \in z_1 \iff z_2 \in x_1 \cap Y_1) \,\land\, z_1 \in x_2] &&\text{(rule 4)} \\ &\iff \exists z_1[\forall z_2(z_2 \in z_1 \iff z_2 \in x_1 \land z_2 \in Y_1) \,\land\, z_1 \in x_2] \quad&&\text{(rule 3a)} \\ \end{alignat} }[/math] This example illustrates how the transformation rules work together to eliminate an operation. |
Class existence theorem (extended version) — Let [math]\displaystyle{ \phi(x_1, \dots, x_n, Y_1, \dots, Y_m) }[/math] be a formula that quantifies only over sets, contains no free variables other than [math]\displaystyle{ x_1, \dots, x_n, Y_1, \dots, Y_m }[/math], and may contain relations, special classes, and operations defined by formulas that quantify only over sets. Then for all [math]\displaystyle{ Y_1, \dots, Y_m, }[/math] there exists a unique class [math]\displaystyle{ A }[/math] of [math]\displaystyle{ n }[/math]-tuples such that [math]\displaystyle{ \forall x_1 \cdots \,\forall x_n [(x_1, \dots , x_n) \in A \iff \phi(x_1, \dots, x_n, Y_1, \dots, Y_m)]. }[/math][lower-alpha 10]
Apply the transformation rules to [math]\displaystyle{ \phi }[/math] to produce an equivalent formula containing no relations, special classes, or operations. This formula satisfies the hypothesis of the class existence theorem. Therefore, for all [math]\displaystyle{ Y_1, \dots, Y_m, }[/math] there is a unique class [math]\displaystyle{ A }[/math] of [math]\displaystyle{ n }[/math]-tuples satisfying [math]\displaystyle{ \forall x_1 \cdots \,\forall x_n [(x_1, \dots, x_n) \in A \iff \phi(x_1, \dots, x_n, Y_1, \dots, Y_m)]. }[/math]
The axioms of pairing and regularity, which were needed for the proof of the class existence theorem, have been given above. NBG contains four other set axioms. Three of these axioms deal with class operations being applied to sets.
Definition. [math]\displaystyle{ F }[/math] is a function if [math]\displaystyle{ F \subseteq V^2 \land \forall x\, \forall y\, \forall z\, [(x,y) \in F \,\land\, (x,z) \in F \implies y = z]. }[/math]
In set theory, the definition of a function does not require specifying the domain or codomain of the function (see Function (set theory)). NBG's definition of function generalizes ZFC's definition from a set of ordered pairs to a class of ordered pairs.
ZFC's definitions of the set operations of image, union, and power set are also generalized to class operations. The image of class [math]\displaystyle{ A }[/math] under the function [math]\displaystyle{ F }[/math] is [math]\displaystyle{ F[A] = \{y: \exists x(x \in A \,\land\, (x, y) \in F)\}. }[/math] This definition does not require that [math]\displaystyle{ A \subseteq Dom(F). }[/math] The union of class [math]\displaystyle{ A }[/math] is [math]\displaystyle{ \cup A = \{x: \exists y(x \in y\, \,\land\, y \in A)\}. }[/math] The power class of [math]\displaystyle{ A }[/math] is [math]\displaystyle{ \mathcal{P}(A) = \{x: x \subseteq A\}. }[/math] The extended version of the class existence theorem implies the existence of these classes. The axioms of replacement, union, and power set imply that when these operations are applied to sets, they produce sets.[34]
Axiom of replacement. If [math]\displaystyle{ F }[/math] is a function and [math]\displaystyle{ a }[/math] is a set, then [math]\displaystyle{ F[a] }[/math], the image of [math]\displaystyle{ a }[/math] under [math]\displaystyle{ F }[/math], is a set. [math]\displaystyle{ \forall F \,\forall a \,[F \text{ is a function}\implies \exists b \,\forall y\,(y \in b \iff \exists x(x \in a \,\land\, (x, y) \in F))]. }[/math]
Not having the requirement [math]\displaystyle{ A \subseteq Dom(F) }[/math] in the definition of [math]\displaystyle{ F[A] }[/math] produces a stronger axiom of replacement, which is used in the following proof.
Theorem (NBG's axiom of separation) — If [math]\displaystyle{ a }[/math] is a set and [math]\displaystyle{ B }[/math] is a subclass of [math]\displaystyle{ a, }[/math] then [math]\displaystyle{ B }[/math] is a set.
The class existence theorem constructs the restriction of the identity function to [math]\displaystyle{ B }[/math]: [math]\displaystyle{ I{\restriction_B} = \{(x_1, x_2): x_1 \in B \land x_2 = x_1\}. }[/math] Since the image of [math]\displaystyle{ a }[/math] under [math]\displaystyle{ I{\restriction_B} }[/math] is [math]\displaystyle{ B }[/math], the axiom of replacement implies that [math]\displaystyle{ B }[/math] is a set. This proof depends on the definition of image not having the requirement [math]\displaystyle{ a \subseteq Dom(F) }[/math] since [math]\displaystyle{ Dom(I{\restriction_B}) = B \subseteq a }[/math] rather than [math]\displaystyle{ a \subseteq Dom(I{\restriction_B}). }[/math]
Axiom of union. If [math]\displaystyle{ a }[/math] is a set, then there is a set containing [math]\displaystyle{ \cup a. }[/math] [math]\displaystyle{ \forall a\, \exists b\, \forall x\,[\,\exists y(x \in y\, \,\land\, y \in a) \implies x \in b\,]. }[/math]
Axiom of power set. If [math]\displaystyle{ a }[/math] is a set, then there is a set containing [math]\displaystyle{ \mathcal{P}(a). }[/math]
Theorem — If [math]\displaystyle{ a }[/math] is a set, then [math]\displaystyle{ \cup a }[/math] and [math]\displaystyle{ \mathcal{P}(a) }[/math] are sets.
The axiom of union states that [math]\displaystyle{ \cup a }[/math] is a subclass of a set [math]\displaystyle{ b }[/math], so the axiom of separation implies [math]\displaystyle{ \cup a }[/math] is a set. Likewise, the axiom of power set states that [math]\displaystyle{ \mathcal{P}(a) }[/math] is a subclass of a set [math]\displaystyle{ b }[/math], so the axiom of separation implies that [math]\displaystyle{ \mathcal{P}(a) }[/math] is a set.
Axiom of infinity. There exists a nonempty set [math]\displaystyle{ a }[/math] such that for all [math]\displaystyle{ x }[/math] in [math]\displaystyle{ a }[/math], there exists a [math]\displaystyle{ y }[/math] in [math]\displaystyle{ a }[/math] such that [math]\displaystyle{ x }[/math] is a proper subset of [math]\displaystyle{ y }[/math]. [math]\displaystyle{ \exists a\, [\exists u(u \in a) \,\land\, \forall x(x \in a \implies \exists y(y \in a \,\land\, x \subset y))]. }[/math]
The axioms of infinity and replacement prove the existence of the empty set. In the discussion of the class existence axioms, the existence of the empty class [math]\displaystyle{ \empty }[/math] was proved. We now prove that [math]\displaystyle{ \empty }[/math] is a set. Let function [math]\displaystyle{ F = \empty }[/math] and let [math]\displaystyle{ a }[/math] be the set given by the axiom of infinity. By replacement, the image of [math]\displaystyle{ a }[/math] under [math]\displaystyle{ F }[/math], which equals [math]\displaystyle{ \empty }[/math], is a set.
NBG's axiom of infinity is implied by ZFC's axiom of infinity: [math]\displaystyle{ \,\exists a\, [\empty \in a \,\land\, \forall x(x \in a \implies x \cup \{x\} \in a)].\, }[/math] The first conjunct of ZFC's axiom, [math]\displaystyle{ \empty \in a }[/math], implies the first conjunct of NBG's axiom. The second conjunct of ZFC's axiom, [math]\displaystyle{ \forall x(x \in a \implies x \cup \{x\} \in a) }[/math], implies the second conjunct of NBG's axiom since [math]\displaystyle{ x \subset x \cup \{x\}. }[/math] To prove ZFC's axiom of infinity from NBG's axiom of infinity requires some of the other NBG axioms (see Weak axiom of infinity).[lower-alpha 12]
The class concept allows NBG to have a stronger axiom of choice than ZFC. A choice function is a function [math]\displaystyle{ f }[/math] defined on a set [math]\displaystyle{ s }[/math] of nonempty sets such that [math]\displaystyle{ f(x) \in x }[/math] for all [math]\displaystyle{ x \in s. }[/math] ZFC's axiom of choice states that there exists a choice function for every set of nonempty sets. A global choice function is a function [math]\displaystyle{ G }[/math] defined on the class of all nonempty sets such that [math]\displaystyle{ G(x) \in x }[/math] for every nonempty set [math]\displaystyle{ x. }[/math] The axiom of global choice states that there exists a global choice function. This axiom implies ZFC's axiom of choice since for every set [math]\displaystyle{ s }[/math] of nonempty sets, [math]\displaystyle{ G\vert_s }[/math] (the restriction of [math]\displaystyle{ G }[/math] to [math]\displaystyle{ s }[/math]) is a choice function for [math]\displaystyle{ s. }[/math] In 1964, William B. Easton proved that global choice is stronger than the axiom of choice by using forcing to construct a model that satisfies the axiom of choice and all the axioms of NBG except the axiom of global choice.[38] The axiom of global choice is equivalent to every class having a well-ordering, while ZFC's axiom of choice is equivalent to every set having a well-ordering.[lower-alpha 13]
Axiom of global choice. There exists a function that chooses an element from every nonempty set.
Von Neumann published an introductory article on his axiom system in 1925. In 1928, he provided a detailed treatment of his system.[39] Von Neumann based his axiom system on two domains of primitive objects: functions and arguments. These domains overlap—objects that are in both domains are called argument-functions. Functions correspond to classes in NBG, and argument-functions correspond to sets. Von Neumann's primitive operation is function application, denoted by [a, x] rather than a(x) where a is a function and x is an argument. This operation produces an argument. Von Neumann defined classes and sets using functions and argument-functions that take only two values, A and B. He defined x ∈ a if [a, x] ≠ A.[1]
Von Neumann's work in set theory was influenced by Georg Cantor's articles, Ernst Zermelo's 1908 axioms for set theory, and the 1922 critiques of Zermelo's set theory that were given independently by Abraham Fraenkel and Thoralf Skolem. Both Fraenkel and Skolem pointed out that Zermelo's axioms cannot prove the existence of the set {Z0, Z1, Z2, ...} where Z0 is the set of natural numbers and Zn+1 is the power set of Zn. They then introduced the axiom of replacement, which would guarantee the existence of such sets.[40][lower-alpha 14] However, they were reluctant to adopt this axiom: Fraenkel stated "that Replacement was too strong an axiom for 'general set theory'", while "Skolem only wrote that 'we could introduce' Replacement".[42]
Von Neumann worked on the problems of Zermelo set theory and provided solutions for some of them:
In 1929, von Neumann published an article containing the axioms that would lead to NBG. This article was motivated by his concern about the consistency of the axiom of limitation of size. He stated that this axiom "does a lot, actually too much." Besides implying the axioms of separation and replacement, and the well-ordering theorem, it also implies that any class whose cardinality is less than that of V is a set. Von Neumann thought that this last implication went beyond Cantorian set theory and concluded: "We must therefore discuss whether its [the axiom's] consistency is not even more problematic than an axiomatization of set theory that does not go beyond the necessary Cantorian framework."[57]
Von Neumann started his consistency investigation by introducing his 1929 axiom system, which contains all the axioms of his 1925 axiom system except the axiom of limitation of size. He replaced this axiom with two of its consequences, the axiom of replacement and a choice axiom. Von Neumann's choice axiom states: "Every relation R has a subclass that is a function with the same domain as R."[58]
Let S be von Neumann's 1929 axiom system. Von Neumann introduced the axiom system S + Regularity (which consists of S and the axiom of regularity) to demonstrate that his 1925 system is consistent relative to S. He proved:
These results imply: If S is consistent, then von Neumann's 1925 axiom system is consistent. Proof: If S is consistent, then S + Regularity is consistent (result 1). Using proof by contradiction, assume that the 1925 axiom system is inconsistent, or equivalently: the 1925 axiom system implies a contradiction. Since S + Regularity implies the axioms of the 1925 system (result 2), S + Regularity also implies a contradiction. However, this contradicts the consistency of S + Regularity. Therefore, if S is consistent, then von Neumann's 1925 axiom system is consistent.
Since S is his 1929 axiom system, von Neumann's 1925 axiom system is consistent relative to his 1929 axiom system, which is closer to Cantorian set theory. The major differences between Cantorian set theory and the 1929 axiom system are classes and von Neumann's choice axiom. The axiom system S + Regularity was modified by Bernays and Gödel to produce the equivalent NBG axiom system.
File:ETH-BIB-Bernays, Paul (1888-1977)-Portrait-Portr 00025 (cropped).tif In 1929, Paul Bernays started modifying von Neumann's new axiom system by taking classes and sets as primitives. He published his work in a series of articles appearing from 1937 to 1954.[59] Bernays stated that:
The purpose of modifying the von Neumann system is to remain nearer to the structure of the original Zermelo system and to utilize at the same time some of the set-theoretic concepts of the Schröder logic and of Principia Mathematica which have become familiar to logicians. As will be seen, a considerable simplification results from this arrangement.[60]
Bernays handled sets and classes in a two-sorted logic and introduced two membership primitives: one for membership in sets and one for membership in classes. With these primitives, he rewrote and simplified von Neumann's 1929 axioms. Bernays also included the axiom of regularity in his axiom system.[61]
In 1931, Bernays sent a letter containing his set theory to Kurt Gödel.[36] Gödel simplified Bernays' theory by making every set a class, which allowed him to use just one sort and one membership primitive. He also weakened some of Bernays' axioms and replaced von Neumann's choice axiom with the equivalent axiom of global choice.[62][lower-alpha 22] Gödel used his axioms in his 1940 monograph on the relative consistency of global choice and the generalized continuum hypothesis.[63]
Several reasons have been given for Gödel choosing NBG for his monograph:[lower-alpha 23]
Gödel's achievement together with the details of his presentation led to the prominence that NBG would enjoy for the next two decades.[70] In 1963, Paul Cohen proved his independence proofs for ZF with the help of some tools that Gödel had developed for his relative consistency proofs for NBG.[71] Later, ZFC became more popular than NBG. This was caused by several factors, including the extra work required to handle forcing in NBG,[72] Cohen's 1966 presentation of forcing, which used ZF,[73][lower-alpha 25] and the proof that NBG is a conservative extension of ZFC.[lower-alpha 26]
NBG is not logically equivalent to ZFC because its language is more expressive: it can make statements about classes, which cannot be made in ZFC. However, NBG and ZFC imply the same statements about sets. Therefore, NBG is a conservative extension of ZFC. NBG implies theorems that ZFC does not imply, but since NBG is a conservative extension, these theorems must involve proper classes. For example, it is a theorem of NBG that the global axiom of choice implies that the proper class V can be well-ordered and that every proper class can be put into one-to-one correspondence with V.[lower-alpha 27]
One consequence of conservative extension is that ZFC and NBG are equiconsistent. Proving this uses the principle of explosion: from a contradiction, everything is provable. Assume that either ZFC or NBG is inconsistent. Then the inconsistent theory implies the contradictory statements ∅ = ∅ and ∅ ≠ ∅, which are statements about sets. By the conservative extension property, the other theory also implies these statements. Therefore, it is also inconsistent. So although NBG is more expressive, it is equiconsistent with ZFC. This result together with von Neumann's 1929 relative consistency proof implies that his 1925 axiom system with the axiom of limitation of size is equiconsistent with ZFC. This completely resolves von Neumann's concern about the relative consistency of this powerful axiom since ZFC is within the Cantorian framework.
Even though NBG is a conservative extension of ZFC, a theorem may have a shorter and more elegant proof in NBG than in ZFC (or vice versa). For a survey of known results of this nature, see Pudlák 1998.
Morse–Kelley set theory has an axiom schema of class comprehension that includes formulas whose quantifiers range over classes. MK is a stronger theory than NBG because MK proves the consistency of NBG,[76] while Gödel's second incompleteness theorem implies that NBG cannot prove the consistency of NBG.
For a discussion of some ontological and other philosophical issues posed by NBG, especially when contrasted with ZFC and MK, see Appendix C of Potter 2004.
ZFC, NBG, and MK have models describable in terms of the cumulative hierarchy Vα and the constructible hierarchy Lα . Let V include an inaccessible cardinal κ, let X ⊆ Vκ, and let Def(X) denote the class of first-order definable subsets of X with parameters. In symbols where "[math]\displaystyle{ (X,\in) }[/math]" denotes the model with domain [math]\displaystyle{ X }[/math] and relation [math]\displaystyle{ \in }[/math], and "[math]\displaystyle{ \models }[/math]" denotes the satisfaction relation: [math]\displaystyle{ \operatorname{Def}(X) := \Bigl\{ \{ x \mid x \in X \text{ and } (X,\in) \models \phi(x,y_1,\ldots,y_n) \}: \phi \text{ is a first-order formula and } y_{1},\ldots,y_{n} \in X \Bigr\}. }[/math]
Then:
The ontology of NBG provides scaffolding for speaking about "large objects" without risking paradox. For instance, in some developments of category theory, a "large category" is defined as one whose objects and morphisms make up a proper class. On the other hand, a "small category" is one whose objects and morphisms are members of a set. Thus, we can speak of the "category of all sets" or "category of all small categories" without risking paradox since NBG supports large categories.
However, NBG does not support a "category of all categories" since large categories would be members of it and NBG does not allow proper classes to be members of anything. An ontological extension that enables us to talk formally about such a "category" is the conglomerate, which is a collection of classes. Then the "category of all categories" is defined by its objects: the conglomerate of all categories; and its morphisms: the conglomerate of all morphisms from A to B where A and B are objects.[82] On whether an ontology including classes as well as sets is adequate for category theory, see Muller 2001.
Original source: https://en.wikipedia.org/wiki/Von Neumann–Bernays–Gödel set theory.
Read more |