19 Hyperfinite Approximation

In some nonstandard frameworks there are infinite sets that can be "approximated" by hyperfinite sets. From the discussion in Chapter 14 we know that in an enlargement of a universe over a set A there will be a hyperfinite set B with A􀁹 B 􀂻o*A. This phenomenon suggests a new methodology for analysing infinite structures by "lifting" a corresponding analysis that is known for finite ones. The steps involved are as follows: (1) Obtain information about finite structures by standard reasoning. (2) Use transfer to lift this to hyperfinite structures, including the set B above. (3) Find some way of "pushing down" the results from B to the infinite set A. This procedure will now be illustrated with three applications: colouring of graphs, representation of Boolean algebras, and the Hahn-Banach theorem about extensions of linear functionals on vector spaces. For the first two of these, the pushing-down step (3) is immediate. For the Hahn-Banach theorem, however, it requires a little further nonstandard analysis in the form of an appeal to the shadow map. 260 19. Hyperfinite Approximation 19.1 Colourings and Graphs An r-colouring of a set G is a sequence C1, ... , Cr of pairwise disjoint sets satisfying Thus each member of G belongs to exactly one of the sets Ci. This situation induces a partition of G, which we regard as being given by an assignment of r different colours to the members of G. G n Ci is the set of elements of G that are assigned colour i ( cf. Section 17.1). Notice that the way we have defined this notion ensures that any rcolouring of G is also an r-colouring of any subset of G. In any nonstandard framework for G we will get with *Ci n *Cj = 0 whenever i # j. Hence the enlarged sets *C1, ... , *Cr form an r-colouring of *G, which moreover agrees with the original colouring when restricted to G. A graph is a structure ( G, E) comprising a nonempty set G with a binary relation E on G that is irreflexive and symmetric: (Vx E G) (x, x) ¢. E, (Vx,ny E G) ( (x,y) E En--+ (y,x) E E). This is visualised as a collection of nodes, or vertices, labelled by the members of G, with a line connecting the nodes labelled by x and y whenever (x, y) E E. A pair (x, y) that belongs tonE is called an edge with vertices x andoy. Any subset H of G defines the subgraph of (G, E) determined by retaining just those edges whose vertices both belong to H. In other words, there is a line connecting x and y in this subgraph precisely when x, y E H and (x,y) E E. In a nonstandard framework for G the structure (*G, *E) is a graph, since transfer ensures· that * E is irreflexive and symmetric: Moreover, the subgraph of this enlarged graph defined by G is just the original graph (G, E), because ( x, y) E E iff ( x, y) E *E for each x, y E G by transfer (note here that *(x, y) = (*x, *y) = (x, y), since x andny are individuals in a nonstandard framework for G). An r-colouring of a graph is an r-colouring of its set of nodes with the additional property that the vertices of any edge have different colours, i.e., 19.1 Colourings and Graphs 261 there is no line connecting two nodes of the same colour. This requires that each of the statements (Vx, y E G) ( (x, y) E E 1\ x E Ci 􀀆 y 􀁣 Ci) must hold true forni= 1, ... , r. An r-colouring of any graph is automatically an r-colouring of any of its subgraphs. Now, the assertion "C1, ... , Cr is an r-colouring of (G, E)" can be expressed by a formula that we abbreviate as Colour( C1, ... , Cn G, E). If 01(, ... , Cr, G, E are all taken to be constants naming particular entities in a universe over G, then this formula is a sentence whose *-transform asserts that *Cb ... , *Cr is an r-colouring of the enlarged graph (*G, *E). If, however, we take 01(, .n. , Cr to be variables, then we can form the sentence . (301, .o. , Cr E P(G)) Colour(C1, ... , Cn G, E), . which states that there exists an r-colouring of (G, E). This can be modified further to express the assertion that every finite subgraph of (G, E) has an r-colouring: replace G by a variable H and form (VH E PF(G)) (301, ... , Cr E P(G)) Colour(C1, ... , Cr, H, E). (i) Note that for (i) to be true it is required only that each finite subgraph have its own r-colouring. The colourings of different finite subgraphs need not agree with each other, so it is not obvious from (i) that the whole graph (G, E) can itself be r-coloured. Nonetheless, it is true that • if every finite subgraph of a graph has an r-colouring, then the graph itself has an r-colouring. A proof of this result will now be given by a simple application of the hyperfinite approximation methodology. We work in an enlargement of a universe over G, and observe first that by applying transfer to (i) we can infer that each member of *PF(G) defines a subgraph of (*G, *E) that has an r-colouring. Thus the fact that every finite subgraph of ( G, E) has an r-colouring can be lifted to the conclusion that every hyperfinite subgraph of (*G, *E) has an r-colouring. But in the enlargement there is a hyperfinite approximant of G, i.e., a set HE *PF(G) with G 􀂻 H 􀂻o*G. Then the subgraph of (*G, *E) defined by H has an r-colouring, and it remains only to push this situation down to (G, E) itself. But this is immediate, since (G, E) is a subgraph of the graph defined by H, so the r-colouring of H is also an r-colouring of ( G, E). Exercise 19.1.1 Write out explicitly the formula Colour(C1, ... , Cn G, E). 262 19. Hyperfinite Approximation 19.2 Boolean Algebras George Boole (1815-1864) was a pioneer of mathematical logic. He showed that the study of logical connectives, and the validity of inferences, could be carried out by a mathematical analysis of equations involving operations that constitute what we now refer to as Boolean algebra. To explain this, consider first the collection P(A) of all subsets of a set A. This is closed under the binary operations n and U of intersection and union of sets, and under the unary operation -of complementation relative to A. The structure (P(A); n, U, , 0, A) - is the power set algebra of A. More generally, a field of sets is any nonempty collection B of subsets of a set A (i.e., B 􀁽 P(A)) that is closed under n, U, andn-. Then (B; n, U, .,...., 0, A) is a subalgebra of the power set algebra of A. These are concrete examples of the abstract notion of a Boolean algebra, which can be defined as any structure (B; n, u, ', o, 1) in which B is a nonempty set that contains elements 0 (the zero) and 1 (the unit) and carries binary operations n and U and a unary operation 1 such that the following equations hold for all x, y, z in B: xny ynx, xUy yUx, x n (y u z) (xny)u(xnz), xu (y n z) (xuy)n(xuz), xn1 x, xUO x, X nx' 0, x Ux' 1. From these many other properties are deducible, including xnx x, xux x, x n (y n z) (xny)nz, xU (y U z) (xuy)Uz, xU(ynx) x, 19.2 Boolean Algebras 263 x n (y u x) x, (x n y)' x' U y', (xU y)' x' n y', xU1 1, xno 0, ' 1, 1' 0, x n y' = o iff xny = x. The element x n y is called the meet of x and y, while x U y is their join. A relation 􀜑 is defined in any Boolean algebra by X 􀄀 y iff X n y = X (iff X U y = y iff Xn y' = 0). Thenn􀄀 proves to be a partial ordering (reflexive, transitive, antisymmetric) in which the meet x n y is the greatest lower bound of x and y, and the join x U y is the least upper bound. In a field of sets, 􀄀 is the relation 􀁕 of set inclusion. Notice that in any nonstandard framework fornB, the operations n, U, 1 will extend to corresponding operations on *B, for which we continue to use the same symbols. The Boolean algebra axioms hold for these operations by transfer, and so *B becomes a Boolean algebra having Bas a subalgebra. Of singular importance is the two-element algebra based on the set {0, 1}, in which 1 is identified with "true" and 0 with "false"n, and n, U, 1 are the operations specified by the truth tables that give the usual meanings of the logical connectives /\, V, ' This algebra is denoted by 2. It is isomorphic to the power set algebra · of any one-element set {a}, identifying 1 with {a} and 0 with 0. The algebra 2 is a fundamental building block: from the representation to be discussed below it can be shown that any Boolean algebra is isomorphic to a subalgebra of an algebra that is constructed as a direct product of copies of 2. This implies that any equation satisfied by 2 will be satisfied by every Boolean algebra. Now, the properties of the set operations n, U,(-follow from their definitions, CnD -{x : x E C and x E D}, CUD {X : X E C or X E D}, -C {x E A : not x E C}, - and hence depend on the meaning of the words and, or, not. Thus it is the behaviour of these logical connectives that dictate that P(A) should be a Boolean algebra, indicating a natural connection between the algebra of sets and the algebra of connectives. This is further exemplified by a construction that builds Boolean algebras out of formulae. Let II be a nonempty set, 264 19. Hyperfinite Approximation whose members will be called sentence letters, and consider the class of II-formulae generated inductively from members of II by the connectives 1\, V, ---,, 􀅸, f--7. A II -valuation is a function v : II 􀅸 { 0, 1} assigning a truth value to each sentence letter. Any valuation extends in a unique way to all formulae by the usual truth conditions: v(•..i : i ::; n) of hyperreal numbers. (2) Let r E *JRo-R Prove that r is not a root of any polynomial with real coefficients. (Hint: a polynomial has finitely many roots.) Deduce that the set {rn : n EN} of finite powers of r is linearly independent over JR, and therefore that *JR is infinite-dimensional as a vector space over JR. (3) Explain why the set lL of limited hyperreals is an infinite-dimensional real subspace of *JR. 19.10 The Hahn-Banach Theorem A function of the form f : V 􀃏 lR is called a functional on a real vector space V. A linear functional is one that is additive, in the sense that f(x + y) = f(x) + f(y) (v) for all x, v E V, and homogeneous in the sense that f(>..x) = >..f(x) (vi) for all x E V andn>.. E JR. Ifnf satisfies (vi) only forn>.. 2: 0, then it is positively homogeneous, and if it satisfies f(x + y) :S f(x) + f(y) 276 19. Hyperfinite Approximation in place of (v), then it is subadditive. Finally, a function f is dominated by a function p if f(x) S p( x) for all x in the domain of f. Armed with these definitions we can state the following cornerstone result from functional analysis. Hahn-Banach Theorem: Let p be a positively homogeneous and subadditive functional on a real vector space V, and f a linear functional on a subspace W of V with f dominated by p. Then there exists a linear functional g on V that extends f and is still dominated by p. They key idea for the proof is a construction that takes a vector x E V-vV and extends f to the subspace generated by adjoining x to W: Lemma 19.10.1 Given the hypothesis of the Hahn-Banach theorem, if x E V -W, then there exists a linear functional h on a subspace of V including W U { x} such that h extends f and is dominated by p. Proof We give only a sketch of the proof of this standard piece of linear algebra. The subspace of V generated by W U { x} is the set {y + Ax : y E W and A E lR}. A functional his defined on this subspace by putting h(y+Ax) = f(y)+Ac, where c is a real constant. Any such c will make h a linear functional extending f, but c has to be chosen suitably to ensure that h is dominated by p. It turns out that for this purpose c can any number bounded above by the infimum of the numbers p(y+x)-f(y) and below by the supremum of the numbers -p( -yo-x)o-f(y) as y ranges over W. 0 The nature of this result suggests that we can prove the Hahn-Banach theorem by repeatedly applying the procedure of adjoining elements. Having extended f to an h whose domain contains x, we then choose an x' 􀁣 dom h and extend h to a linear functional h' that is dominated by p and defined at x'. We continue this until we run out of elements of V to adjoin. Recalling the discussion in Section 2.6, we see that this process involves the axiom of choice in selecting x, x', etc. Alternatively, we could "wellorder" V -W into a linear list along which we iterate the construction transfinitely often. This was precisely how Hahn and Banach (independently) proved their theorem, and in fact, Banach expressed it very briefly. Having explained the procedure that proves Lemma 19.10.1, he completed his argument by simply stating, It now suffices to well-order the set V -W, obtaining, by suc cessive extensions off, following the procedure described above, a functional g satisfying the conclusion of the theorem. Modern treatments of the Hahn-Banach theorem use Zorn's lemma instead to make a maximal extension of f and then show that its domain is the 19.10 The Hahn-Banach Theorem 277 whole of V. Here we will show that a proof can be developed in the language of enlargements. The essential idea is to iterate the construction of Lemma 19.10.1 hyperfinitely often, adjoining enough elements to the domain of the functional to include all of V. By iterating the adjunctions finitely many times we can take any finite sequence (xi : i :s; n) of elements of V and extend f to a linear functional dominated by p whose domain includes the subspace generated by the Xi's. Hence by applying transfer we can take any internal hyperfinite sequence (xi : i :s; n) and extend f to the hyperfinite-dimensional subspace of *V that it generates. In particular, this would allow us to lift f to the space v+ E * Fin(V) including V that was obtained in Theorem 19.8.2, and then push this extension down to V itself. In fact, an even more direct approach is to incorporate into our present situation the kind of concurrency argument used to derive v+. Given the hypothesis of the Hahn-Banach theorem, define a binary relation R by specifying that xRh if and only if x E V and h is a linear functional that extends f and is domi nated by p and whose domain is a subspace of V that includes WU {x}. If x􀇒, ... , Xn E V (where n EoN), then applying Lemma 19.10.1 at most n times produces an h having XiRh for all 1 :s; i :s; n. So R is current and has domain V. Working in an enlargement of a universe over V, we then obtain an j+ such that x(*R)j+ for all x E V. By transferring properties of R we conclude that: • j+ is a *􀃪-valued function whose domain is a hyperreal subspace of *V that includes V. • j+ is hyperlinear, meaning that it is additive and is homogeneous for hyperreal scalars: f ( >..x) = >..J ( x) whenever >.. E *􀃪. • j+ extends f: j+(x) = f(x) for all x E W. • j+ is dominated by the extension of p to *V: j+( x) :s; p( x) for all x E domj+. At this stage we cannot simply restrict j+ to V to produce our desired functional extension of f, because j+ may take nonstandard values on V. However, these values are always at least limited, so we can take their shadows. To see that this is so, note that in general, whence -p(-x) :s; j+(x) :s; p(x) 278 19. Hyperfinite Approximation for all x E domj+. Therefore when x E V, j+(x) is sandwiched between the real numbers -p( -x) and p(x), so is limited and has a shadow. Putting g(x) = sh(f+(x)) defines g : V ---+ lR as a function that extends f. Then g is dominated by p because g(x) 􀀵 j+(x) ::; p(x), implying g(x) < p(x), since both g(x) and p(x) are real. Finally, the shadow map sh : lL ---+ lR is a linear functional that composes with the restriction of the hyperlinear j+ to V to make g linear. This completes our nonstandard proof of the Hahn-Banach theorem. 19.11 Exercises on (Hyper) Linear Functionals (1) Verify that sh : ILo---+ lR is a linear functional on the real vector space IL of limited hyperreal numbers. (2) Write out in full the transfer arguments showing that j+ has the properties listed above. (3) (Due to W. A. J. Luxemburg) Given the hypothesis of the HahnBanach theorem, let {fi : i E I} be the set of all linear functionals that extend f, are dominated by p, and are defined on a subspace of V including W. For each x E V, put Axo= {i E I: x E domji}, and let rx : I ---+ lR be a function satisfying rx(i) = fi( x) for all i E Ax (for definiteness rx can be equated to 0 outside of Ax)· (a) Show that {Ax : x E V} has the finite intersection property. (b) Let F be an ultrafilter on I including {Ax : x E V}. Taking *JR to be the ultrapower of lR by F, define j# : V ---7 *JR by putting j#(x) = [rx]· Show that j# is hyperlinear. (c) Use j# in place of j+ to prove that there is a linear functional g on V that extends f and is dominated by p. This gives a direct derivation of the Hahn-Banach theorem from the ultrafilter theorem.