In this paper, methods of second-order and higher-order reverse mathematics are applied to versions of a theorem of Banach that extends the Schröder–Bernstein theorem. Some additional results address statements in higher-order arithmetic formalizing the uncountability of the power set of the natural numbers. In general, the formalizations of higher-order principles here have a Skolemized form asserting the existence of functionals that solve problems uniformly. This facilitates proofs of reversals in axiom systems with restricted choice.
The Schröder–Bernstein theorem is a well-known result about cardinality. In full generality, it states that if A and B are sets, there is an injection , and there is an injection , then there is a bijection from A to B. Unfortunately, this theorem is not ideal for reverse mathematics analysis. If we add the assumption that , the result is computationally trivial: whenever have the same cardinality, there is an -computable bijection between them.
In higher-order reverse mathematics, we might consider the case where or . In this setting, the Schröder–Bernstein theorem is no longer trivial. However, because the theorem does not postulate any relationship between the bijection being constructed and the original two injections, obtaining reversals presents a challenge.
Our focus is a classical theorem of Banach [1] from 1924 more suited to reverse mathematical analysis. Banach argued this theorem captures the essence of proofs of the Schröder–Bernstein theorem, such as the well known proof by Julius König.
(Banach).
If A and B are sets,is an injection, andis an injection, there are decompositionsandsuch that,,, and.
Restating this in terms of the existence of a bijection gives a corollary that strengthens the Schröder–Bernstein theorem, which we will also call Banach’s Theorem.
If f is an injection from a set A to a set B, and g is an injection from B to A, there is a bijectionsuch that, whenever, eitheror.
A brief history of Banach’s Theorem and the Schröder–Bernstein theorem is given by Remmel [16]. A version of Banach’s Theorem is studied in [13]. An early analysis of Banach’s Theorem for subsets of , using subsystems of second-order arithmetic, appears in Hirst’s thesis [7, §3.2] and a related article [8]. That development uses symmetric marriage theorems to prove the following second-order arithmetic results.
(Bounded Countable Banach’s Theorem) Letandbe injections such that the ranges of f and g exist. Then there is a bijectionsuch that for all m and n,implies eitheror.
In this paper, we use methods from higher-order reverse mathematics to study the uniformity of results like these. We are interested not only in the existence of the bijection h, but also whether there is a functional that can produce h uniformly from f and g. This question of uniformity is purely higher-order, and cannot be expressed directly in second-order reverse mathematics. To study this uniformity, we examine Skolemized versions of theorems. For example, instead of examining Banach’s Theorem in a form such as
we consider the form
Both versions of a theorem are of interest, of course, and the latter always follows from the former if we assume sufficient choice principles. We are interested in the Skolemized forms because they represent a particular kind of uniformity, and we typically do not assume enough choice to derive them directly from the un-Skolemized form. As discussed in Section 5, this is a different kind of uniformity than Weihrauch reducibility.
Kohlenbach studies the reverse mathematics of Skolemized forms like equation (1) for basic mathematical theorems in [10,12]. Similar functional existence statements have been studied by other authors, including Sakomoto and Yamazaki [17]. A central technique is Grilliot’s trick, a method for effectively deriving a uniform, functional existence version of arithmetical comprehension from the existence of discontinuous functions (on the reals or on Baire space).
Section 3 begins with a survey of reverse mathematics results on countability. Sections 4 and 5 present a number of supporting lemmas to prepare for the analysis of Banach’s theorem. Section 6 examines Theorems 1.3 and 1.4 from the viewpoint of Skolemized uniformity. Section 7 extends the study of Banach’s Theorem to subsets of and, more generally, subsets of compact metric spaces. Finally, Section 8 discusses moduli of uniform continuity and a fan functional.
Formal theories
This work relies on several well studied systems of second-order arithmetic and higher-order arithmetic. Simpson [18] and Dzhafarov and Mummert [3] provide thorough references for reverse mathematics. Kohlenbach [12] provides a reference for higher-order reverse mathematics. We follow Kohlenbach’s definitions of higher-order systems throughout this paper, noting any exceptions explicitly.
For the purposes of higher-order reverse mathematics, we assume that our systems use the function based language of higher-order arithmetic, rather than the set based language. Accordingly, is used throughout this paper to denote the set of all functions from to .
Many of our results will use fragments of the quantifier-free choice scheme. For types ρ and τ, we have the scheme
where A is a quantifier free formula. Here A can have parameters of arbitrary type.
The system is a fragment of higher-order arithmetic. It can be axiomatized by a set of basic axioms along with induction for formulas and the choice scheme . The syntax has term-forming operations for λ-abstraction and primitive recursion.
The system is a second-order fragment of , with only types 0 and 1 for elements of and functions , respectively. Formally, we have . This system is equivalent to the set based system presented by Simpson [18], and we will henceforth denote by when no confusion is likely.
A sequence is viewed as a map , so that , where is a suitable pairing function.
Emulating Kohlenbach [12], we use parentheses around the name of a functional to denote the principle stating that the functional exists. For example, the principle asserts the existence of the functional , defined below.
There are several ways to extend the comprehension axioms of second-order arithmetic to the higher-order setting. One particular functional (set) existence axiom for higher-order arithmetic is , defined by
The functional from this principle is itself called . The system implies the arithmetical comprehension scheme. Kohlenbach [12] showed that is conservative over first order Peano arithmetic.
Other functional existence principles correspond to . Kohlenbach [12] presents two such functionals. One, , selects a zero of a function if such a zero exists:
Another returns the least zero of a function, in the fashion of Hilbert and Bernays [6, page 411], also used by Feferman [4, §2.3.3]:
The following are pairwise equivalent over:,, and.
Kohlenbach [12, Proposition 3.9] proves the equivalence of and . Because any μ satisfying also satisfies , it suffices to show that proves that implies . Given a functional as in the definition of , is the least such that . This functional is primitive recursive in and thus exists by and . □
Finally, we note that Grilliot’s trick [5] is an algorithm in computability theory that yields the functional from any input coding a discontinuous function on Baire space. Kohlenbach [12] shows that Grilliot’s trick can be formalized in , so the existence of a discontinuous function (on Baire space or the reals) implies the principle .
Countability
One motivation for this research is the question: how difficult is it to prove is uncountable? As usual, being uncountable simply means not being countable. There are many ways to express the principle that is countable, with the following three being particularly natural:
: there is a sequence such that for all there is an with .
: there is a functional that is an injection from to .
: there is a functional that is a bijection from to .
The principles and cannot be stated in the language of second-order arithmetic, but they can be stated in . When we say we assume or , this means we assume the existence of a functional with the property stated. Similarly, if we assume or , this means we assume no functional has the property stated.
In context of set theory there is little reason to distinguish between and , because of the comprehension principles available. As discussed below, there are key distinctions between these principles in the context of theories of arithmetic with restricted comprehension principles.
Of course, , , and are classically false. There are two key questions: which systems are “strong enough” to disprove these false principles, and which are “weak enough” to be consistent with one or more of the principles. As is well known, Cantor’s diagonalization proof allows us to disprove in very weak systems (compare Theorem II.4.9 of Simpson [18] showing is not denumerable in ).
proves.
Given witnessing , can construct the function g defined by . Then , but g cannot be for any . □
The principles and have much more interesting behavior. Normann and Sanders [14] provide a detailed analysis of the negations of these principles, which they name and , respectively. (They formulate and for but the results hold equally for .) Normann and Sanders work in extensions of , , and , all of which prove the same second-order sentences, as shown in the thesis of Hunter [9]. Their Theorem 3.2 shows that the true principle is not provable in the system (which includes third-order functionals deciding formulas and consequently includes comprehension with parameters of type 1), and hence this system is consistent with [14, Theorem 3.26]. They also show that is provable in , a subsystem that includes the functional , a functional that emulates for inputs of type . This subsystem includes comprehension. In the remainder of this section, we discuss some aspects of their results related to and .
A key issue in analyzing is that the range of an injection from to may be hard to form with weak comprehension axioms. We will see that a similar issue arises in the study of Banach’s theorem, as well, where the existence of the range of a functional becomes a key question. By contrast, can be disproved using the weak choice principle [14, Theorem 3.28]. The following short disproof of uses more choice.
().
There is no injectionfor which the characteristic function for the range exists. In particular,is disprovable in.
We will work in and assume there is a bijection Φ from to with range given by a characteristic function. We will prove the principle by constructing a kind of left inverse of Φ, which will be a (possibly noninjective) enumeration of . Because proves , this gives a contradiction.
By assumption, for each there is a with . Therefore, by , we may form a function f so that . Then is an enumeration of , so holds, a contradiction. □
We now explain how the lemma implies certain higher-order formulations of the Schröder–Bernstein theorem are nontrivial. Suppose, in the context of set theory, we wanted to try to use the Schröder–Bernstein theorem to show is countable. Because there is a trivial injection from to , the other assumption in the Schröder–Bernstein theorem is the existence of an injection from to , that is, . The conclusion is the existence of a bijection, that is, . We can thus view the implication as a specific formal instance of the Schröder–Bernstein theorem. (Normann and Sanders [13] study a different formulation of the Schröder–Bernstein theorem, which they call .)
The implicationis not provable in.
is consistent with but not . □
Lemma 3.2 can also be used to obtain an upper bound on the strength required to disprove . Normann and Sanders [14] provide many examples of third-order theorems that disprove . Among them is a version of the following lemma using the principle as a formalization of comprehension with functional parameters.
is disprovable fromalong withcomprehension with parameters of type 2.
Assume Φ is a functional witnessing . Applying comprehension with parameter Φ, we can construct the range of Φ. We then obtain a contradiction from Proposition 3.2. □
A final point of interest is that the classically false principle , although consistent with , has nontrivial set existence strength. Normann and Sanders discuss the contrapositive of the following proposition in the guise of a “trick” related to excluded middle [14, §3].
impliesover.
If Φ is an injection from to then Φ is discontinuous at every point (here we identify elements of with constant functions from to ). The existence of a discontinuous functional implies by [12, Proposition 3.7] of Kohlenbach. □
Thus, for example, there is no model of in which holds and every element of is computable.
Bounding calculations of type 1 functions
This section contains several technical lemmas related to the range of a function . Each function of this type has a number of auxiliary functions related to its range. The most obvious is the characteristic function for the range. We write as shorthand for the formula asserting that g is the characteristic function for the range of f. More formally,
A bounding function can also be used to compute the range of f. We write for the formula asserting that g is such a bounding function. Formally,
The results below address the problem of converting between the characteristic function for the range of a function and a bounding function for the range, and the amount of uniformity present in the conversion. In the second-order setting, principles asserting the existence of characteristic functions and the existence of bounding functions are interchangeable, as shown by the following two results.
().
For all,.
Working in , suppose g is a characteristic function for the range of a function f. Then is the desired bounding function and exists by recursive comprehension.
Now suppose that h is a bounding function for the range of f. The characteristic function can be defined by the formula
and hence g exists by recursive comprehension. □
The relationship between characteristic and bounding functions is uniform in the sense that proves the sequential extension of the previous result.
().
For every sequenceof functions fromto, we have
We will write as a function of two variables, so . Adapting the proof of the preceding result, write
and
to translate the sequences of auxiliary functions in . □
Third order arithmetic can formalize “translating functionals” of type 2 to convert between characteristic functions and bounding functions for ranges. Principles asserting the existence of the translating functionals provide additional examples of Skolemized uniformity, distinct from the sequential uniformity often considered in second-order settings.
As shown below, the existence of a translating functional from bounding functions to characteristic functions can be proved in . However, the reverse translation functional requires stronger assumptions. Thus the interchangeability of the two sorts of auxiliary functions witnessed in the second-order setting by the previous two propositions does not extend to Skolemized functional formulations in third order arithmetic.
().
There is a functionalof typethat translates bounding functions into characteristic functions for ranges. That is,
Working in , by there is a functional Y from to such that if and only if . Note that the defining formula is quantifier free because the bounded quantifier can be rewritten using a primitive recursive functional. The desired functional is then
□
().
The following are equivalent:
.
There is a functionalof typethat translates characteristic functions for ranges into bounding functions. That is:
To prove that (1) implies (2), assume . The base system suffices to prove the existence of the functional χ which takes the (type 1 code for the) pair and maps it to the function satisfying . By [12, Proposition 3.9] of Kohlenbach, implies the existence of Feferman’s μ functional satisfying the formula
The function is the desired bounding function, and exists by composition.
In the proof of the preceding implication, the principle is sufficiently strong that we can discard the given characteristic function and calculate the bounding function directly from f and . We now show we can calculate from any translation functional in the reverse direction.
To prove that (2) implies (1), suppose is a translation functional as described in (2). We will show that this functional is not continuous in the usual sense (see [12, Definition 3.5] of Kohlenbach).
We can view inputs f and g as a single sequence and use the usual Baire space topology. The functional is defined for every input of two type 1 arguments, including inputs f and g for which g is not a characteristic function for the range of f. For example, let satisfy for all n. Let satisfy and otherwise. Then is not a correct characteristic function for . However, for some totally defined type 1 function h, and for some value b.
Suppose by way of contradiction that is continuous. Then for every pair in some neighborhood N of we must have . Let be a function that is 1 for every , outputs a sufficient number of ones so that is in the neighborhood N, and is eventually constantly zero. Then is a correct characteristic function for , so holds. However, . Thus is not a bounding function for because but . Thus fails. This contradicts the implication given in item (2) of the proposition. Thus must not be continuous. By [12, Proposition 3.7] of Kohlenbach, follows via Grilliot’s trick. □
Let R be the Weihrauch problem taking a type 1 function as an input and yielding output consisting of the characteristic function of the range of the input. Let B be the Weihrauch problem that outputs bounding functions as described above. Ideas from the proof of Proposition 4.1 can be adapted to show that R and B are weakly Weihrauch equivalent, and strongly Weihrauch incomparable. Summarizing, analyses based on sequential second-order statements, Skolemized higher-order statements, and Weihrauch reducibility yield different results. This indicates that there are three distinct notions of uniformity considered here.
Realizers for omniscience principles
The principle is closely related to a certain formulation of the limited principle of omniscience. The Weihrauch problem asks for a realizer that determines whether an infinite sequence of natural numbers contains a zero. Indeed, the definition of could be rewritten as
to emphasize asserts the existence of a realizer for this problem,
The Weihrauch problem , related to the lesser limited principle of omniscience, asks for a realizer to identify a parity (even or odd) on which a sequence of numbers is zero, assuming either that all even positions are zero or all odd positions are zero. We will use a principle asserting the existence of a realizer for :
Often it is more convenient to work with an equivalent form that asks for the parity of the first location where a sequence is zero, if there is such a location:
For example, suppose denotes the infinite sequence consisting of 1, 0, 1 followed by all zeros. Then because the sequence contains a 0; because for all n; and because the first zero occurs in position 1, which is odd. For the sequence , ; because the first zero occurs at position 2, which is even; and the value of is not determined by its defining formula.
One motivation of is that its value is determined for every sequence that includes a zero. The next proposition shows that and are equivalent for our purposes. For Weihrauch problems and expressible in the language of , we say that proves if proves there are functionals such that, for every realizer of , the functional is a realizer of .
proves that the problemsandare strongly Weihrauch equivalent, and that the principlesandare equivalent.
First, assume R is a realizer for . Define a pre-processing function such that if and . Define a post-processing function . Then is a realizer for .
To see this, assume g is an instance of . If g is identically zero, then whichever value in is produced by S is acceptable. If g is not identically zero, then is zero on exactly the inputs where g is nonzero. Thus is the parity of the first location where g is nonzero, and is the parity for which g is always zero. This shows .
Conversely, suppose S is a realizer for . Define a pre-processing function such that, for ,
Thus and f agree through the first zero of f, but afterwards takes only the value 1. Hence there is at most one input k for which is nonzero, and if there is such a k then it is the least input for which . This means that is in the domain of , and produces the parity of k. Thus is a realizer for . We have shown .
The pre-processing and post-processing functionals in this argument can all be formed in , which can verify the correctness of the argument. None of the post-processing functions require access to the original instance of a problem. Hence proves that or are strongly Weihrauch equivalent.
Concatenation of the pre-processing and post-processing functionals with any realizer yields an realizer, so the principle implies the principle . The converse follows in a similar fashion. □
Results of Weihrauch analysis include and the parallelized form . See Weihrauch [19, §4] and Brattka and Gherardi [2, Theorem 7.13] for proofs. Consequently, the following result may initially be surprising. The underlying difference is that Weihrauch reducibility requires a single reduction that works for all realizers; the argument below breaks into cases depending on the behavior of the realizer.
().
implies.
Working in , by Proposition 5.1, it is sufficient to assume the existence of as provided by , and prove holds.
Let be the infinite sequence of ones. Our goal is to show that R is sequentially discontinuous at f. We will construct a sequence such that and for each n, disagrees with . In particular, if , we want for all n, so we define as a sequence of ones followed by all zeros. On the other hand, if , we want for all n, so we define as a sequence of ones followed by all zeros. Summarizing, for each n and m we have
Note that proves the existence of the sequence , that , and that for all n, . Thus R is sequentially discontinuous and follows by [12, Proposition 3.7] of Kohlenbach.
The proof of Kohlenbach’s proposition is based on the proof of Lemma 1 of Grilliot [5]. We append that argument here to give a direct derivation of from . Let the function f and the sequence be defined as above. The system suffices to prove the existence of the operator defined for and by
Note that if . On the other hand, if i is the least value for which , then . Consequently, for all , if and only if . Thus , so the existence of follows by . □
().
The following are equivalent:
.
.
To show that (1) implies (2), as noted in Proposition 2.1, [12, Proposition 3.9] of Kohlenbach shows that the principle proves the existence of Feferman’s μ functional which satisfies:
The remainder function yielding the remainder of dividing n by 2 is primitive recursive. Thus proves the existence of the composition functional , which satisfies the definition of .
The converse was proved as Proposition 5.2 above. □
For an alternative proof of the forward implication of Theorem 5.3, we can use a formalized Weihrauch reducibility result. The next two results illustrate this process. The following proposition converts formal Weihrauch reducibility to proofs of implications of Skolemized functional existence principles.
Ifandare problems andandare the associated Skolemized functional existence principles, thenproves
Working in , suppose and hold. Let φ and ψ be the functionals witnessing and let witness . The system proves the existence of the composition , which can be directly shown to realize the principle . □
Next, we prove the formalized Weihrauch reducibility result corresponding to the forward implication of Theorem 5.3.
proves.
We again work with in place of . Let be the pre-processing functional defined by:
Note that is the sequence of all ones except when the first zero in the range of h occurs in an even location. Define the post-processing functional . If is a realizer for , then is a realizer for . Because ψ makes no use of h, this shows that . Applying Proposition 5.1, we see that . □
As mentioned before, the forward implication of Theorem 5.3 follows immediately from Propositions 5.4 and 5.5. The next result uses Theorem 5.3 to give a short proof of one direction of Proposition 3.4 of Kohlenbach [10], showing that a uniform version of weak König’s lemma is equivalent to . This equivalence is also included in [12, Proposition 3.9] of Kohlenbach. Kohlenbach emphasizes the uniformity provided by the functional by using to denote the principle appearing in the next proposition. We will apply the proposition in the analysis of Banach’s Theorem for in the next section. (For a discussion of uniform see Theorem 3.2 of Sakamoto and Yamazaki [17].)
().
The following are equivalent:
.
There is a functionalsuch that if T is a code for an infinite tree in, thenis an infinite path in T.
As noted in the proof of Proposition 3.4 of Kohlenbach [10], the proof that (1) implies (2) follows from the fact that, given the functional , primitive recursion can define a functional which selects an infinite branch of an infinite binary tree. For a short proof of the converse, it suffices to show that implies . Consider an instance for . Let denote the sequence of n ones. Define the tree by:
Only sequences of the form and are in ,
if and only if either or is even, and
if and only if either or is odd.
can prove the existence of a functional φ that maps each f to . For each f, the first element of is an solution for f. □
Banach’s theorem on
This section reformulates Theorems 1.3 and 1.4 as higher-order functional existence statements. In particular, Theorem 6.8 shows that, in the Skolemized higher-order setting, the bounded version is equivalent to the unbounded version. This collapse mimics that of the uniform principle . Our discussion begins with the formulation of the bounded principle and its proof from .
A bounded Banach functional on is defined as follows. For injective functions and with bounding functions and , is a bijective function such that for all , implies or . As usual, the parenthesized expression denotes the principle asserting the existence of a bounded Banach functional for .
().
implies.
We work in . For bounded injections , we will describe the computation of a related tree so that any infinite path through defines an injection h satisfying Banach’s theorem. If p is an infinite path through , the bijection h will be defined by:
A finite sequence is included in if it satisfies the following four conditions, (i)–(iv), each ensuring an aspect of the back-and-forth construction of the bijection h.
First, if there is an which is not in the range of , we ensure that .
If and then .
Next, if m is for some t and t is not in the range of , we set in the following fashion.
If , there is a such that , and , then .
The next clause ensures that h is injective.
If , , and , then .
This final clause ensures that h is surjective.
If , , and , then .
The sequences satisfying the clauses are closed under initial segments, so is a tree. The second-order proof of the bounded Banach theorem in (Theorem 1.4) shows that is infinite. Note that the construction of eventually halts for arbitrary choices of , even if and are not injections or if or gives incorrect bounding information. Thus proves the existence of a functional mapping to . Whenever is infinite, yields an infinite path. Concatenating these functionals with the one computing the bijection h as described at the beginning of the proof yields the desired Banach functional. □
The preceding proposition differs from the second-order analog (Theorem 1.3) in the formulation of the bounding functions. The original second-order version was formulated with characteristic functions for the ranges of the injections. However, in the calculation of , the use of an incorrect characteristic function could result in an unbounded nonterminating search, causing the functional mapping to to be undefined on some inputs. This difficulty could be circumvented by using the fact that the uniform principle implies , but the argument presented here uses a single application of .
Our proof of the unbounded version of Banach’s Theorem for from uses a proposition relating to the existence of bounding functions.
The functional maps any function to a bounding function for f. In the notation of from formula (3), for all f we have . As usual, the parenthesized expression denotes the principle asserting the existence of a bounding functional.
().
implies.
proves the existence of a functional where satisfies if and only if . By Proposition 2.1, proves the existence of Kohlenbach’s . By composition and λ-abstraction, there is a functional mapping f to the bounding function . □
The next definition formulates an unbounded form of Banach’s theorem on . Using the principle , the unbounded form can be derived from the bounded form.
A Banach functional on , denoted , is defined as follows. For injective functions and , is a bijective function such that for all , implies or . As usual, the parenthesized expression denotes the principle asserting the existence of a Banach functional for .
().
implies.
Working in , assume . By Proposition 5.6, we have , so by Proposition 6.2, we have . By Lemma 6.4, we have the functional mapping functions to associated bounding functions. The composition functional satisfies . □
The next proposition essentially shows that the principle can be deduced from the restricted form of Banach’s theorem for .
().
implies.
Assume and . Our goal is to prove the existence of the functional . Let and define bounded injections and as follows.
Figure 1 illustrates the construction of and for three choices of g. In general, for even inputs like , let and . For , let and . The appearance of 0 in the range of g affects the definitions of and on other odd values. Suppose that . If , then let and . If , write . If s is even, then let
and let . If s is odd, then let
and
In Fig. 1, is represented by solid arrows and by dashed arrows. Extending the chains to the left and right, each number has an exiting arrow, so both and are total. No number has two entering arrows, so and are injective.
Construction for Proposition 6.6. (a): (solid) and (dashed) when 0 is not in the range of g. (b): (solid) and (dashed) when . (c): (solid) and (dashed) when .
Figure 1(a) corresponds to the situation when 0 does not appear in the range of g. Any bijection h satisfying Banach’s theorem must either consist of all the (inverses of the) dashed arrows or all the solid arrows. In this situation, may be 0 or 1.
Figure 1(b) corresponds to the case when is the first zero in the range of g. In this case, 5 must be in the domain of h, so . The only bijection satisfying Banach’s theorem consists of solid arrows to the left of 5, so .
Figure 1(c) is for the case when is the first zero in the range of g. Here 5 must be in the range of h, so . The only bijection satisfying Banach’s theorem consists of (inverses of the) dashed arrows to the left of 5, and so . If 0 first appears in the range of g at an even value, the figure for and will be a shifted version of the second figure. Odd values yield a shifted version of the third figure.
Because is never less than , is a bounding function for . Similarly, is never less than n, so is a bounding function for . Routine verification shows that for any choice of g, and will be injections bounded by and . Suppose that h is any bijection satisfying Banach’s theorem for , , , and . If the first 0 in the range of g occurs at an even value, then . If it occurs at an odd value, then . If 0 is not in the range of g, then may be either 0 or 1. The system proves the existence of the functional mapping g to the bounded injections as defined above. The functional yields the bijection h for . Consequently, the functional mapping g to (which equals ) is . □
Concatenating the preceding arguments yields the desired equivalence theorem and concludes the section.
().
The following are equivalent:
.
.
.
Proposition 6.6 shows that (1) implies (2). Because is a restriction of , (2) implies (3) is immediate. Theorem 5.3 and Proposition 6.7 show that (3) implies (1). □
Banach’s theorem on compact spaces
Our next goal is to analyze the strength of Banach’s theorem restricted to uniformly continuous functions on complete separable metric spaces. We formalize complete separable metric spaces imitating Simpson [18, II.5]. The space is the collection of rapidly converging sequences of elements of an underlying (countable) set A. The metric is a function , extended to by defining . As in Definition III.2.3 of Simpson [18], a space is compact if there is an infinite sequence of finite sequences of points of of the form , such that for all and , there is an such that .
Uniform continuity can be witnessed by a modulus of uniform continuity as formalized in Definition IV.2.1 of Simpson [18]. The function is a modulus of uniform continuity for f if for all k, implies . If is a modulus of uniform continuity for f and is a modulus of uniform continuity for g, then h defined by is a modulus of uniform continuity for f and g. Consequently, a joint modulus can be used to simplify some statements.
Kohlenbach [12] defines two equivalent forms of continuity for functionals of type . First, is everywhere sequentially continuous if ([12, Definition 3.3]):
Second, is everywhere continuous if ([12, Definition 3.5]):
This second definition is similar to familiar textbook definitions of continuity for total functions. The use of n and k reduces the type of the quantifiers corresponding to δ and ε. Proposition 3.6 of Kohlenbach [12] proves in that F is everywhere sequentially continuous if and only if F is everywhere continuous.
For the reader’s convenience, we repeat the following portion of [12, Proposition 3.7] of Kohlenbach, a very useful tool for proving reversals.
There is a functional which is not everywhere sequentially continuous.
There is a functional which is not everywherecontinuous.
For uniformly continuous functionals on compact complete separable metric spaces, it is possible to find ranges using only . Indeed, the next two lemmas show that for Cantor space the existence of ranges is equivalent, a higher-order analog of Lemma III.1.3 of Simpson [18]. (Results in the next section show how to reformulate the following results without an explicit modulus of uniformity. See Theorem 8.3.)
().
Suppose X is a compact complete separable metric space. There is functional R such that ifis a function with modulus of uniform continuity h,is the characteristic function of the range of f. That is, for all,andif and only if.
Working in , let X be as hypothesized, with the compactness of X witnessed by . Consider a function with modulus of uniform continuity h. Informally, a value is in the range of f if and only if for every m there is an x with . By uniform continuity and compactness, such an x exists if and only if there is an such that . In , for and , we may define
Viewing K as a function in m with parameters f, h, and y, in we may define . Informally, by the definition of φ, if and only if for all m there is an x with . Note that the calculation of eventually halts even if f is not continuous or h is incorrect.
To complete our proof, we must verify in our informal claim that for each continuous function f with modulus of uniform continuity h, is the characteristic function for the range of f. First, if , then there is a sequence such that for every m, . The principle implies which implies the Bolzano–Weierstrass theorem (see Theorem III.2.7 of Simpson [18]), so we can thin to a sequence converging to some . By sequential continuity of f, we have .
Second, if , then for some natural number m, we must have . Suppose by way of contradiction that . Choose such that . Because f is uniformly continuous, . Concatenating inequalities, we have
a contradiction. Thus, implies , completing the proof. □
().
The following are equivalent:
.
If X is a compact complete separable metric space, then there is functional R such that ifis a function with modulus of uniform continuity h,is the characteristic function of the range of f.
There is functional R such that ifis a function with modulus of uniform continuity h,is the characteristic function of the range of f.
We will work in . By Lemma 7.2, item (1) implies item (2). Item (3) is a special case of item (2), so we need only show that item (3) implies item (1).
In , we can prove the existence of the function that maps an arbitrary function to an element of Cantor space so that, for all n, if and only if . In terms of the function from the definition of , . can also prove the existence of the transformation such that for all , if and otherwise. Let denote the set of functions from to . proves the existence of the function which maps each to the constant function in that takes the value . For each f, because is a constant function, the constant 0 function on , denoted by , is a modulus of uniform continuity for . (Any function could be used as a modulus.) For any , z is in the range of if and only if 0 is not in the range of , and this occurs if and only if . Using the functional from item (3), we have , so item (3) implies . □
Our proof of Banach’s theorem in compact metric spaces requires a functional that can calculate the inverse of a given function. The next two lemmas show that is sufficient and also necessary for this task.
().
Suppose X is a compact complete separable metric space. There is a functional I such that ifis a function with modulus of uniform continuity h, thenis a function that selects elements from the pre-image of f. That is, for all, if there is an p such that, then. In particular, if f is injective, then the restriction ofto the range of f is the inverse of f.
Suppose that the compactness of X is witnessed by the sequence of finite sequences . Thus for all , there is an in such that . Given a function f with a modulus of uniform continuity h, for each y we will calculate a rapidly converging subsequence such that if y is in the range of f then . We will argue that this calculation is sufficiently uniform that the desired functional I can be found using .
Fix with modulus of uniform continuity h, so that if then . Increasing h if necessary, we may assume that for all k. Using the witness points for compactness, if y is in the range of f, then for all j we have .
Given f, h, and y as above, we can define the desired . If y is not in the range of f, let . If y is in the range of f construct as follows. Let where is the least integer such that:
,
, and
if , then .
The third clause ensures that is a rapidly converging Cauchy sequence. By Proposition 3.6 of Kohlenbach [12], proves that f is sequentially continuous, so the first clause shows that . Informally, the second clause guarantees that each is sufficiently close to a pre-image of y that the construction can continue. We verify this next.
To initialize the construction, we must find . Suppose . Because we can fix an with . Because h is a modulus of uniform continuity, , so clause (1) is satisfied. For any , there is a such that , and so . For such a j and k,
so clause (2) is also satisfied. The third clause is vacuously true. We have shown that for some i, satisfies all three clauses. Let be the least such i, and set .
Suppose has been chosen satisfying all three clauses. By clause (2) for , we can find a sequence of points such that for every j, and . In Theorem III.2.7, Simpson [18] proves the generalization of the Bolzano–Weierstrass theorem for compact metric spaces in , so it is also a theorem of . Consequently, there is a subsequence of converging to a value with and . Choose so that . Clause (1) holds for because h is a modulus of uniform continuity and . For any , there is a such that and so . For such a j and k,
so clause (2) holds for . Finally,
We have shown that all three clauses hold for some choice of i, so let be the least such i and set . This concludes the argument that our construction never halts, yielding the desired pre-image .
It remains to show that suffices to prove the existence of the functional I from the statement of the lemma. Suppose we are given f with modulus h and a value y from the metric space. By Lemma 7.2, is the characteristic function for the range of f. If y is not in the range of f, output y. Otherwise, begin constructing p, searching for an satisfying clauses (1), (2), and (3) above. By , we may use a realizer for to check if clause (2) holds. As argued above, when y is in the range, this process calculates the desired pre-image of y. Summarizing, proves the existence of the function mapping f, h, and y to the desired value. Applying λ-abstraction yields . □
The use of in the previous lemma is necessary, as shown by the following reversal.
().
The following are equivalent:
.
If X is a compact complete separable metric space, then there is a function I such that ifis a function with modulus of uniform continuity h, thenis a function that selects elements from the pre-image of f.
There is a function I such that ifis a function with modulus of uniform continuity h, thenis a function that selects elements from the pre-image of f.
We will work in . By Lemma 7.4, item (1) implies item (2). Item (3) is a special case of item (2), so we need only show that item (3) implies item (1). By Proposition 5.2, it suffices to show that (3) implies .
Given an input for , we will show how to construct a function f with modulus of uniform continuity h such that information about the pre-image of f as provided by in item (3) can be used to calculate for w. In particular, we will control the pre-image of the constant 0 function, denoted . If the first t where is even, we require . If the first t such that is odd, we require . If 0 is not in the range of w, will be the set .
Now we can specify the behavior of f. Let be an element of . Evaluating f at x yields a function , which also maps into 2 and is defined as follows.
.
For , is defined by two cases:
if is not the least t such that then
if is the least t such that then
Routine arguments verify that the pre-image of f satisfies the requirements listed above. Also, if the sequences x and y agree in the first n values, the sequences and also agree in the first n values. Thus, the function is a modulus of uniform continuity for f. The construction of f from w is sufficiently uniform that proves the existence of a function g mapping each to its associated function f.
Let I be the function described in item (3) of the statement of the lemma, and let be the identity function on . The for any , is (an element of ) equal to if the first 0 in the range of w occurs at an even value, and if the first 0 occurs at an odd value. The sequences coding elements of output by I are rapidly converging sequences of finite approximations to or . By the definition of the metric on , the first entry in the third finite approximation for any sequence equal to will be 0, and similarly the value 1 can be extracted from any sequence equal to . Thus uniformly calculates for w. □
The previous results allow us to formulate and analyze some restrictions of Banach’s theorem. For compact complete separable metric spaces, a functional form of Banach’s theorem restricted to uniformly continuous functions is equivalent to the functional existence principle . Note that if f and g have moduli of uniform continuity and , then h defined by is a modulus of uniform continuity for both f and g. As a notational convenience, we will use common moduli of uniformity for pairs of functions.
For a complete separable metric space X, a Banach functional is defined as follows. For injective functions and with a common modulus of uniformity h, is a bijective function such that for all , or . The parenthesized expression denotes the principle asserting the existence of a Banach functional for X.
Emulating Proposition 6.6, the next result proves a version of Banach’s theorem for compact metric spaces using . The reversals and a summary appear in a second result.
().
implies that if X is a compact metric space, then.
Assume and . Suppose X is a complete separable metric space and that witnesses that X is compact. Let f and g be injections of X into X with a common modulus of uniform continuity h. Apply Lemma 7.2 to find the range functionals and . Apply Lemma 7.4 to find pre-image selectors and . Because f and g are injections, the restrictions of these functions to the ranges of f and g are inverse functions. Consequently, we will use the shorthand notation and . Note that the pre-image selectors and are defined for all inputs from X, and that if (for example) y is in the range of f, then . Our goal is to construct the bijection H in the statement of . This is achieved by a back-and-forth construction, alternately iterating applications of and , and basing the value of H on the terminating condition of this process.
First we construct a functional that alternately applies and . Using primitive recursion, define by and for , and . Calculating a few values yields , , , and . As noted above, and are total, so is defined for all x and all n.
A traditional back-and-forth construction using partial inverse functions might halt if, for example, x was not in the range of g, or if was not in the range of f, and so on. Define the function by and for , and . Consider the initial stages of the back-and-forth process displayed in the following table.
If x is not in the range of g then . If is not in the range of f, then . In general, the least n with will be the stage where the traditional back-and-forth process based on partial inverse functions will halt. If the back-and-forth process does not halt, then for all n. Writing for , we may view as a function from into . Apply φ as provided by , and we have if the back-and-forth process halts and if the process never halts.
By and Theorem 5.3, we may also use , a realizer for . Define the functional by
Finally, define the bijection by
One can argue that H is the desired bijection by the usual arguments. Briefly, consider the following diagram representing images and pre-images of an element x from X.
Each element of the lower copy of X appears in at least one bipartite subgraph of the sort pictured. Also, for each y in the upper copy of X, we know , so each element in the upper copy of X appears in at least one bipartite subgraph. Because f and g are injective, each element appears in exactly one bipartite subgraph. The choice of the values of ensure that if the bipartite graph ends on the left, the left most vertex is either in the lower copy of X and in the domain of H, or in the upper copy of X and in the range of H. Thus H is a bijection of X into itself, satisfying the requirements of . □
This section concludes with proofs of two reversals for instances of the previous lemma, summarizing the results for Banach’s theorem on compact metric spaces in the following theorem.
().
The following are equivalent:
.
If X is a compact metric space, then.
.
.
The previous lemma proves that item (1) implies item (2). Item (3) and item (4) are special cases of item (2), so we can complete the proof by reversing (3) and (4) to (1). For the first reversal, suppose is the Banach functional for . Consider the injections f and g defined by . Each x in is represented by a rapidly converging sequence of rationals, and dividing each element of the sequence by 2 yields a rapidly converging sequence representing . Thus proves that f and g are defined and total. The identity function is a modulus of uniform continuity for f and g. Suppose is the bijection satisfying Banach’s theorem for f and g. Consider and the sequence . For each n, is not in the range of g, so . Thus, . The functional H is bijective, so 1 is in the range of H. Fix x with . By the Banach theorem, or . Because 1 is not in the range of F, . Thus , so and . Summarizing,
Thus H is not sequentially continuous at , and follows by Proposition 7.1.
For the final reversal, suppose is the Banach functional for Cantor space. Consider the padding function that adds a zero after each entry in a binary input string. Formally, if , and otherwise. For example,
proves that is defined and total, and that the identity function is a modulus of uniform continuity for . Let and both be . Let be the bijection satisfying Banach’s theorem for f and g. For each n, let consist of n copies of the string 10, followed by 11, followed by zeros. The double 1 ensures that is not in the range of . Thus, for each n, , which consists of n copies of the string 1000 followed by 1010, followed by zeros. Thus is the string 1000 repeated infinitely. On the other hand, is . The string is not in the range of , so . Because , we have . Thus , so H is not sequentially continuous at . The principle follows by Proposition 7.1, completing the reversal and the proof of the theorem. □
We note that the functional R in Lemma 7.2, the functional I in Lemma 7.5, and the functional B in Theorem 7.8 are constructed uniformly in a code for the space X. Hence these functionals could, in principle, be defined with X as a parameter. This is another layer of uniformity in the constructions, although noting the parameter explicitly complicates the notation. It would be interesting to know how changing the representation of a fixed space affects the functionals I and B.
Moduli of uniform continuity
This section introduces a function that computes moduli of uniform continuity. Functions of this sort are not new to the computability literature. For example, the fan functional described by Normann and Tait [15] computes the modulus of uniform continuity for continuous functions on . As shown below, the strength of the existence of the function lies below , allowing us to streamline the definition of Banach functionals and Theorem 7.8.
Suppose X is a compact complete separable metric space and Y is a complete separable metric space. The principle asserts the existence of a function M such that if is continuous, then is a modulus of uniform continuity for f.
Near the end of his article, Kohlenbach [12] presents a functional form of Brouwer’s intuitionistic fan theorem, denoted by . He notes that is a consequence of , is conservative over for second-order sentences, and is inconsistent with . Because proves , the principle is also conservative over for second-order sentences. The next lemma shows that unlike , the principle and similar principles for other spaces are consequences of .
().
If X and Y are complete separable metric spaces and X is compact, thenimplies.
Let X be a compact complete separable metric space with compactness witnessed by the sequence of sequences . Let Y be a complete separable metric space. We will use d to denote the metric in both spaces. For we can define a prospective value of a modulus of uniform continuity for f at m by setting equal to the least n such that:
Informally, is a function from to that resembles a modulus of uniform continuity on the compactness witnesses for X. First we will show that suffices to prove the existence of the function M. Then we will verify that if f is continuous, then is a modulus of uniform continuity for f.
Working in , let X and Y be as above, and suppose . Recalling the reverse mathematical formalization of inequalities in the reals, the formulas and are . Thus proves the existence of a function which is 0 if t codes a witness that there are and such that and , and is 1 otherwise. Note that formula (4) holds if is 1 for all t, and fails if there is a t such that is 0. As noted in Section 5, implies the existence of the function . The λ notation denotes the function that maps each to the value . Applying λ-abstraction (which is a consequence of [12]) and , we can prove the existence of the function . Note that for all f, m, and n, if formula (4) holds and otherwise. By Proposition 2.1, proves the existence of Feferman’s μ, so by and an additional application of λ-abstraction, we can prove the existence of the function . Note that for each f and m, if there is an n such that formula (4) holds, then is the least such n. If there is no such n, for example if f is discontinuous, then still yields some value, but no useful information. By λ-abstraction, proves the existence of . For every , yields a function from to .
It remains to show that if f is continuous then is a modulus of uniform continuity for f. Fix a continuous and . Let . Suppose that satisfy . Choose . Because f is continuous and is dense in X, we can find an such that and . Similarly, find such that and . By the triangle inequality,
Because , and because , formula (4) holds, so . By the triangle inequality,
Summarizing, when f is continuous and , if then . Thus is a modulus of uniform continuity for f. □
The principle allows us to reformulate many of the results in the preceding section. For example, Theorem 7.8 can be simplified as follows, stripping all reference to moduli of uniform continuity.
().
The principleis equivalent to the statement that for every compact complete separable metric space X, there is a functionthat maps each pair of injections from X to X to a bijection satisfying Banach’s theorem.
Assuming , by Lemma 8.2 we may use the principle to find a function M that calculates moduli of uniform continuity for f and g. The pointwise maximum function is a joint modulus of uniform continuity for f and g. If is the function provided by Theorem 7.8 part (2), then the function defined by is the desired Banach function. The converse is immediate from Theorem 7.8. □
Because is a consequence of , the principle does not imply . That is, this restriction of the converse of Lemma 8.2 is not true. The next two results show that like , the second-order theorems of are exactly those of . As part of that proof, the next lemma allows us to change representations of functions, with the eventual goal of applying a traditional reverse mathematics result to show that implies . (The following lemma for functions from Baire space to appears as Proposition 4.4 in Kohlenbach’s [11] contribution to the Feferman volume.)
().
Suppose X and Y are complete separable metric spaces. Suppose that Φ is a code for a totally defined continuous function as described in Definition II.6.1 of Simpson [
18
]. Then there is a functionsuch that for all n, a, r, b, and s, ifthen.
Working in , suppose X, Y, and Φ are as above. Fix . Because x is in the domain of the function defined by Φ, for each m we can find (occurring first in some fixed enumeration of quintuples) such that and . Set . The sequence is a rapidly converging sequence of elements of Y converging to the desired value of . Applying , the system proves the existence of f. (Note that different representations of may yield different representations for , though the function values will be equal in the space X.)
We now verify the last sentence of the lemma. Suppose . Let and choose m so that . Let be the quintuple witnessing the value for . Then and . Let . Then the ball is a subset of , and is also a subset of . Applying property (2) of Simpson’s Definition II.6.1, we have and . By property (1) of Simpson’s definition, . By the choice of m, . By the triangle inequality . Because ε was an arbitrary positive value, . □
The preceding lemma allows us to completely characterize the second-order theory of .
The second-order theorems ofare exactly the same as those of.
As noted before, is a consequence of Kohlenbach’s , and so any second-order theorem provable using is provable in . It remains to show that implies . By Theorem IV.2.3 of Simpson [18], it suffices to show that if f is a continuous function (coded by Φ) on , then f is uniformly continuous. Suppose Φ codes a continuous function on . By Lemma 8.4, proves that there is a function matching the values of the coded function. Applying , the function is a modulus of uniform continuity for f, and so also for the function coded by Φ. Thus Φ codes a uniformly continuous function on . □
For an alternative proof of Proposition 8.5, one could emulate the proof of Proposition 3.15 of Kohlenbach [12], applying the extensional model of the hereditarily continuous functionals.
We conclude by pointing out the potential and limitations of this section. The principle can be viewed as a higher-order analogue of . A number of Skolemized forms of statements equivalent to may be equivalent to over . (But not all, as witnessed by Kohlenbach’s . See Proposition 5.6.) However, may not be the only reasonable candidate for a analogue. For example, reformulating for other spaces or different representations of continuous functions may affect the strength of the principle. There may be combinatorial principles (less affected by the details of formalization) that prove the same second-order sentences as . As noted in the last page of Kohlenbach’s article [12], the strength of these statements is related to the provability of the equivalence of their underlying second-order sentences over intuitionistic base systems.
References
1.
S.Banach, Un théorème sur les transformations biunivoques, Fundamenta Mathematicae6 (1924), 236–239. doi:10.4064/fm-6-1-236-239.
2.
V.Brattka and G.Gherardi, Weihrauch degrees, omniscience principles and weak computability, J. Symb. Log.76(1) (2011), 143–176. doi:10.2178/jsl/1294170993.
S.Feferman, Theories of finite type related to mathematical practice, in: Handbook of Mathematical Logic, Stud. Logic Found. Math., Vol. 90, North-Holland, Amsterdam, 1977, pp. 913–971. doi:10.1016/S0049-237X(08)71126-1.
5.
T.J.Grilliot, On effectively discontinuous type-2 objects, J. Symb. Log.36 (1971), 245–248. doi:10.2307/2270259.
6.
D.Hilbert and P.Bernays, Grundlagen der Mathematik. Vol. II, J. W. Edwards, Ann Arbor, Michigan, 1944, p. xii+498. doi:10.1007/978-3-642-86896-2.
7.
J.L.Hirst, Combinatorics in subsystems of second order arithmetic, PhD thesis, The Pennsylvania State University, 1987. http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqdiss&rft_dat=xri:pqdiss:8728018.
8.
J.L.Hirst, Marriage theorems and reverse mathematics, in: Logic and Computation, Pittsburgh, PA, 1987, Contemp. Math., Vol. 106, Amer. Math. Soc., Providence, RI, 1990, pp. 181–196. doi:10.1090/conm/106/1057822.
U.Kohlenbach, On uniform weak König’s lemma, Commemorative Symposium Dedicated to Anne S. Troelstra (Noordwijkerhout, 1999) (2002), 103–116, ISSN 0168-0072. doi:10.1016/S0168-0072(01)00077-X.
11.
U.Kohlenbach, Foundational and mathematical uses of higher types, in: Reflections on the Foundations of Mathematics, Stanford, CA, 1998, Lect. Notes Log., Vol. 15, Assoc. Symbol. Logic, Urbana, IL, 2002, pp. 92–116. doi:10.1201/9781439863763.
12.
U.Kohlenbach, Higher order reverse mathematics, in: Reverse Mathematics 2001, Lect. Notes Log., Vol. 21, Assoc. Symbol. Logic, La Jolla, CA, 2005, pp. 281–295. doi:10.1017/9781316755846.018.
13.
D.Normann and S.Sanders, On robust theorems due to Bolzano, Weierstrass, Jordan, and Cantor, J. Symb. Log. (2022), 1–51(03 October, 2022). Volume data pending. doi:10.1017/jsl.2022.71.
14.
D.Normann and S.Sanders, On the uncountability of , J. Symb. Log.87(4) (2022), 1474–1521. doi:10.1017/jsl.2022.27.
15.
D.Normann and W.Tait, On the computability of the fan functional, in: Feferman on Foundations, Outst. Contrib. Log., Vol. 13, Springer, Cham, 2017, pp. 57–69. doi:10.1007/978-3-319-63334-3.
16.
J.B.Remmel, On the effectiveness of the Schröder–Bernstein theorem, Proc. Amer. Math. Soc.83(2) (1981), 379–386. doi:10.1090/S0002-9939-1981-0624936-X.
17.
N.Sakamoto and T.Yamazaki, Uniform versions of some axioms of second order arithmetic, MLQ Math. Log. Q.50(6) (2004), 587–593. doi:10.1002/malq.200310122.
18.
S.G.Simpson, Subsystems of Second Order Arithmetic, 2nd edn, Perspectives in Logic, Cambridge University Press, Cambridge; Association for Symbolic Logic, Poughkeepsie, NY, 2009, p. xvi+444. ISBN 978-0-521-88439-6. doi:10.1017/CBO9780511581007.