This article expands our work in (LNCS9709 (2016), 225–233). By its reliance on Turing computability, the classical theory of effectivity, along with effective reducibility and Weihrauch reducibility, is only applicable to objects that are either countable or can be encoded by countable objects. We propose a notion of effectivity based on Koepke’s Ordinal Turing Machines (OTMs) that applies to arbitrary set-theoretical -statements, along with according variants of effective reducibility and Weihrauch reducibility. As a sample application, we compare various choice principles with respect to effectivity.
In mathematics, it has always been of interest to what extent theorems claiming the existence of certain objects are ‘effective’ in the sense that a procedure—an algorithm—can be found that produces an object of the desired kind. In the case of a -statements, which, roughly, are claims that, for every x from a certain range of objects, there is a y from another such range such that x and y relate in a certain (simple) way, effectivity would then mean that some appropriate y can be obtained effectively from a given x. In the finite realm, such questions are studied by the theory of Weihrauch reducibility, see e.g. [3].
With the notion of Turing computability serving as a formalization of the notion of a ‘procedure’, such questions can be turned into mathematical questions that potentially have provable answers; recursive analysis is an example of an area that deals with such questions (see e.g. [16]).
A large amount of standard theorems, however, turn out to be ‘ineffective’ in this sense. For these theorems, it is still possible to compare their remoteness from effectivity by asking whether one -statement, say ϕ, becomes effective once an effectivization for the other, say ψ, is given as an oracle. This yields the notion of effective reducibility. Insisting that only one instance of ψ may be requested when processing an instance of ϕ, one obtains the stronger notion of Weihrauch reducibility. All of these concepts have given rise to thriving areas of mathematics.
However, the reliance on Turing computability has the effect that the areas of mathematics thus surveyed are limited to countable objects or objects that, like separable spaces, can be encoded by countable objects. On the other hand, there seems to be an intuition of effectiveness even for general set theory; see e.g. [19].
So far, the literature on effectivity notions for the uncountable is rather sparse; we mention on the one hand Hodges [19], which is the first such approach known to us, and further [17], in particular the contribution by Shore [26]. While this leads to effectiveness notions suitable for the uncountable, suitable analogues of effective reducibility and Weihrauch reducibility remain unconsidered, save for a remark in [26].
The first paper dealing with effective (Weihrauch) reducibility in the uncountable realm was [6], where such notions were defined and some basic methods for their study were presented and applied. The guiding idea of [6] was to retain the usual definitions as far as possible while replacing Turing machines with Koepke’s Ordinal Turing Machines (OTMs) (see e.g. [22]), that have a working tape of proper class length, the cells of which are indexed by ordinals, and additionally work along an ordinal ‘time axis’. A related piece of work is Galeotti and Nobrega [15], where a notion of Weihrauch reducibility based on OTMs was applied to various problems on generalized real numbers.
In this paper, we will expand the work started in [6] by (i) extending the realm of methods for obtaining results, (ii) considerably extending the number of applications by investigating how various set-theoretical choice principles compare in the sense of ordinal Weihrauch reducibility, (iii) considering connections between reducibility and provability and (iv) extending the notions of reducibility from [6] to arbitrary set-theoretical statements.
Many of the results explained here will also appear in the forthcoming [7], and we want to express our gratitude both to the editors of Computability and the de Gruyter publishing company for the kind permission to use parts of this article in [7] and vice versa. Finally, this paper extends [6] in the sense that the contents of Section 2 and parts of the contents of Sections 3 and 4 can be found there. In most cases, we again present the result, but refer to [6] for proofs.
Setting the stage
In this section, we will briefly recall the underlying model of computation, namely Koepke’s Ordinal Turing Machines (OTMs) and then work towards appropriate notions of effectivity and reducibility for functions and problems on arbitrary sets.
Ordinal Turing machines
Ordinal Turing Machines (OTMs) were introduced by Koepke in [22] as a “symmetric” variant of the Infinite Time Turing Machines of Hamkins and Kidder that were introduced in [18]. The same concept was obtained almost at the same time independently by Dawson and is contained in his thesis [11].
Programs for OTMs are just regular Turing programs, where we imagine the inner states to be indexed by natural numbers. Also, like classical Turing machines, OTMs have a finite number of tapes, split into cells, each of which can contain one element of the alphabet ; moreover, as usual, there is a read-write-head for each tape that moves along the cells, reads out the symbol on the current cell and can replace it when instructed to do so. However, both the “hardware” and the working time of OTMs are considerably expanded: The tape of an OTM has proper class length On, with one cell for each ordinal, and programs are carried out along an ordinal “time axis”. At successor times, an OTM behaves like a classical Turing machine, with the additional rule that, if the head is commanded to go to the left while currently occupying a cell indexed by a limit ordinal, it is reset to the start of the tape, i.e. the cell indexed 0. At limit times, for each cell c, the content of c is the inferior limit of the sequence of earlier contents of c, and likewise, the index of the active program line and the head position are obtained as the inferior limits of the sequence of earlier values. For an explication of OTMs in formal detail, we refer the reader to [22]. In this paper, we will only consider parameter-free OTMs, i.e. we always start with an empty working tape.
In our framework, OTMs work with four tapes, one input tape , one working tape T, one output tape and one “miracle” tape that will be explained below. Occasionally, we will also use a fifth tape, the “oracle tape” , for storing extra information. An input for an OTM is always a set or class X of ordinals, which is given to the OTM by writing a 1 on the ιth cell of if and only if , and otherwise a 0.1
Sometimes, when X is a set, it will be important to know where it ‘stops’, i.e. to know some upper bound for X, e.g. to be able to “exhaustively search” through X. This can clearly be arranged in a number of ways (e.g. by having two input tapes, one for inputs and one for bounds), but in order to simplify our presentation, we will not discuss this point any further.
Thus, OTMs can compute on sets or classes of ordinals. In order to make OTMs compute on sets, we need to fix some encoding of sets by sets of ordinals, which we do as follows: For a transitive set x, pick some bijection between x and some ordinal, and then let . Then is an “α-code for x via f”; when c is an α-code for x via f for some α and f, we say that c “codes” x. Now, for arbitrary sets x, a code for x is a code for , where denotes the transitive closure of x.
Thus, OTMs can be used to compute on arbitrary sets, and we can say that a (class) function (where V is the set-theoretical universe) is OTM-computable if and only if there is an OTM-program P such that, for each and each set c of ordinals that codes , P will, when run with c on the input tape and all other tapes empty, eventually halt with some d on the output tape such that d codes . Note that we allow the specific code written to the output tape to depend on the code chosen for the input. If is such that, for any OTM-computable and any , we have , then we call X ‘closed under OTM-computability’ or ‘OTM-closed’. Similarly, a class is OTM-decidable if and only if there is an OTM-program that, on input c coding x, halts with output 1 if and only if and otherwise halts with output 0.
We recall a few results about OTMs that will be relevant below.
[Seyfferth, Schlicht, [
25
]] A set x is coded by an OTM-computable set if and only if, where σ is minimal such that.
[Koepke, [
22
]] There is a (non-halting) OTM-programthat “enumerates L”, i.e. that successively writes codes for all constructible sets (and no others) to the output tape.
[Koepke, [
22
]] There is an OTM-programsuch that, for any-statement ϕ and all sets,halts with the truth value ofon the output tape. Here,is the Gödel number of ϕ andare codes for the sets.
Effectivity of -statements
It has become customary to associate the term “effective” with “recursive” in mathematical contexts. However, as e.g. Hodges has pointed out in [19], there is apparently an intuition about a notion of effectiveness that goes beyond Turing computability, according to which e.g. forming quotient fields of integral domains is effective, while obtaining maximal ideals of rings is not (cf. Hodges, [19], p. 1). Hodges proposes a variant of the primitive recursive set functions of Jensen and Karp [21] as a formalization of this notion. Partly motivated by our argument in [5] that there are good reasons to accept OTMs as a formal counterpart to the intuitive notion of “higher effectivity”, we base our approach on OTMs instead.2
Though not formally equivalent, the two approaches agree with respect to the problems considered by Hodges.
Another reason is that the OTM-approach seems more suited to considering reducibility between problems, which is not considered by Hodges. A slight change in perspective is that the emphasis in Hodges’ paper is on “construction problems”, while ours is on statements. However, for the transition from “problems” to “statements”, it suffices to note that -statements are naturally associated with “problems”:
Let ϕ be a -sentence in the language of set theory, say ϕ is , where ψ is . We say that ϕ is “witnessed” by the proper class function if and only if holds for all . In this case, we also call F a “canonification” of ϕ.
More generally, when is a binary relation on sets, we say that F is a canonification of R if and only if, for every , we have .
A binary relation is ‘OTM-effective’ if and only if there is an OTM-computable canonification F for R. Fixing the notation for a -formula ψ, we say that the -statement with is OTM-effective if and only if has an OTM-computable canonification F; in this case, we also say that F is a canonification for ϕ.
We say that an OTM-program P is “functional” if and only if P computes a class function . When P is functional, we denote the function computed by it by .
In fact, this definition of effectivity can be generalized to arbitrary sentences in the language of set theory by (roughly) replacing Turing machines with OTMs in Kleene’s realizability interpretation intuitionistic logic. The above definition then becomes a special case of this more general approach and one can show that the axioms of KP are OTM-effective, while the “impredicative” ZFC-axioms, like the power set axiom or separation for formulas with arbitrary use of unbounded quantifiers, are not. We will briefly touch on this topic in Section 6 below and further refer the interested reader to [7,8] and the detailed study [9] for more information. We fix some further terminology. For , , we say that X is F-closed if and only if for all .
For , X is R-closed if and only if it is F-closed for some canonification F of R. Moreover, for , , we let and .
Clearly, every OTM-effective -statement ϕ holds in V. If we also require ϕ to be “provably OTM-effective”, i.e. that the existence of the desired program is provable in ZFC, then ϕ will be provable in ZFC. We will see below that not every -consequence of ZFC must be OTM-effective. However, if the universe is small, this is indeed the case:
If, then every-consequence of ZFC is effective.
Let be a -consequence of ZFC, where ψ is . Given a code c for a set x on the input tape, run from Theorem 1. Whenever outputs a code for a set y, use to check whether holds. If not, continue with . Otherwise, output and halt. Clearly, this will compute a canonification of ϕ. □
Reducibility
Though studying the effectivity, or otherwise, of various mathematical statements is certainly an extensive and worthwhile task, we will mostly be concerned with a more general notion, namely the “reducibility” of one statement or problem to another. In order to formalize this, we need to be able to use class functions as “oracles”. We will write to indicate that the OTM-program P is run in the oracle F. This is where the “miracle tape” comes in. We reserve a special inner state s as our “miracle command”. Now, whenever the state s is assumed during a run of , the following takes place: The content of is read out; if it does not code a set, nothing further happens and the program execution is continued. Otherwise, if it codes the set x, then the content of is replaced with a code for and then the execution of the program is continued. Thus, whenever the state s is assumed, a code for the F-image of the set coded by the content of the miracle tape appears on the miracle tape.
We recall some standard notation. As usual will denote the composition of the (possibly proper class) relations or functions Q and R. By slight abuse of notation, we will also write or for functional OTM-programs P, Q and a (class) function F to denote the (class) function obtained by composing F with or with . When F and G are (class) functions or functional OTM-programs, denotes the “parallelization” of F and G, i.e. the function mapping a set x to the ordered pair .
Now we can define our reducibility relations:
Let . We say that R is “OTM-reducible” to if and only if the following holds: There is an OTM-program P such that, for every canonification F of , computes a canonification of R. In this case, we write .
In this setting, arbitrarily many instances of may be used in computing a canonification of R. We may also demand that each instance of R is reduced to a single instance of . This yields the following reducibility notion, adapted from the classical notion of Weihrauch reducibility (see e.g. [2]) which is the one we will be primarily interested in in this article:
Let . We say that R is “ordinal Weihrauch reducible” to if and only if there are functional OTM-programs P and Q such that, for every canonification of , is a canonification of R (where denotes the identity function).
In this case, we write and say that witnesses the ordinal Weihrauch reducibility of R to .3
In [6], the notation was for ‘generalized Weihrauch reducibility’. D. Dzhafarov pointed out in his review of [6] that this name is already taken and suggested ‘ordinal Weihrauch reducible’ instead, a suggestion which we thankfully follow here.
The corresponding concepts and notations for -statements are defined in the obvious way.
If the above holds with being a canonification of R (i.e. with the identity function deleted in the middle term), we say that R is ‘strongly ordinal Weihrauch reducible’ to and write .
For each of these reducibility relations, we write with the respective index i to express that and .
If is an ∈-formula, we write for the construction problem and call the associated construction problem for ϕ. If and are -statements, we write to indicate that . If θ is another -formula, we write for .
Thus, in ordinal Weihrauch reducibility, the “preprocessed input” set is retained and passed on to for further processing, while in strong Weihrauch reducibility, it is “forgotten”. We note the obvious fact that strong ordinal Weihrauch reducibility implies ordinal Weihrauch reducibility, which in turn implies OTM-reducibility. We will see below that the second implication cannot be reversed. For the first one, this is easy to see: For example, every OTM-effective function is -reducible to any relation , while, for instance, when and 0 denotes the identitity and the constant zero function on V, respectively, then , as the image of will be for any choice of F and G, while the image of is clearly class-sized. This idea is well-known from the classical setting; see Proposition 12 below for a general version. Though seems to be the more natural reducibility relation, most reducibility results we will prove below will in fact be strong reducibilities, a phenomenon also well-known from the classical setting. (Our nonreducibility results, however, will mostly apply to .)
Furthermore, we note that all three reducibility relations are reflexive and transitive and the corresponding ≡-relations are thus equivalence relations.
We have stated the definitions of effectivity and reducibility in terms of proper classes. While this has the advantage of expressing the intuition behind these concepts most straightforwardly and is also in agreement with the approach taken in [6], it has some less attractive technical consequences in the proofs; for example, in the considerations about versions of the axiom of choice, we assume global choice in the background theory. This could be avoided4
We thank our anonymous referee for pointing this out to us.
by defining canonifications and reductions not on the whole set-theoretical universe, but rather on set-sized restrictions thereof, such as the levels . The proofs below would then go through in ZF. However, for the sake both of simplicity and keeping the concepts intuitive, we phrase our results in terms of classes.
To avoid misunderstandings, a note may be in order about the relation of ordinal Weihrauch reducibility to Weihrauch reducibility in the classical setting. In the classical theory of Weihrauch reducibility, it is not required that the reduction is coding-independent, i.e. yields same result on a code of a set x, no matter which code is chosen. In fact, demanding coding-independence in the classical setting would have rather detrimental effects, see e.g. [4]. We thank A. Pauly for pointing this out to us.
Basic methods
In this section, we will recall the methods for proving non-reducibility that we adapted in [6] from the methods used by Hodges in [19] for proofs of non-effectivity and add some more possibilities adapted from the classical theory of Weihrauch degrees, see [2].
Our first method for proving nonreducibility is adapted from Hodges’ “cardinality method”, see Lemma 3.2 of [19]. First, let us fix some terminology (cf. Hodges, [19]): For , we say that a class function ‘raises cardinalities above α’ if and only if there is some set x with such that . For , we say that R raises cardinalities above α if and only if every canonification of R raises cardinalities above α.
Let, and let α be an infinite ordinal. Suppose that R raises cardinalities above α and. Thenraises cardinalities above α. Thus, a relation that raises cardinalities above α is not OTM-reducible—and hence also not ordinal Weihrauch reducible—to one that does not.
In particular, if R raises cardinalities above ω, then R is not effective.
See [6], Lemma 1. The main ingredient is the well-known observation that OTM-computations cannot output sets of larger cardinalities than their input when the latter is infinite. □
The relation between an ordinal and its cardinal successor.
The relation between an ordinal and its cardinal successor in L.5
This requires relativizing the above to L.
The relation between a set and its (constructible) power set.
The relation between a linear ordering and its completions.
For the separation of -statements, however, the cardinality method is useless, as binary -relations never raise cardinalities for any infinite α by an easy collapsing argument.
Our second method is adapted from Hodges’ “Forcing Method”, see Hodges [19], Lemma 3.7. The idea is that, when is OTM-closed and -closed, then implies that it is also R-closed. Thus, to show that , it suffices to find some OTM-closed set or class M with some such there is no with .
If X satisfies the well-ordering principle and thus contains codes for all of its elements, then in order to guarantee OTM-closure, it suffices that M is a union of transitive KP-models that contain all ordinals (or is even only of sufficient ordinal height).
Below, we will be particularly interested in separating various versions of the axiom of choice. Thus, M will in general not satisfy the well-ordering principle. The problem is that, in this case, OTM-closure is not obvious: As M will not contain encodings for all of its elements, it will also not contain the corresponding computations and is thus not guaranteed to contain the set coded by the output. The idea then is that, for , there should be two mutually generic filters , for some forcing that makes x well-ordered. and will contain codes for x and also all halting OTM-computations on these codes, along with their outputs and the sets they code. For an OTM-program P that computes a class function , we will thus have , and if M is a model of sufficiently much set theory to guarantee that , it follows that , as desired.
Thus, it is necessary to guarantee the existence of the required filters. This requires some extra assumption beyond ZFC. Indeed, as noted above (see Proposition 3), some largeness assumption is necessary, for under , all -consequences of ZFC are (trivially) ordinal Weihrauch reducible to each other. To prove the results below, one needs to ensure the existence of certain generic filters, which follows from the (rather harmless) extra assumption that some sufficiently large L-cardinal is countable in V.6
This will in particular follow when we assume that exists, as it implies that every forcing with a parameter-free definition in L is countable and thus has a generic filter, see e.g. p. 59 of Friedman [14], thereby covering all cases we are interested in below.
For a set x, we denote by the set , partially ordered by ⊆, which forces x to be well-ordered in order type ω.
(Cf. [6], Thm. 8).
Let, and let M be an increasing union of transitive, class-sized KP-models such that. Suppose that there are a canonificationofand a set x such that M is-closed, but does not contain a y with. Moreover, suppose that there are two filtersandthat are mutually-generic over M. Then.
See [6]. We sketch the proof for the convenience of the reader: The condition on and implies that x has codes in and ; also, the forcing will preserve enough of KP so that all halting OTM-computations starting on these codes will be contained in both extensions. So both extensions will contain the coded set, which will thus lie in the intersection. Thus will be contained in M for all functional OTM-programs P and Q, while by assumption, no y with is contained in M. Hence, there are no P and Q that witness the ordinal Weihrauch reducibility of R to , as desired. □
The condition that M contains all ordinals is unnecessarily strong: It is only needed to ensure that the generic extensions are “long” enough so that all OTM-computations that do in fact halt halt there. Further, the condition “increasing union of transitive, class-sized KP-models” can be further weakened e.g. to M being a model of Mathias’ provident set theory (see [24]). So far we have not found applications for these variants.
Further methods
We mention a few other ways for proving nonreducibility. In general, many of the methods used in the classical theory of Weihrauch reducibility depend so little on the specific model of computation that they immediately carry over to our context; we give two examples of this observation, namely Propositions 10 and 12. The intuition between the following two propositions is that, when R is ordinal Weihrauch reducible to and there are subclasses X and Y of V that are closed under OTM-computability such that, for all , there is with , then the same holds for R; more informally, when ‘simple’ instances of have ‘simple solutions’, then the same holds for R.
(For the classical version, cf. Brattka et al., [2], Proposition 12.1).
When A is a set or a class of ordinals, a set x is called OTM-computable in A if and only if there is an OTM-program P such that, when P is run with A on the oracle tape, it halts with a code for x on the output tape.
Now let A and B be classes of ordinals, and let. Suppose that, whenever x is OTM-computable in A, then there is some y that is OTM-computable in B such thatand that. Then it also holds that, whenever x is OTM-computable in A, then there is some y that is OTM-computable in B such that.
In particular, when there is a canonification ofthat sends each OTM-computable set to an OTM-computable set and, then there is such a canonification for R as well.
The proof is the same as that for the classical version in [2]: Assume that the assumptions of the proposition are satisfied, let witness the reducibility of R to , and let x be OTM-computable in A. Let be the set coded by the output of Q when run on a code for x as its input. Then is also OTM-computable in A. Thus, by assumption, there is a set that is OTM-computable in B such that . Similarly, if y is the set coded by the output of P when run on a code for as its input, then y is OTM-computable in B, as y is. As witnesses the ordinal Weihrauch reducibility of R to , we have . Thus y is as desired. □
For , denotes the supremum of the halting times of OTMs that start with c on the oracle tape. For a set x, we let .
Letsuch that, and letbe OTM-closed and. Suppose that, for every, there is y such thatand. Then it also holds that, for every, there issuch thatand.
In particular, if any OTM-computable instance x ofhas a solution y with, then the same is true of R.
Let witness the reduction of R to and let be an instance of R. Let be the set coded by the output of when c codes x. Then by transitivity of computations. By assumption on , there is with a code such that and . If d is the output of P run on the input , then as above. Consequently, if y is the set coded by d, then and , so y is as desired. □
Another smooth generalization of a classically important observation to our setting is the following:
(For the classical version, cf. Brattka et al., [2], Section 13).
Let,,OTM-closed. Suppose that R is ‘slim on X’, i.e. for every, there is somesuch that z is unique with. Then.
The proof of Proposition 13.1 in [2] carries over almost verbatim. □
An application: Choice principles
We will now apply the methods obtained above to a comparison between various set-theoretical choice principles with respect to ordinal (Weihrauch) reducibility. Unfortunately, if the -statement ϕ is false in V, then every -statement ψ reduces to ϕ: As there are no canonifications of ϕ, the statement that every such canonification can effectively be turned into one of ψ is vacuous.7
We thank our anonymous referee for pointing this out.
To avoid this, we will assume in this section that the set-theoretical universe possesses a definable well-ordering and thus satisfies global choice, i.e there is a class function such that, for all nonempty x, we have .
Specifically, we will deal with the following statements, further variants of which will be introduced below; all of these statements are and thus naturally associated with a “construction problem”.
“Every non-empty set contains an element” (PP, “picking principle”)
“Every non-empty finite set contains an element” ()
“Every 2-element set contains an element” ()
“Every non-empty set has a non-empty finite subset” (MPP, “multiple picking principle”)
“For every family X of pairwise disjoint, non-empty sets, there is a set x that has finite non-empty intersection with any element of X” (MuC, principle of multiple choice)
“Every family of pairwise disjoint, non-empty sets has a system of representatives” (AC, axiom of choice)
“Every family of non-empty sets has a choice function” ()
“Every vector space has a basis” (VC)
“Every set can be well-ordered” (WO, well-ordering principle)
“Every partially ordered set has a maximal chain” (HMP, Hausdorff’s maximality principle)
“Every partially ordered set in which every chain has an upper bound contains a maximal element” (ZL, Zorn’s Lemma)
We begin with some obvious observations:
.
.
When ϕ is any of the above principles, then.
(i) The first three reducibilities work via , where P computes the identity function. For , given a non-empty set x to pick from, pass over to ZL, i.e. the partial ordering on x where all elements are maximal; then any canonification of ZL will pick out an element of x. For the other direction, given a partially ordered set satisfying the condition of ZL, ZL implies that it will contain some maximal element. Given , it is easy to compute the set of maximal elements. Applying PP to M then yields a maximal element of x. For reducing PP (or ZL) to AC, hand over to a canonification of AC when an instance x of PP is given as the input.
(ii) is trivial. works via the obvious OTM-effectivization of the equivalence proof over ZF, see e.g. Devlin [12], Section 2.7.
(iii) Again, the usual proofs of implication over ZF effectivize on OTMs. For the picking principles, one well-orders the set x and then picks the minimal element. For VC, given a well-ordered vector space V, it is easy to see that the subset of V consisting of those elements that are not linearly generated by those preceding it in the well-ordering forms a basis of V (see e.g. Shore, [26]) and thus OTM-computable from such a well-ordering.
(iv) This works by an easy effectivization of the proof in Blass [1]. □
The reduction of ZL to PP is somewhat unsatisfying. We hence propose to consider HMP instead, which is the combinatorial “core” behind ZL.
When arbitrarily many instances of a problem can be used, the hierarchy collapses:
We have. Thus, all of the above principles exceptandare-equivalent to WO.
Again, the usual proof effectivizes. Let P be an OTM-program that works as follows: Given a (code of a) set x, iteratively use PP to pick elements from the remaining set and exclude them from the set; eventually, every element of x has been picked, and the order in which they were picked is a well-ordering of x. □
Though PP may seem too obvious to be called a “principle”, note that this is not so in terms of effectivity: There is no general prescription for picking an element from an arbitrary set! Indeed, not even the simplest version is effective:8
Recall that we are working under set-theoretical extra assumptions that guarantee the existence of the required generic filters below (which in particular follow from, but are much weaker than, ).
is not effective.
Suppose otherwise, and let P be an OTM-program that computes a canonification F of . By Jech [20], Section 5.4, there is a symmetric extension M of L in which ZF holds while , the axiom of choice for countable sequences of pairs, fails. Let be such a sequence without a choice function in M. The model M is a symmetric submodel of a generic extension of L, where G satisfies the countable chain condition; thus, the cardinals in are the same as the cardinals in L. The and are sets of real numbers and thus contained in , so we have . Thus, the forcing for adding a bijection between and ω is contained in and its power set is contained in .
By our set-theoretical assumptions, , and hence
, is countable; thus there are two mutually generic filters over M that make well-ordered. Then both generic extensions, say N and , will contain codes for each , so P can be applied to each such pair in both models, and thus, is definable over N and . Thus will be a choice function for s contained both in N and by replacement, and so , a contradiction. □
A similar idea allows us to separate versions of the picking principle:
. Thus. In particular,for all.
We write for the axiom of choice for finite sets. Also, let us write ‘LO’ for the statement that every set can be linearly ordered. It is well-known that LO implies (given a set X of finite sets, pick a linear ordering of and then pick the minimal -minimal element of each element of X).
Now suppose otherwise, and let witness the reduction of PP to . By Theorem 5.21 of [20], there is a transitive set model of ZF+LO (and thus of which contains a set without a choice function. Under the assumption that sufficiently large L-cardinals are countable (in particular, when one assumes ), the same construction works over the ground model L (as the required filters exist) and yields a transitive class model M with the same properties. Moreover, M can be taken to be a submodel of a cardinal-preserving generic extension . In fact, X can be taken to be .
Let be a set without a choice function in M. By the above, we can assume that . For each code of an element , let be the set coded by the output of the computation .
By our set-theoretical extra assumptions, the forcing for making well-ordered is countable in V and thus has two mutually generic filters , over M. Thus, all elements of X have codes both in and , and therefore, the corresponding Q-computations exist in and . Thus, for each , we have and, by replacement in and , in fact . Now, since a canonification of is applicable to for each , is a set of finite sets. By in M, has a choice function . By global choice, let be a canonification of that extends f. Thus, is a canonification of PP.
Let . By the same argument as above, the collection Z of sets coded by the outputs of P when applied to an element of Y is contained in two mutually generic extensions of M that make well-ordered and thus we have .
But now, by the assumptions on P, Q and F, Z is a choice function for X contained in M, a contradiction. □
The same approach works to turn many independence results over ZF between versions of AC into non-reducibility results between versions of PP. Thus, for example, the results given in Section 7.4 of Jech [20] imply in the same way that and are -incomparable, that etc. Moreover, it in fact suffices to construct such (transitive, class-sized) models of KP (and possibly less), which is enough to guarantee the existence of the relevant computations.
On the other hand, by an obvious OTM-effectivization of the proof of the implication over ZF in [20], Example 7.12, we also get:
Is ? In general, let us write for the problem of picking k elements from a set with n elements. Then how do the relate with respect to and ?
.
.
Clearly, PP reduces to AC, since, given a non-empty set x, an element of x is obtained by applying AC to . So
We now want to show that the reduction is strict, i.e. that . To this end, suppose that F is a canonification of PP. Now let M be a transitive proper class such that . Being transitive, M contains for every non-empty .
Now M is closed under F, which is a canonification of PP, but contains some x contradicting AC. By Lemma 9, .
The arguments for (ii) and (iii) are completely analogous. □
On the other hand, given that PP is a provability-wise trivial principle, the following proposition about its computational power may come as a surprise. However, this becomes quite reasonable once one notices that a canonification of PP is in fact a global choice function.
Suppose that ϕ is a set-theoretical-statement such that. Then. In other words, PP is “-universal for ZF”.
Let a be an arbitrary set and write ϕ in the form with ψ a -formula. Let us write .
We show how to compute a (code for a) set Y such that holds for all . This is achieved by the OTM-program P that, given a code for a, enumerates until the first -level is found that contains some y such that . Whether or not holds is easily OTM-decidable, since ψ is .
Then the output is . As is a model of ZF, such a level must eventually be found, so the program always terminates with a non-empty output Y that has the desired properties.
Now applying PP to Y yields some b such that . Thus, we have reduced ϕ to PP. □
We have an analogous statement to Proposition 19 for ZFC and WO in place of ZF and PP: Namely, WO is, with respect to set-theoretical -statements provable in ZFC, maximal for oW-reducibility:
Let ϕ be a-statement in the language of set theory such that. Then.
Let , where ψ is . We describe two programs P and Q such that witnesses the oW-reducibility of ϕ to WO. Let F be an arbitrary canonification of WO. P works as follows: Given a code c for a set x, compute a code for tc(x), the transitive closure of x. Then will be a code for a well-ordering of tc(x). Let ⪯ be that well-ordering. Now Q works as follows: Given , a well-ordered set, will be a model of ZFC, so by assumption. Now Q enumerates until some is found that contains such a z and returns the -smallest such z.
Clearly then, is a canonification of ϕ. □
Finally, we note two further separation results (under our set-theoretical extra assumptions granting the existence of the necessary generic filters):
As we already know , it suffices to show that . This is shown in [[6], Thm. 10]. The idea of the proof is that Zarach’s construction of a model of in [27] can (under the set-theoretical assumptions necessary to guarantee the existence of the required generic filters, which follows, e.g., from ) be carried out over L and yields a transitive class model M of the same theory and contains a counterexample to WO that can be well-ordered by forcing over M in two mutually generic ways, so that the assumptions of Lemma 9 are satisfied. □
The preceding result is plausible, as the proof of over ZF applies AC to the power set of the set to be well-ordered, and power sets are not computable by Corollary 8. The following is thus to be expected. As usual, for , we write for ; moreover, we write “Pot” for the relation . Finally, we recall that is associated with the relation that holds between families of non-empty sets choice functions for these families.
.
follows by the obvious effectivization of the proof of the implication over ZF. Since a choice function on the power set of a set x has the same cardinality as that power set, strictness follows from Lemma 7. □
Note that we saw above that . Thus, we have seen that ordinal Weihrauch reducibility is strictly stronger than OTM-reducibility.
Figure 1 summarizes the situation; is indicated by arrows, by dotted arrows and by a dashed arrow. All indicated oW-reducibilities (whether strict or not) are strong.
Reducibility relations between various choice principles.
We do not know the answers to the following questions:
Is (or even )?
Is ?
Is ?
Effectivity and provability
A question that has recently received attention in the classical theory of Weihrauch reducibility is whether the reducibility of a statement ϕ to another statement ψ corresponds to the provability of the implication in some logical calculus; partial answers to this have been obtained in Kuyper [23]. We expect a similar analysis to work in the transfinite setting. As a first observation, we show below that reducibility of a set-theoretical -sentence ψ to for another such -sentence ϕ in the sense of (where arbitrarily many applications of a canonification are allowed) is implied by the provability of in KP. (PP is necessary here to pick a particular witness after a non-empty set of witnesses has been determined at the end of the construction.) The result is certainly not optimal, so that one cannot expect the converse to hold; and in fact we have , while the implication is not even provable in ZF, let alone in KP.
Letbe, and let a be a set. If there is b such thatholds in V, then there issuch thatholds in V and such that.
Form the elementary hull of in level that contains b and use condensation. □
Let ϕ be . A canonification of ϕ is called ‘reasonable’ if and only if for every set a.
Suppose that F is a canonification of the-statement ϕ. Then a reasonable canonification G of ϕ is OTM-computable from F.
Effectivize the proof of Lemma 23. That is, suppose a is given; to simplify the argumentation, we will assume that a is transitive, which we can assume without loss of generality by replacing a with if necessary. Use the miracle command to obtain . Then 9
We write for .
will contain some b with (namely ), and, by Lemma 22, it will contain such a b of the required cardinality. It thus suffices to enumerate using the relativized version of and output the element that is minimal in the sense of the canonical well-ordering of . □
Let, and suppose that. Then.
Let ϕ be and let ψ be , where and are . Moreover, let F be a canonification for ϕ and let a be an instance for ψ. By Corollary 25, we may assume without loss of generality that F is reasonable.
In order to compute, relative to F, a (code for a) set b such that , we proceed as follows:
Given a, compute up to the first such that . Let .
In general, let be minimal such that and let . For δ a limit ordinal, let .
There is a limit ordinal β such that.
To see this, first note that all are transitive and increasing unions of admissible sets. Clearly, whenever δ is a limit ordinal, as such sets are closed under F. Also, all axioms of KP expect -collection and -separation will clearly hold in such an . It thus suffices to show that there is a limit ordinal δ such that -collection and -separation holds in .
Let κ be the smallest regular uncountable cardinal . We claim that is as desired. Since F is reasonable, it follows inductively that for all . To see that -collection holds in , let , and let H be a -function over such that H is defined on u. We claim that there is such that H is defined on u in (i.e. contains the necessary witnesses); this suffices, as will satisfy -collection and thus contain the desired superset of , which will still work in by upwards absoluteness of -formulas. Now, if this failed, then the set of ξ such that a new value of H for some element of u is defined in would be cofinal in κ and have cardinality , which, by our initial observation, contradicts the regularity of κ. Thus -collection holds in . The same argument shows that satisfies -separation, and a fortiori -separation. This concludes the proof of the claim.
Now, to compute a b as desired, compute the sequence until a limit ordinal γ is found for which is admissible; that such a γ exists follows from the claim, and thus it will eventually be found. As and , we have . As , there is such that , and thus, by absoluteness of , will hold in V. But now, the set B of all such b from can easily be determined by searching through . By one application of PP to B, we finally obtain the required witness. □
Are there logical calculi C and axiomatic systems T such that reducibility of ψ to ϕ in the sense of corresponds precisely to the provability of the implication from T in C? What about or ? It seems particularly tempting to look for connections to constructive set theory here. Some first results in this direction can be found in [8], which is extended in [9].
We conclude by observing that the correspondence between provability and reducibility has its limits, at least as long as the underlying calculus is classical.
Let us say that an ∈-theory T effectivizes a binary relation if every model M of T has, for every , some such that .
Let T be an OTM-computable ∈-theory with transitive models in L. Then there is an ∈-definable binary relationthat is effective, but for whichis not provable in T.
Moreover, if the existence of transitive models of T is consistent with ZFC, then so is the existence of such a.
Consider the relation that holds between each set x and each transitive model of T in L. Then has an effective canonification: The program P that enumerates L until the first transitive model of T is found and then outputs that model is of the desired kind. But T, being consistent, cannot prove that there are such models. □
Consequently, for classical logic, the answer to our above question is negative. Given that connections between reducibility and provability have been based on intuitionistic or linear logic in the classical case, this is hardly surprising.
Beyond
The setting introduced above is restricted to -formulas. However, there are at least two ways to generalize canonifications, and thus reducibility notions, beyond this point, which we want to sketch briefly in this section. For technical convenience, let us assume from now on that all formulas are given in prenex normal form and let us consider the -statement , where ψ is . A straightforward generalization of the notion of a canonification given above Definition 2 would be to say that a canonification of ϕ is a function such that, for all , we have . Let us call this a “superficial” canonification, or s-canonification, of ϕ. A more radical approach would be to demand the canonification to “penetrate” the whole quantifier nesting; so we define a “thorough” canonification, or t-canonification, to be an n-tuple of functions with such that, for all and all , we have .
In the spirit of Definitions 4 and 5 above, we can then introduce reducibility relations between -statements for arbitrary n. For s-canonifications, the definitions carry over verbatim. For t-canonifications, let , .
We say that ϕ is -reducible to and write if and only if there is an OTM-program P using n “miracle” tapes and m output tapes , such that, whenever is a canonification of and carrying out the “miracle” command on replaces the i-tuple coded by the current content of by a code of its image under , then P computes the components of a canonification of ϕ on its output tapes.
We say that ϕ is ordinal -reducible to and write if and only if there are there are functional OTM-programs P and Q with and such that, for all and all canonication of , if and , then the function tuple such that maps to for is a canonification of ϕ.
In order to be able to consider cases like the ZFC axioms, we can slightly further extend this to axiom schemes and say that is ordinal Weihrauch reducible to if and only if we have a program P that, for any , computes a and further reduces to .
By an easy adaptation of the proof, one obtains a variant of Lemma 7, which can be used to show that the power set axiom is not reducible to any other ZFC-axiom, while an obvious effectivization of the well-known proof of the implication shows that the separation scheme is reducible to the replacement scheme in this sense.
The ideas sketched in this section are closely related to the concept of realizability for OTMs, which are discussed in [7] and studied in detail in work with Galeotti and Passmann [9].
Conclusion and further work
We have introduced notions of effectivity, reducibility and ‘case-wise’ reducibility applicable to mathematical objects of arbitrary cardinality. The approach to effectivity is supported by the remarkable conceptual stability of ordinal computability (see e.g. [13] or [5]) and moreover, while not equivalent to e.g. the approach by Hodges, agrees with it concerning the results obtained so far. With regard to reducibility, we have seen how set-theoretical techniques can be used to distinguish between various versions of set-theoretical principles usually regarded as equivalent.
Clearly, there is a host of questions asking which statements are effectively reducible or (s)oW-reducible to which others. This may be viewed as a cardinality-independent version of reverse mathematics (as e.g. considered in [26]) and the theory of the Weihrauch lattice. In particular, we did not consider important choice principles like the Boolean Prime Ideal theorem, the principle that any set can be linearly ordered, Tukey’s lemma, Tychonov’s theorem etc. Apart from that, it may be interesting to consider variants of these notions with parameter-free computability replaced by other models of transfinite computation, like Infinite Time Turing Machines ([18]) or OTMs with ordinal parameters. Another worthwhile topic would be to replace (relativized) computability with (relativized) recognizability (see e.g. [10]).
Although Hodges did not introduce reducibility notions in his [19], the notion of effectivity proposed there, based on primitive-recursive set functions, can easily be supplemented with such concepts following the example of Definitions 4 and 5. A potential advantage is that primitive recursive set functions apply immediately to sets with no need for coding, and thus the technical issues that in our approach need to be dealt with using forcing no longer present themselves. Moreover, it will be interesting to see whether reducibility and nonreducibility results like those obtained in the present paper are ‘stable’ in the sense that they hold for various formalizations of reducibility in the transfinite.
Another topic from the classical setting that might lead to interesting results in the uncountable realm is that of ‘decomposability’ of statements or relations; like, e.g., in [2], we can say that the -statement ϕ is ‘oW-decomposable’ if and only if there are -statements such that ϕ is a -least upper bound for ψ and in the -ordering. As a special case, we say that a -statement ϕ is ‘partitionable’ if and only if there are disjoint OTM-decidable classes such that and such that both and are strictly oW-reducible to .
A crucial step in studying partitionability of statements would be the verification of the following conjecture, which is currently open:10
See, however, the discussion on MathOverflow at https://mathoverflow.net/questions/314053/delta1-2-and-degrees-of-constructibility-textbfon-sets.
Let be OTM-computable. Then one of and contains sets of every degree of constructibility.
Relative to this conjecture, it is rather easy to obtain the following from the fact that the well-ordering principle holds in any when X is a well-ordered transitive set:
WO is not partitionable.
Finally, various notions from classical computability theory could be incorporated into our framework: For example, one should be able to make sense of the concept of a ‘random construction’ and ask whether there are interesting non-effective constructions that are reducible to them. We will also consider candidates for a sensible notion of a ‘jump operator’ for construction problems, a notion that led to a number of fascinating results about Weihrauch reducibility ([2]).
Footnotes
Acknowledgements
First of all, we want to express our gratitude to the organizers of the Dagstuhl workshop “Measuring the Complexity of Computational Content: From Combinatorial Problems to Analysis” for providing an inspiring incentive to continue our work on this topic and writing this paper.
We thank the anonymous referees of our CiE 2016 paper [] on which this paper is based for their help in improving the presentation. Moreover, we thank the anonymous referee of the current paper and A. Pauly for several valuable corrections, comments and suggestions.
Finally, we thank De Gruyter for the permission to use some of the material appearing in chapter 8 of the forthcoming [] in this article.
References
1.
A.Blass, Existence of bases implies the axiom of choice, in: Axiomatic Set Theory, S.S.J.E.Baumgartner and D.A.Martin, eds, American Mathematical Society, 1984, pp. 31–33. doi:10.1090/conm/031/763890.
2.
V.Brattka, G.Gherardi and A.Marcone, The Bolzano–Weierstraß theorem is the Jump of weak König’s lemma, Annals of Pure and Applied Logic163 (2012), 623–655. doi:10.1016/j.apal.2011.10.006.
3.
V.Brattka, G.Gherardi and A.Pauly, Weihrauch Complexity in Computable Analysis (2017).
4.
V.Brattka, S.Roux and A.Pauly, Connected choice and the Brouwer fixed point theorem, Journal of Mathematical Logic19 (2012).
5.
M.Carl, Towards a Church-Turing-Thesis for Infinitary Computations, 2013.
6.
M.Carl, Generalized effective reducibility, in: LNCS, Vol. 9709, 2016, pp. 225–233. ISBN 978-3-319-40188-1.
7.
M.Carl, Ordinal Computability: An Introduction to Infinitary Machines, 2019. ISBN 9783110496154.
8.
M.Carl, A Note on OTM-Realizability and Constructive Set Theories, 2019.
9.
M.Carl, L.Galeotti and R.Passmann, Realisability for Infinitary Intuitionistic Set Theory, 2020.
10.
M.Carl, P.Schlicht and P.Welch, Recognizable sets and Woodin cardinals: Computation beyond the constructible universe, Annals of Pure and Applied Logic169 (2015).
11.
B.Dawson, Ordinal Time Turing Computation, PhD thesis, University of Bristol, 2009.
12.
K.Devlin, Fundamentals of Contemporary Set Theory, 1st edn, Springer, New York, 1994.
13.
T.Fischbach, The Church-Turing-Thesis for Ordinal Computable Functions, Master’s thesis, Rheinische Friedrich-Wilhelms Universität Bonn, Bonn, 2010.
14.
S.Friedman, Fine Structure and Class Forcing, 2000. ISBN 9783110809114.
15.
L.Galeotti and H.Nobrega, in: Towards Computable Analysis on the Generalised Real Line, LNCS, 2017, pp. 246–257.
16.
R.Goodstein, Recursive Analysis, 2010. ISBN 9780720422122.
17.
N.Greenberg, J.Hamkins, D.Hirschfeldt and R.Miller, Effective Mathematics of the Uncountable, 2013, ISBN 9781107014510.
18.
J.Hamkins and A.Lewis, Infinite time Turing machines, Journal of Symbolic Logic65 (1998).
19.
W.Hodges, On the effectivity of some field constructions, in: Proceedings of the London Mathematical Society – Proc London Math Soc, Vol. 32, 1976, pp. 133–162. doi:10.1112/plms/s3-32.1.133.
20.
T.Jech, The Axiom of Choice, 1973. ISBN 9780720422757.
21.
R.Jensen and C.Karp, Primitive recursive set functions, in: Proc. of Symposia in Pure Mathematics, Vol. 13, 1971. ISBN 9780821802458. doi:10.1090/pspum/013.1/0281602.
22.
P.Koepke, Turing computations on ordinals, Bulletin of Symbolic Logic11 (2005). doi:10.2178/bsl/1122038993.
23.
R.Kuyper, On Weihrauch reducibility and intuitionistic reverse mathematics, The Journal of Symbolic Logic82 (2015). doi:10.1017/jsl.2016.61.
24.
A.Mathias, Provident sets and rudimentary set forcing, Fundamenta Mathematicae230 (2015), 99–148. doi:10.4064/fm230-2-1.
25.
P.Schlicht and B.Seyfferth, Tree representations via ordinal machines, Computability1 (2012). doi:10.3233/COM-2012-002.
26.
R.Shore, Reverse Mathematics, Countable and Uncountable: A Computational Approach, 2020.
27.
A.Zarach, Unions of ZF–Models which are themselves ZF–Models, in: Studies in Logic and the Foundations of Mathematics, Vol. 108, 1982. ISBN 9780444864659. doi:10.1016/S0049-237X(09)70519-1.