An important theorem in classical complexity theory is that , i.e., that languages decidable with double-logarithmic space bound are regular. We consider a transfinite analogue of this theorem. To this end, we introduce deterministic ordinal automata (DOAs) and show that they satisfy many of the basic statements of the theory of deterministic finite automata and regular languages. We then consider languages decidable by an ordinal Turing machine (OTM), introduced by P. Koepke in 2005 and show that if the working space of an OTM is of strictly smaller cardinality than the input length for all sufficiently long inputs, the language so decided is also decidable by a DOA, which is a transfinite analogue of ; the other direction, however, is easily seen to fail.
Ordinal Turing machines (OTMs), introduced by P. Koepke in [10] and independently by B. Dawson in [5], are a well-established and well-studied model of infinitary computability. Roughly, an OTM is a Turing machine with a tape of proper class size, whose cells are indexed with ordinals, and transfinite ordinal working time. When restricting the tape length of an OTM to ω, one obtains the Infinite Time Turing Machines (ITTMs), introduced in 2000 by J. Hamkins and A. Lewis ([6]) as the first machine model of infinitary computability. One attractive feature of OTMs when compared with ITTMs is that OTMs exhibit a symmetry between time and space. As a consequence, the complexity theory for OTMs resembles the classical theory much more than it does for ITTMs: For an ITTM, each input is of length ω, so that e.g. the ITTM-analogue of the class P of polynomial-time computable functions is just the class of functions computable with a constant time bound. The consideration of complexity theory for OTMs was taken up by B. Löwe in [12] and then continued by B. Löwe, B. Rin and the author in [4].
An important result in classical complexity theory is that the class of languages that are decidable by a Turing machine with double-logarithmic space bound coincides with the class of regular languages, see, e.g., [16]. A crucial point in the proof of this result is the characterization of regular languages in terms of deterministic finite automata (DFAs). In this paper, we consider an analogue of this theorem for OTMs. To this end, we introduce deterministic ordinal automata (DOAs) as a transfinite analogue of deterministic finite automata (DFAs). There are several transfinite variants of DFAs preceeding ours: Büchi introduced automata ([2]) that work like DFAs and process words of ordinal length by picking a certain element from the set of states that occured cofinally often during the processing at limit times. Similar models are considered in [15] and [9]. A common feature of all these models is that the set of states is finite, while the transition relation is given by a set. Moreover, Büchi automata work on languages consisting of words of a fixed maximal ordinal length (namely, ω). In contrast, we consider automata that operate on words of any ordinal length and allow class-sized transition relations that satisfy a certain mild coherence condition. It turns out that this condition suffices to carry over much of the theory of DFAs and regular languages (Section 2). To the best of our knowledge, this notion has not been considered before. We then proceed in Section 3 to define strictly space-bounded OTM-computations as those whose working space is of a strictly smaller cardinality than the input length and show that such OTMs in fact work with a constant space bound and that the languages so recognized can be recognized with DOAs. A brief account of DOAs based on this paper was given in [3], pp. 69 and pp. 256, where some of the below results, such as examples and counterexamples of -languages, were included as exercises. The present paper contains full results and proofs for the first time.
The paper is structured as follows: In the second section, we define deterministic ordinal automata (DOAs) (Definition 1), along with their non-deterministic analogue, the NOAs (Definition 5). We then develop the basic theory of these automata, mimicking the classical development: We show that DOAs and NOAs accept the same languages (Theorem 6) and a weak version of the Myhill–Nerode theorem (Theorem 10). Concerning the pumping lemma, we show a version of this in Theorem 12, but provide a counterexample to the most obvious transfinite analogue in Lemma 15. In Section 3, we introduce strictly space-bounded OTMs and prove our main result, Theorem 37, namely that any language that is decidable by a strictly spaced-bounded OTM is decidable by a DOA.1
We emphasize that, in contrast to the classical result, this is merely an inclusion, not an equality; in fact, we will see at the end of Section 3 that the reverse inclusion is false.
Notation
In the following, Σ denotes an alphabet, i.e., a finite set. We will denote by the class of sequences of ordinal length over Σ. When , we write for length of w, i.e., the order-type of w. Elements of will also be called Σ-words, or simply “words”, when Σ is clear from the context. When , are words, we denote by their concatenation; moreover, if is a sequence of words, we denote by the concatenation of all elements of the sequence in the order of appearance in s. When w is a word and P is an OTM-program, then denotes the computation of P with (an appropriate binary encoding of) w written to the input tape. As usual, we write when halts with output x and when the computation diverges.
We consider ordinal analogues of regular languages. In particular, we introduce notions parallelizing in the ordinal realm deterministic finite automata, nondeterministic finite automata and induced congruence relation. Though our generalization of a finite automaton may seem to be far too liberal at first, it preserves enough of the heart of the classical concept to make large parts of the classical theory go through, and in fact much for the same reasons.
We begin by introducing a notion DOA generalizing deterministic finite automata.
A deterministic ordinal automaton (DOA) is a quintuple where Σ is a finite alphabet, Q is a (possibly infinite) set, , and is a class function with the following property: For all , if is defined and , then is also defined and we have .
If is a DOA, then is the language accepted by .
If is such that for some DOA , then is .
A DOA is complete if and only if is defined for all and all .
Note that, by this definition, the transition relation D is an arbitrary class, only restricted by the ‘coherence’ or ‘forgetfulness’ condition in the definition; intuitively, by this condition, the automaton, when in a certain state, has no memory how it got there. It turns out that this rather weak condition seems to lie combinatorially at the heart of many results about regular languages. We demonstrate this by carrying over some of the main standard results in the theory of regular languages, along with their proofs. The classical counterparts can be found in any basic textbook on theoretical computer science, such as [8].
(i) The language is for the DOA with , , , , for , for and .
(ii) The language is not . To see this, suppose for a contradiction that is a DOA with and consider the words for . Since , must be defined for all . As , there must be such that . It follows that is defined, so that must also be defined and in fact identical to . But now, on the one hand, we have , while on the other hand , a contradiction.
(iii) Whenever is a set (for an arbitrary finite alphabet Σ), then S is . To see this, let , and let be the set of Σ-sequences of length at most α and , where ; moreover, for all , define and, for all , let
Then is a DOA that accepts S.
Note that there is a certain asymmetry between DOAs and OTMs: While OTMs have arbitrary transfinite resources available with respect to space and time, their programs are finite; in contrast, the transition function of a DOA – which roughly corresponds to the “program” of an OTM – can be a proper class. In particular, unless , it is not the case that all set-sized languages are OTM-decidable, in contrast to part (iii) of the preceding example.
For every DOA, there is a complete DOAsuch that.
This works as in the classical (finitary) case by letting and letting for all and all for which is not defined. □
We now consider analogues of non-deterministic finite automata.
An NOA is a quintuple with Q, , F, Σ as for DOAs and is a relation such that we have for all and all with that . Here, for and , denotes .
If is an NOA, then is the language accepted by .
We now imitate the classical power-set construction for simulating non-determinism by determinism in our setting.
The languages accepted by some NOA are exactly the languages in.
Clearly, all languages are accepted by some NOA, as every DOA is an NOA.
On the other hand, let where is an NOA. We construct a DOA as follows: Let , the power set of Q, , and define by , which is simply .
We claim that is indeed a DOA. So let , . We want to show that .
“⊆”: Let . Then there is such that . As is an NOA, we have , so . Let be such that . By definition of , we certainly have whenever . As and , we have . Now, since and , we have . Since q was arbitrary, this shows that .
“⊇”: Let . Then there is such that . Furthermore, there is such that . Thus . But now we have and therefore . As q was arbitrary, this shows that .
Hence is indeed a DOA. We finish by showing that :
“⊆”: Let . Hence , i.e. . Since , we have , hence .
“⊇”: Let . Hence . Thus , which implies , i.e. . □
is closed under complementation, union and intersection.
Let , be .
For complementation, consider by Proposition 4 a complete DOA for and let ; it is easy to see that .
For unions, let , be DOAs such that and . Assume without loss of generality that and are disjoint. Form an NOA by letting (where is neither contained in nor in ), and defining and for (). If or , then we also put in F. It is easy to see that . By Theorem 6, is .
Closure under intersection follows by de Morgan’s rules from closure under complementation and union. □
For, let,denote the subsequences consisting of the 0s and 1s in w, respectively. Then the following two languages are not:
(1) By Corollary 7, if was , then so was ; but we saw above in Example 2 that this is not the case.
(2) If was , then intersecting with would yield the ∞-regularity of . But now, essentially the same argument as the one used in Example 2(ii) shows that this is impossible. □
We turn to an analogue of Myhill–Nerode.
Let . The congruence relation on induced by is defined as follows: For , we have if and only if, for all , we have .
If there are only set many -equivalence classes (more formally: if there is a set X such that every is -equivalent to some element of X), we say that satisfies the ‘ordinal Myhill–Nerode condition’ (MN for short).
isif and only ifsatisfies MN.
Suppose first that is , and let, by Proposition 4, be a complete DOA such that . For let . Then, for each , the elements of are pairwise -equivalent: For and , we have , hence either both and belong to or neither does, i.e. . Since w was arbitrary, we have .
Thus, all the are subclasses of -equivalence classes and, as is complete, every belongs to one of the . Thus every -equivalence class is a union of some (at least one) , hence there are at most as many -equivalence classes as there are elements in Q, i.e. only set many.
Now let be such that has only set many equivalence classes on . Pick a representative from each equivalence class and denote by their collection; for , denote by the element of equivalent to w. We construct a DOA with as follows:
Let , (where ε denotes the empty word), .
Note that, for all , we either have or : For if we have , then , so .
Now define D by setting . It is easy to check that is as desired. □
We also get a rather straightforward analogue of the pumping lemma. As the proof is the same, we prove something slightly stronger, which is closer to one direction of an analogue of Jaffe’s theorem:
For and , let be the interval of w from index α to index β, with and included; similarly, define as that interval with excluded.
Moreover, let and .
Let S be,a DOA with. Letbe sufficiently long, more specifically. Then there aresuch that, for all,.
By the pigeonhole-principle and the fact that , there are such that . Let . Then for all , so , so . □
The proof clearly allows us to demand that v is ‘short’, i.e. that .
Note that the proof does not show that we can repeat also γ many times for : For example, a DOA with to states , may satisfy for any but also . In fact, this stronger version is false, as we will now show.
Let be the language consisting of those elements of that have a consecutive subsequence of the form , where .
is.
The DOA for deciding has two states and . is the only accepting state, is the starting state. The transition function sends to if and only if and to , otherwise and it sends to for any w. By noting that, whenever , we must have or , it is easy to see that this is a DOA that decides . □
There is a languagethat is, and for all α, there is someof lengthsuch that there are nowith,and.
Let be the complement of . Then is as the complement of a language that is . We claim that contains arbitrarily long words. This suffices, as by definition no word of the form can belong to .
It is easy to see that, if is generic over L for the forcing consisting of finite functions from α to 2 ordered by inclusion, then w will be as desired, as the set of conditions c that do not extend to any function where the ιth position starts a repetition of a word of length δ is dense for all . To avoid the sledgehammer of forcing and also the extra assumptions necessary to guarantee the existence of generic objects, we offer the following more direct construction:
Let us define the transfinite Morse-Thue-sequence as follows: , (where denotes the word that has 0s where s has 1s and vice versa) and for a limit ordinal λ, is defined as the union of , i.e., by letting for all . Now, let . We claim that there is no ι such that has a consecutive subsequence of the form , which is clearly much stronger than what we require.2
For the elements of finite index, this is well-known, see, e.g., the entry in the Online Encyclopedia of Integer Sequences https://oeis.org/A010060.
Let us denote by the sequence restricted to the indices ι up to ξ, (i.e., ) for . Clearly, for each , will either be (which is just the classical Morse-Thue sequence) or its “complement” . Thus, no finite subword of the form can appear in .
It is easy to see from the definition of that, for all , we have . Now suppose for a contradiction that, for some infinite word w, the substring appears somewhere in . Let be the length of w, and let us write α in Cantor normal form to the base ω as with and . Moreover, let ρ be the index at which the subword starts in for the first time and write ρ in the form with and . By elementary properties of ordinal addition, we have that for all .
Let be the word consisting of the elements of w with index of the form , . Then is also the sequence of elements of with indices of the form , with . Now for these i, so by observation (∗) above, this coincides with , which is thus a finite subsequence of consecutive elements of of the form , a contradiction. □
Thus, the transfinite analogue of the pumping lemma already fails for “ω-pumping”.
We now consider ∞-regularity for unary languages, i.e. languages over an alphabet with only one element.
All three proofs work by contradiction. For , let be a DOA with ; the start state is always denoted , the transition by D, the set of accepting states by F etc. In the following, + always denotes ordinal addition.
(1) As only has a set of states, but there is a proper class of cardinals, by the pigeonhole principle there must be such that . Now we have and . It follows that , a contradiction.
The proofs for (2) is similar, noting that is a power of ω for , while is never a power of ω.
For (3), consider the sequence given by , , for λ a limit ordinal. It is easy to see that for is always a square, while is never a square of an ordinal for . □
The languageconsisting of words of the form 1, 101, 101001,is not(here, ∘ denotes concatenation of words).
This is an easy consequence of Theorem 10: It suffices to note that no word in is -equivalent to any word of strictly smaller length, so that there is no set-sized class of representatives for the -equivalence classes.3
We thank our anonymous referee for pointing out this considerable shortening of our original proof.
□
For the main result of this paper in the next section, we will also need an ordinal version of ε-NFAs, see, e.g., [7], chapter 2.5.2.
Fix a special symbol ε. From now on, we will assume that Σ never contains ε. If , an ε-enrichment of w is defined as a sequence in in which the subsequence of elements of Σ is exactly w.
An ε-NOA with alphabet Σ is simply an NOA with the alphabet . If is an ε-NOA, then is the set of such that for some ε-enrichment of w.
When , we also say that there is an ε-transition from q to p.
A language is ε- if and only if there is an ε-NOA with .
A languageis ε-if and only if it is.
Clearly, if is , it is also ε-, as every DOA is also an ε-NOA (where all transitions for words containing ε are undefined).
On the other hand, let , where is an ε-NOA. Then we define an NOA as follows: For and , . It is easy to see that this defines an NOA and that . □
We also haven’t considered ordinal versions of Mealy [13] or Moore [14] automata, but we encourage the interested reader to do so.
Space-bounded OTMs
We now work towards our main result. To this end, we recall the definition of the space complexity of an OTM-program. This concept was introduced by Löwe in [12].
Let be a function and P an OTM-program. P belongs to if and only if there is an ordinal β such that, whenever w is a word of length , the computation of uses only the first many cells of the scratch tape.
A classical theorem in complexity theory is that, if the space usage s of a Turing machine T is such that, for any , we have on any input of length n for some , then T in fact has a constant bound on its space usage and hence decides a regular language. (See, e.g., [16], Corollary 2.1 for a proof of this; our proof will follow the guiding ideas of the classical proof and in particular make use of a modified version of crossing sequences.)
We work towards an infinitary version of this. In the following, let be a (class) function such that for all sufficiently large α, i.e. f ‘lowers cardinalities’. We will begin by showing that, if P belongs to for such an f, then P in fact belongs to , i.e. there is a uniform constant bound on the amount of cells P uses on the scratch tape.
Let P be an OTM-program, κ a cardinal, and let w be a 0-1-word of minimal length such thatuses at least κ many scratch tape cells. Then, and in fact.
Recall that the elementary hull of a set X in an ∈-structure M is the closure of X under Skolem function; in particular then, the cardinality of the elementary hull of X has cardinality .
of in (the set of sets hereditarily of cardinality less than , where denotes the cardinal successor of α). It is easy to see and well-known folklore (see, e.g., [3], p. 267) that a halting OTM-computation on an input of length α can only have a length of cardinality at most (∗). Thus, the computation of will take less than many steps and hence be contained in . Form the transitive collapse M of , and denote by the image of w under the collapsing map. Then and ‘ uses a set of scratch tape cells or cardinality at least κ’. Furthermore, we have , so the length of has cardinality at most κ. Since M is transitive, the computation of in M is the same as that in V. Thus uses already a set of scratch tape cells of cardinality κ. By minimality of δ, it follows that .
By (∗), we cannot have , so . □
Call an OTM-program P ‘strictly space-bounded’ if and only if there is a function such that for all sufficiently large α and P belongs to .
If P is strictly space-bounded, then P belongs to.
Assume otherwise, and let P be strictly space bounded, but not in . Pick f as in Definition 24, and let be such that for . Since P is not in , there is a word w such that uses at least many tape cells. Pick w of minimal length with this property. By Theorem 23, we have . By the choice of β, we have ; thus, as P belongs to , uses at most many tape cells, which contradicts the choice of w. □
Our final goal is to show that, for each OTM-program P with a constant use of scratch tape, there is an ε-NOA accepting exactly those words for which P halts. From now on, we will refer to the head on the input tape as the “reading head”, while “read-write head” will refer to the read-write head on the scratch tape.
The proof will use an ordinal version of crossing sequences, which are the main tool used in the proof of the classical result that , see, e.g., [16] for the proof or [1], pp. 185 for an introduction to crossing sequences. The guiding idea is the following: Let P an OTM-program such that halts and has constant space usage α for every word w over some alphabet Σ; moreover, P has two possible halting states, one of which we interpret as accepting, the other as rejecting w. For the sake of simplicity, we assume in the following overview that the length of w is a limit ordinal . Then w can be decomposed into blocks of length ω, say with , which we will call the ω-blocks of w. By slight abuse of notation, when w is stored on the input tape of an OTM, we will also use the term ω-block to refer to the tape portion on which the respective part of w is stored and say, e.g., that the reading head is inside a certain ω-block. Now, with respect to the input tape t containing w, the computation works as follows: The head starts on the first cell of t; it then moves through some initial portion of the ω-blocks from left to right; unless the computation halts during this procedure, the head will eventually be reset to position 0 again (the crucial observation is that the head can only move from left to right through the ω-blocks, as the only possibility to move left in the ω-blocks is a reset). Let us call the portion of the computation between the ιth and the th reset the ιth “run”, where we define the start of the computation as the 0th reset. We will see below (Lemma 28) that the number ρ of runs has an upper bound that depends only on the working tape length α. We can thus imagine (see Fig. 1 below) the computation of to be organized in a -table, where the -th entry consists of the operations of P while the reading head was in the ι-th ω-block in the ξ-th run (or a special “blank” symbol ∗ if the computation does not reach the ι-th ω-block in the ξ-th run or if there is no ξ-th run). Thus, will be a sequence of machine states in the execution of P, which means that we need to specify (i) the position of the reading head relative to the start of the ι-th ω-block (which can be represented by a single natural number), (ii) the position of the read-write head on the scratch tape, i.e., an ordinal strictly smaller than α, (iii) the content of the scratch type, which is an element of , i.e. a 0-1-sequence of length α and (iv) the inner state of the computation, which can be encoded by another natural number. As we will see below in Corollary 29, the length of each also has an upper bound θ that only depends on α; and consequently, there are only set many possible candidates c for sequences of the form .
Roughly, then, the idea is to take these candidates, i.e., sequences of length ρ of sequences s of machine states of length smaller than θ, as the states of our NOA; such a sequence is accepting if and only if the accepting state is assumed in any of the machine states contained in the one of its elements.
How do we define the transition relation? Well, suppose we are given a sequence and a word v.
A transition between the sequences and . Red lines indicate borders between the ω-blocks of w.
Thus, we assume that s is a sequence of partial computations while the reading head was in a certain ω-block, reading a word ; and now we want to see how the computation would continue if was supplemented to the right by v; more precisely, we are interested in the behavior of P when the reading head enters the final ω-block of v. That, however, is easily done: For every run, i.e., every , we consider . From this sequence, it is easy to determine whether the head leaves to the right or to the left due to a reset. Let us assume that the former is the case. Then will not have a last element (clearly, the head can only leave an ω-block to the right at a limit time), but from , we obtain the machine state right after leaving by the liminf-rules. Now, knowing both the machine state and the relevant part of the input word (namely, the part to the right of ) from v, we can just simulate the computation up to the point where the head enters the last ω-block of v. The partial computation taking place while the reading head is on this last ω-block is then the ν-th component of the state to which the transition from s via v should lead us; in particular, if a reset of the reading head on the input tape takes place during this run, we expect to be ∗ (and likewise when the computation stops before reaching this ω-block).
The above description should give the reader a fairly good idea of what is going on in the following construction; there are, however, various subtleties that need to be handled, making things somewhat more technical. First, there is the difference between leaving an ω-block due to a reset and to the right, which leads to special cases in the transition relation, forcing us to distinguish between the 0-th ω-block and the others. Second, v need not be of limit length, in which case the information given by v does not suffice to determine the behavior of P in the last ω-block of v, as only a finite initial amount of the necessary bits will be available. We solve this as follows: Let , where is empty or of limit length and is finite. Associated with a state will be both and a function , which will serve as a “guess” for the complete ω-block of w started by ; clearly, we require that is an initial segment of g and moreover that the associated partial P-computations are compatible with g being the relevant part of the oracle. Now, if a state s is given by a sequence c of partial computations, a finite 0-1-function f and an extension of f, and we want to determine the transition from s via a word u, we first determine whether is an initial segment of g; if not, we go to a special non-acceptance state, all of whose transitions lead back to itself. Otherwise, if u is finite, we leave s and g unchanged, but replace f with ; but if with of limit length and finite, the states that we can reach from s via u are given by those , and for which (i) is a sequence of partial computations in the oracle (with the reading head never leaving ), (ii) is equal to and an initial segment of and (iii) when continuing the partial computations in s by using as the input, we arrive at the initial configurations in the corresponding partial computations in when the no reset takes place before the reading head reaches the final ω-block of , and if a reset takes place, then has an ∗ in the respective place.
This concludes our overview; we will now carry out the above plan in formal detail.
Let P be an OTM-program with scratch tape use bounded above by γ, and let be a word. For , the interval is called the ι-th ω-block of w. (Recall the abuse of notation introduced above.)
The 0-th run of is the partial computation of between the start of the computation and the first time that the reading head is moved to the left from a limit position, resulting in a reset to position 0. The ι-th run is then the partial computation between the ι-th and the -th such reset.
A P-snippet is a quintupel , where is an inner state of P, is a candidate for a scratch tape content of P, is a candidate for the position of the read-write head on the scratch tape, is a candidate for the relative position of the reading head in the current ω-block on the input tape (i.e., if that position is ν, then i is the unique element of ω such that for some ordinal ξ) and is a candidate for the first i bits of the input word in that ω-block.
A P-guess is a sequence of P-snippets.
Let , and let δ be an ordinal. The interval is called the βth ω-block of w. The block-crossing sequence of length δ for associated with β is the sequence such that, if has a ξ-th run, then is the P-guess that describes the operations of P while the reading head is in the β-th ω-block during the ξ-th run; if the reading head does not enter the β-th ω-block during the ξ-th run or if there is no ξ-th run, then . Intuitively, a block-crossing sequence consists of those parts of the computation of that occur while the reading head is in a certain ω-block.
A δ-state is a quadruple , where , s is a δ-sequence of P-guesses, f is a map from a finite ordinal to and is such that f is an initial segment of g and each P-guess in s is a partial P-computation in the oracle g in which the reading head does not leave the portion on which g is written. When , we call t an initial state, otherwise, t is a transitional state.5
The idea here is that initial states are guesses for block-crossing sequences associated with the first ω-block of the input word, while transitional states are guesses for block-crossing sequences associated with the other ω-blocks of the input word.
We will eventually construct the desired NOA whose states will be the δ-states, i.e., the possible candidates for the block-crossing sequences, amended with some extra information. We begin by showing that the possible block-crossing sequences for a halting OTM-program P with constant space usage form a set. (Note that this is not trivial, since the possible inputs are a proper class.) Fix a program P that has constant scratch tape use bounded by γ, and assume without loss of generality (by increasing γ if necessary) that . Then g each P-snippet is an element of . Thus, there is only a set T of quintuples that can possibly occur in a P-guess. It remains to control the length of a P-guess, and the required number of P-guesses in a state, i.e., the number of runs of . These are our next goals.
We start by recalling the following looping criterion for infinitary machines, which can e.g. be found in [6] for the case of ITTMs:
Let P be an OTM-program. Suppose that, in the computation of P, a configuration c is repeated such that, for all times between the two occurences of c, every other configuration had all components (tape contents) at least as large as the corresponding component in c. Then P never halts.
After arriving at c for the second time, the same steps will be repeated, so c will occur a third, fourth etc. time. The only way to escape the loop would be at a limit time. However, the condition above ensures that the configuration at any limit time which is preceeded by cofinally many occurences of c will be c as well. □
Letbe such that. In the course of a halting computationwith scratch tape use bounded by γ, the reading head on the scratch tape will be positioned on the 0-th cell less thanmany times.
First, note that the sequence of machine configurations (i.e. the inner states and the scratch tape contents) occuring at times when the reading head is positioned at is continuous in the sense that for limit ordinals . This is due to the behaviour of OTMs at limit stages and in particular the fact that the reading head position at limit times is the inferior limit of the sequence of earlier reading head positions, i.e. at a limit of times at which the reading head was on , the reading head will again be at position 0.
Now assume otherwise and consider the sequence of the first many such configurations. Let be the index in the computation of at which the reading head is on the 0-th cell for the th time.
If every configuration occured only boundedly often before this time, then for each such configuration, the set of suprema of the indices of their occurences would be majorized by some , and the set of these ι would be a cofinal subset of of cardinality , contradicting the regularity of the successor cardinal .
The same argument shows that there is some such that all configurations occuring after time α in the computation occur cofinally often before . Again by the regularity of , there is which is simultaneously for each of these configuration a limit of the indices at which these configurations occur and a time at which the reading head is on the 0-th cell. As , will occur cofinally often below . Also, by the liminf-rule, will in each component be less than or equal to every other configuration occuring cofinally often before . But this implies by Proposition 27 that the computation is strongly looping after the first two such occurences of , which contradicts the assumption that halts. □
In the course of the computation of a halting computationwith scratch tape use bounded by γ, there will be at mostmany disjoint time intervals at which the reading head is in the β-th ω-block.
Note that, by the rules for moving the reading head, an ω-block other than the 0-th ω-block can only be reached “from the left”, while the 0th block can only be entered when the reading head is moved to the left from some limit position. Therefore, whenever the reading head is in the 0-th ω-block, it must have been on before without leaving the 0-th ω-block in the meantime. Thus, the 0-th ω-block can only be visited less than many times by Lemma 28.
As any other ω-block can only be entered from the left, the reading head must have been in the 0-th ω-block before entering such an ω-block anew, thus the same holds for all other ω-blocks. □
It remains to control how long the reading head can remain in an ω-block.
In the course of a halting computationwith scratch tape use bounded by γ, a time interval in which the reading head remains in the same ω-block can have no more thanmany elements.
Suppose for a contradiction that the reading head remains in some ω-block for many steps. Shifting the time interval if necessary, we assume that this starts at computation time 0. If every of the possible ω many reading head positions in this episode occured only boundedly often, then, by regularity of , there would be a common upper bound for the suprema of the times at which any head position occured, a contradiction. Thus, there is such that every reading head position occuring after time α occurs cofinally often before time . Let k be the minimal element of the set of these positions. Then the same argument as for Lemma 28 shows that the reading head can be on position k at most many times during the interval. But this leads to a cofinal subset of with cardinality at most , a contradiction. □
We now have the desired bound on the length of P-guesses that can actually occur as partial computations of P while the reading head is in a certain ω-block, as well as the number of visits to an ω-block:
Let P be an OTM-program whose scratch tape use is bounded by γ on every input. A relevant P-guess is a P-guess of length less than . A relevant P-guess sequence is a sequence of relevant P-guesses of length less than .
There are at mostmany relevant P-guesses.
There are at mostmany relevant sequences of P-guesses.
There are at mostmany-states for P.
(i) Since the set of P-snippet is a subset of by definition, there are at most many P-snippets. Consequently, by Lemma 30, there are at most relevant P-guesses.
(ii) Given (i), the number of relevant P-guess sequences is bounded above by
(iii) We have , so that the number of -states is bounded by . □
We will now construct an ε-NOA that accepts exactly those words for which halts in the accepting halting state. As announced above, in the definition of the states of this NOA, the -states, i.e., the possible candidates for block crossing sequences, amended by a candidate for an oracle portion of length at most ω, will play an important role.
We assume that the input word w has a mark rh to its right and that P notices (e.g., via special states) when rh is reached. Also, we assume without loss of generality that never moves the reading head to any position to the right of rh and only halts with the reading head on the cell containing rh; clearly, this can be arranged by changing P to move its head to the right up to rh once w has been accepted or rejected. Formally, this means that both our OTMs and our automata work over the three-element alphabet rather than merely over ; but in order not to clutter the already somewhat technical proof with additional subtleties, we will ignore this point in the following.6
If one wanted to be strict here, one could, e.g., replace 0 by 01, 1 by 10 and then take 11 to represent rh, thus reducing everything to rh again; it is not hard to see that this has no influence on OTM-decidability with constant space bound or ordinal regularity.
We need to note, however, that adding rh at the end of each word does not influence the property of being :
Let Σ be an alphabet,,, and let. Thenisif and only ifis.
Let be , and let be a DOA that accepts . Pick and define to be , where . Then is a DOA that accepts .
Conversely, suppose that is and let be a DOA that accepts . Then we obtain an ε-NOA accepting by replacing all σ-transitions in by ε-transitions, i.e., by setting . □
Let P be an OTM-program such that halts in one of two halting states , for every input w and has constant space usage bounded above by . The ε-NOA associated with P, denoted by , is then defined as follows:
Let be the set of initial -states for P, let T be the set of transitional -states for P, and let . An initial -state is called proper if and only if the first element of s is the initial configuration of P in which P is in its starting state, both heads are on position 0 and the scratch tape is empty; let I be the set of proper initial -states. Then the set of states of is given by . A state is accepting if and only if and the sequence of P-guesses in s contains a P-snippet in which the inner machine state is or , , and satisfies this condition. The initial state of is . r is taken as a special rejection state.
Now we define the transition relation . For any , we will define a separate transition relation on the subset . We have for all words w. There will be no transitions between and unless . The transition relation will then be given by taking the union and adding for and the ε-transitions .
Now, for any , any and any word , is defined as follows:
If and w is finite, then if is an initial segment of g, and , otherwise.
If and with limit and and :
If g is not an initial segment of w, then .
Otherwise, for each such that , we run the computation starting in the initial configuration of the first element of until either the reading head enters the last ω-block of or is reset due to moving left from a limit position.
If a reset takes place and the machine configuration after the reset is different from the first element of , then . (This means that the guess of the initial state is incompatible with the input.)
Otherwise, will be a pair with an initial segment of and is a sequence of partial computations of P in the input such that the reading head does not visit any cells besides those containing in the course of its computation; it remains to determine for all :
If , then .
If a reset takes place and the machine configuration after the reset is equal to the first element of , then .
If no reset takes place, then is the partial -computation c that starts in the first configuration in which the reading head enters the last ω-block of when starting in the first configuration of and is maximal in the sense that continuing c would move the reading head out of this ω-block.
If and w is finite, then if is an initial segment of g, and , otherwise.
If and with limit and and , and :
If g is not an initial segment of w, then .
Otherwise, for each such that , we again run the computation starting in the initial configuration of the first element of until either the reading head enters the last ω-block of or is reset due to moving left from a limit position.
If a reset takes place and the machine configuration after the reset is different from the first element of , then . (This again means that the guess of the initial state is incompatible with the input; note that, in order to determine this, it is necessary to store the initial configuration in the states.)
Otherwise, is defined exactly as in (c)–(e) above.
Let P be an OTM-program P such that, for someand all,halts and uses less than γ many scratch tape cells. Thenis an ε-NOA.
Let . All parts of the definition of an NOA except the coherence condition for D follow straightforwardly from the definition of . So let , and let . We need to show that . Let I denote the set of initial states and T the set of transitional states of . We distinguish the following cases:
First, we shall assume that .
Case 1: At least one of , is finite. Let when q is initial or when q is transitional. For the sake of simplicity, we assume that q is transitional; the case that q is initial works analogous.
If is finite and with either 0 or a limit ordinal and finite, then when is an initial segment of and , otherwise. In the first case, it is clear that the D-successors of under will coincide with the D-successors of under . In the second case, all further transitions will just lead back to ; moreover, will be incompatible with , so we will have , as desired.
If is finite and with limit and , then is a set of states of the form , where . From these, one obtains the elements of by taking the subset of those states satisfying that is an initial segment of and replacing with ; exactly the same will yield the elements of .
From now on, we assume that both and are infinite.
Case 2: q is an initial state; let . We show that . This suffices, since, by definition, will only consist of transitional states of the form (and possible r). The r-state is easily dealt with, so let us consider only the transitional states . On these elements, and D coincide, so that we will have . Let with limit and and similarly with limit and .
⊆: Let , ; this means that is a sequence of elements that are either ∗ or partial P-computations in the oracle starting in the configurations in which the reading head enters the last ω-block of when starting in the respective configurations in s. Let be the sequence of configurations occuring after the reading head had first read in such a computation, and let denote the ω-block of in which the first element of occurs. Then , and moreover, we have . Hence .
⊇: Now let , and let denote the first ω-block of . Now, will contain some element of the form , and for this element we will have by the definition of , as correctly guesses how the computations of will proceed with the reading head on the ω-block of containing the first element of . But then, also correctly describes the actions of the computation at these time intervals, and so is a candidate for how this computation might proceed after having read , i.e., an element of , as desired.
Case 3: q is a transitional state; let .
This works similar to Case 2, with the extra observation that each reset of the reading head that takes place while is processed either takes place while is processed or while is processed after has been processed, so that the extra clauses of Definition 34 taking care of resets are satisfied.
Finally, we consider .
Case 4: . In this case, by the Cases (1) and (2), we have .
Now, let . Then, by definition, we have if and only if there are and such that . By definition of D, we have ; it follows that , so that the last claim is equivalent to . Hence . □
Let P be an OTM-program such that, for someand all,halts in one of two halting statesandand uses less than γ many scratch tape cells. Thenaccepts exactly those words for whichhalts in the state.
Let .
Suppose first that terminates in the accepting state (with the reading head on rh, by definition). Let s be the state of that correctly describes the actions of with the reading head in the ω-block containing rh. Then and moreover, we have ; thus, accepts w.
Conversely, suppose that accepts w, so that contains an accepting state s. Assume without loss of generality that s is a transitional state, the case that s is initial being similar; so . Write , where all with the possible exception of have length ω. For each , contains a unique element of the form . But now, by definition, the sequence describes a halting computation of that terminates in the accepting state; thus, w is accepted by P. □
We now obtain our main result:
Let P be an OTM-program such thathalts and the scratch space usage of the computation ofis bounded by a constant γ for every. Then there is an ε-NOAsuch that.
Putting Corollary 38 and our negative results on together, we obtain:
None of the following languages is semi-decidable by a strictly space-bounded OTM:
This shows in particular that strictly-space bounded OTMs are strictly weaker than e.g. linearly space-bounded OTMs, as there is obviously an OTM-program in that decides by simply writing the 0s to the scratch tape, then going back to the start of the scratch tape and replacing 0s by 1s.
We cannot expect to turn Corollary 38 into a strict equivalence, as Proposition 16 shows that any subset of is , while not every such set will in general be OTM-computable.
Conclusion and further work
We have defined a transfinite version of regularity that generalizes the classical version in a rather straightforward manner. As it turns out, the theory is in large parts parallel to the classical theory, even though the relevant realm is considerably extended. As a by-product, our generalization allows us to see more clearly which structural features of regular languages play a role in proving their basic properties. A somewhat weak spot is the lack so far of a satisfying analogue of the pumping lemma that allows for “transfinite pumping”. Whether such an analogue exists and what the exact formulation should be is currently an open question.
Another question is whether there is any way to get an equivalence result from Corollary 38. One approach might be to strengthen the notion of an OTM: In his Master’s thesis [11], E. Lewis7
Not to be confused with A. Lewis, one of the authors of [6].
introduced OTMs with infinite programs, in which programs are (possibly infinite) sets of Turing commands and the states are indexed with ordinals; computations are then defined exactly as for OTMs with finite programs. One of the results of [11] is that these ‘Lewis machines’ are equivalent to OTMs with a (set-sized) oracle. The definitions of space complexity, strictly space-boundedness etc. clearly apply to Lewis-machines. Another approach might be to consider more restrictive versions of regularity, where the DOA is required to be OTM-computable within certain complexity restrictions. Whether these approaches will lead to a non-trivial result will be taken up in future work.
Given the fact that regular languages play an important role in computer science since there are many natural examples of regular languages, it will be interesting to see whether there are examples of -classes corresponding to natural classes in mathematics. If so, could serve as a complexity measure for such objects.
It is then natural to ask for transfinite generalizations of further stages of the Chomsky hierarchy, which we plan to take up in future work.
Footnotes
Acknowledgements
We thank our two anonymous referees for detailed and extensive comments that greatly helped to improve the presentation of the paper.
References
1.
D.Bovet and P.Crescenzi, Introduction to the Theory of Complexity, Vol. 46, 1994. ISBN 978-0-13-915380-8. doi:10.2307/2584070.
2.
J.Büchi, Decision methods in the theory of ordinals, Bulletin of The American Mathematical Society71 (1965). ISBN 978-1-4613-8930-9. doi:10.1090/S0002-9904-1965-11384-2.
3.
M.Carl, Ordinal Computability: An Introduction to Infinitary Machines, 2019. ISBN 9783110496154.
4.
M.Carl, B.Löwe and B.Rin, Koepke machines and satisfiability for infinitary propositional languages, 2017, pp. 187–197. ISBN 978-3-319-58740-0. doi:10.1007/978-3-319-58741-7_19.
5.
B.Dawson, Ordinal time Turing computation, PhD thesis, University of Bristol, 2009.
6.
J.Hamkins and A.Lewis, Infinite time Turing machines, Journal of Symbolic Logic65 (1998).
7.
J.Hopcroft, R.Motwani and J.Ullman, Introduction to Automata Theory, Languages, and Computation, 2006.
8.
J.Hromkovic, Theoretische Informatik, 2014. ISBN 978-3-658-06432-7. doi:10.1007/978-3-658-06433-4.
9.
M.Huschenbett, A.Kartzow and P.Schlicht, Pumping for ordinal-automatic structures, Computability6 (2017), 125–164. doi:10.3233/COM-160057.
10.
P.Koepke, Turing computations on ordinals, Bulletin of Symbolic Logic11 (2005). doi:10.2178/bsl/1122038993.
11.
E.Lewis, Computing with infinite programs, Master’s thesis, Universiteit van Amsterdam, 2018.
12.
B.Löwe, Space bounds for infinitary computation, 2006, pp. 319–329. ISBN 978-3-540-35466-6. doi:10.1007/11780342_34.
13.
G.Mealy, A method for synthesizing sequential circuits, Bell System Technical Journal34 (1955), 1045–1079. doi:10.1002/j.1538-7305.1955.tb03788.x.
14.
E.Moore, Gedanken-experiments on sequential machines, 1956, pp. 129–153. ISBN 9781400882618. doi:10.1515/9781400882618-006.
15.
P.Schlicht and F.Stephan, Automata on ordinals and automaticity of linear orders, Annals of Pure and Applied Logic164 (2013), 523–527. doi:10.1016/j.apal.2012.11.007.
16.
R.Stearns, J.Hartmanis and P.Lewis, Hierarchies of memory limited computations, 1965, pp. 179–190. doi:10.1109/FOCS.1965.11.