Allender, Friedman, and Gasarch recently proved an upper bound of for the class of decidable languages that are polynomial-time truth-table reducible to the set of prefix-free Kolmogorov-random strings regardless of the universal machine used in the definition of Kolmogorov complexity. It is conjectured that in fact lies closer to , a lower bound established earlier by Buhrman, Fortnow, Koucký, and Loff. It is also conjectured that we have similar bounds for the analogous class defined by plain Kolmogorov randomness. In this paper, we provide further evidence for these conjectures. First, we show that the time-bounded analogue of sits between and . Next, we show that the class obtained from by imposing a super-constant minimum query length restriction on the reduction lies between and . Finally, we show that the class obtained by further restricting the reduction to ask queries of logarithmic length lies between and .
The word “randomness” is used in at least two different contexts in computation theory: One is in the theory of Kolmogorov complexity, which measures randomness of a finite string in terms of incompressibility. The other randomness refers to coin flips used for efficient computation, and gives rise to (among others) the complexity class . Despite the remote origins of these two notions of randomness, some interesting links between them have been found in recent work by Allender et al. [2–5]: they conjectured, and gave some evidence, that is characterized as the class of languages truth-table reducible to the set of Kolmogorov random strings, that is, those decidable in deterministic polynomial time by asking the oracle non-adaptively whether a string is random. Our aim is to strengthen this connection between Kolmogorov randomness and .
Let denote the Kolmogorov complexity of a string , i.e., the length of the shortest description of x when a fixed universal machine U is used as a decoder (see Section 2.1). Depending on whether or not the machine is required to be prefix-free, this complexity is called the prefix-free or plain complexity, and for the former it is customary to write instead of . We consider the set (or ) of random strings, i.e., those that have no description shorter than themselves: .
Buhrman, Fortnow, Koucký, and Loff [7] showed that every language in reduces to the set of random strings via polynomial-time truth-table reducibility (denoted in the following theorem, see Definition 2.1), regardless of whether the machine defining the random strings is plain universal or prefix-free universal:
For every language, we havefor any universal machine U, andfor any prefix-free universal machine U.
Theorem 1.1 gives a lower bound of for the class of languages that reduce to or , but this bound is obviously not “tight,” as the sets and are undecidable [10]. However, we obtain an interesting upper bound if we take the intersection over all universal machines U. That is, let be the class of languages L such that for all universal prefix-free machines U. Surprisingly, Cai, Downey, Epstein, Lempp and Miller [8] showed that every language in is decidable, and moreover Allender, Friedman, and Gasarch [5] showed:
The conjecture is more plausible when we consider time-bounded prefix-free Kolmogorov complexity. Allender, Buhrman, Friedman, and Loff [2] defined the classes called and , which are time-bounded versions of , and showed the following (“” means advice strings of length α [6, Definition 6.16]):
for any nondecreasing unbounded computable function.
.
Less is known about the analogous classes , , for plain Kolmogorov complexity (see Definitions 4.4 and 3.1). We know that these classes all contain by Theorem 1.1 or by a similar argument. For the upper bound, Allender et al. [5] conjecture that (similarly to Theorem 1.2), but in reality we do not even know whether equals all decidable languages. The techniques used by Allender et al. [2,5] to prove Theorems 1.2 and 1.4 make use of the coding theorem and thus cannot be directly applied to (see the discussion in [5, Section 5]).
In Section 3, we prove the same inclusions for and as Theorem 1.4 states for prefix-free complexity.
In Section 4, we consider a weaker reduction (see Definition 4.1) in which we cannot query a string of length less than , where n is the input length. We prove that for the corresponding classes and , the upper bound can be improved to (while the lower bound remains true).
In Section 5, we restrict the reduction further, allowing only queries of exactly logarithmic length. This defines the subclass , which still contains . We prove that . The upper bound of comes from a 1-round game played between two players. Together with the lower bound, this gives an alternative explanation of the well-known inclusion .
In view of this upper bound of , we believe that the following characterization of is quite likely, and propose this as a step towards Conjecture 1.3.
.
Preliminaries and notations
We regard the set of strings as equipped with the length-increasing lexicographical order. For a set of strings and a natural number , , , and are defined as , , and , respectively. We abbreviate as . For a set A, denotes the complement of A.
For functions , we write if for all . A nondecreasing and time-constructible function is called a time bound. For a nondecreasing and unbounded function , define as . Then holds because and .
We assume that the reader is familiar with basics of complexity theory [6].
In this paper, we mainly discuss truth-table (i.e., non-adaptive) reductions to the set of random strings.
For languages , we write if there exists a polynomial time Turing machine M that, on input x, outputs an encoding of a circuit λ and a list of queries so that .
Kolmogorov complexity
We review some basic definitions and facts about Kolmogorov complexity. For details, see Li and Vitányi [10].
The Kolmogorov complexity of a string x on a Turing machine U is defined as the length of the shortest description d of x when using U as a decoder. We also consider the time-bounded version: for a time bound t, we write for the length of the shortest input d that causes U to outputs x in at most time steps (note that time is measured in terms of the output length).
As mentioned at the beginning, it is sometimes required that the domain of U is prefix-free, i.e., if is defined, then is undefined for any proper prefix of d. In this case, it is customary to write instead of (and call it prefix-free complexity).
We will be interested in the set of random (or incompressible) strings where , , etc.
Of course, Kolmogorov complexity, and hence the set of random strings, depend on U. It is therefore important to use the “best” machine U in the following sense:
A Turing machine U is said to be universal if for each Turing machine M, there exists such that for any , .
It is known that there exists a Turing machine U which simulates every Turing machine M (see [6]), and it is not hard to see that U is universal. Thus, we usually fix one such machine U and discuss complexity with respect to it.
For prefix-free Kolmogorov complexity, the Turing machines U and M in Definition 2.2 are required to be a prefix-free machine.
For time-bounded Kolmogorov complexity, some slowdown is needed when a universal Turing machine U simulates each machine M. Therefore, we introduce a parameter , which means that U must simulate M within steps if M halts within s steps. This is the notion of f-efficient universality:
For a time bound , a Turing machine U is said to be f-efficient universal if U is universal, and there exists a constant for each Turing machine M, such that for any time bounds t and , for all but finitely many x, implies . We say that U is time-efficient universal if it is p-efficient universal for some polynomial p.
Time-efficient universality means that the machine U is allowed to simulate each machine with a polynomial slowdown.
Bounds for time-bounded Kolmogorov complexity
Our goal for this section is to prove an analogue of Theorem 1.4 for plain Kolmogorov complexity. First, let us define the classes and by using time-efficient universality and f-efficient universality, respectively.
contains all the languages L such that for all sufficiently large t and for any time-efficient universal Turing machine U, . In short, .
contains all the languages L such that for each computable function f, there exists such that for all and any f-efficient universal Turing machine U, .
It is easy to see that by letting . It should be noted that the motivation of [2] for allowing such a slow prefix-free universal machine comes from the fact that it is not clear how to define time-efficient coding schemes for prefix-free Kolmogorov complexity (see [2, Section 5]). Here we adapted their definitions to the case of plain Kolmogorov complexity, and show the following:
for any nondecreasing unbounded computable function.
.
We have because this was proved in [2,7] regardless of whether the Kolmogorov complexity defining the classes is plain or prefix-free. All we have to show is that .
For simplicity, we show that for any function α that satisfies the stated property. Assume, by way of contradiction, that . Then holds for some time bound and some Turing machine U. Since is decidable, L is also decidable. Let denote a time bound such that it is decidable whether or not in time .
Given any time bound , we will show that there exist a time-efficient universal Turing machine U and a time bound such that . This means that , which contradicts the assumption. In order to show it, a time-efficient universal Turing machine U is constructed by combining two Turing machines, and M. The following lemma states that if is time-efficient universal, then U is.
Letbe a time-efficient universal Turing machine. Suppose that for a time bound t, M is a Turing machine such that, for all but finitely many x, M either halts insteps or does not halt. Then there exists a Turing machine U such that for some,for all large. Moreover, U is again a time-efficient universal Turing machine.
Define U as follows: On input , U runs on input d. Suppose that outputs x after s steps. Then U outputs x after it waits steps. On input , U outputs . Otherwise U does not halt. Note that the steps to calculate can be counted by adding some tapes to U.
One may think of p as by carefully constructing U so that U halts in exactly steps when halts in s steps. Here we avoided such a technical argument that depends on computational models.
it takes for U to halt on input , where s denotes the number of steps to calculate . Note that includes the steps it takes to run , to compute , and so on. We can assume that and does not depend on d, and that is again a time bound.
Then in at most steps if and only if in at most steps for any time bound t. One can verify that the lemma follows by this property. □
Fix a standard time-efficient universal Turing machine for reference. A Turing machine M will be defined later. Let U be the time-efficient universal machine defined in Lemma 3.3 for some time bound t fixed later in Claim 2. In order to derive a contradiction, M is constructed by a diagonalization argument so that it fools each Turing machine that computes a truth-table reduction . It should be noted that, in Lemma 3.3, M is allowed to run in steps whereas runs in at most t steps; hence, the output of M can be defined depending on the output of for a fixed output length. In the course of computation of M, M holds an approximation of denoted by R. More precisely, R is first set to be , always holds, and increases as calculation proceeds. Since is finite, we can store it. Note that, by the property of Lemma 3.3, except for finitely many strings.
Let us efficiently enumerate all polynomial-time Turing machines that compute polynomial-time truth-table reductions, namely . This can be done by putting a clock of on and by regarding the output of as a circuit and queries (see Definition 2.1). Since is out of our control, we will play a game against in order to make the output of the circuit differ from . As we will show, we can always do so for some x; otherwise L is equal to a winning side, and thus , which contradicts the assumption. This idea is originally due to Allender et al. [5]. Now let us precisely define the game for some and some x.
Description of the game. The game is played between two players, the YES player and the NO player. At the start of the game, randomness of the strings of length less than is fixed and each player does not disturb anymore. (More formally, compute and let .) During the game, R is going to be altered by moves of the players. The current value of the game, denoted by , is defined as . Note that is equal to the value of regarded as a truth-table reduction to R.
Let , where l denotes a variable of the algorithm of M. The game has rounds from to l. In the rth round, where , the YES player first decides such that . This means that the YES player declares that each element of is not random. Next, the NO player decides such that . Then let at the end of the round. After all the rounds end, the YES player wins if and the NO player wins otherwise. Since the game is finite and deterministic, exactly one of the players has a winning strategy. Let denote which player is going to win the game .
The side plays the game optimally according to a winning strategy against . Suppose that the player of the side chooses in the rth round and , where d is the input of M. Let , where . If, for , the input d is the ith string in in the lexicographical order, then M outputs and halts. This makes not random since and . If not, M does not halt, and continues computation.
On the other hand, plays the role of the opponent of the side. The opponent always moves in the rth round. Note that holds (see [10, Definition 2.2.1 and Theorem 2.2.1]).
The overall algorithm of M is shown in Algorithm 1. Note that what M outputs is determined by a move of the player who has a winning strategy.
Algorithm of M
Analysis. Let us move on to analysis of the algorithm. We first claim that there always exists a “witness” such that .
M can always find a string x such thatand.
Suppose that M fails to find such a string x at some point, which implies that for all x with . We present an algorithm to decide the language in . On input x, let the advice give the truth-table of . This can be encoded with bits. We know the value by the length of the advice. Thus, this advice enables us to construct the game . Then the winning side, , can be computed by the standard minimax algorithm. Note that there exist at most exponentially many moves since a move Z is meaningful only if , where k is bounded by a polynomial in .
Since for all sufficiently large x, we can conclude that . This, however, contradicts the assumption. □
Next, we fix a fast-growing time bound t so that M halts within steps.
There exists a time bound t such that M halts withinsteps for all largeif M outputs z.
Suppose that M halts at the rth round of the game . Note that . M has computed so far. It takes time steps. M has also calculated and for each . It takes time since we chose x so that in Algorithm 1. M needs to compute for each . Let denote the number of steps to compute .
The length of x is bounded by since . Then the number of overall time steps is roughly less than
The second term is linear in , and the other terms do not depend on t. Thus t can be defined as a sufficiently fast growing function so that (1) is less than for all large r. □
Claim 2 ensures that R converges to as time passes. Finally, we claim that each Turing machine is indeed fooled.
.
After the game has been played out, since the side wins, holds at this point. Let denote the value of l in the eth outer loop before the inner loop in Algorithm 1. Then, in the later game for , we have , which implies that the later computation does not disturb the value . Therefore, is actually equal to after ends. It follows that does not compute a truth-table reduction to since the answer of the truth-table reduction on input x is equal to , which is not equal to . □
Let us review why the slight nonuniformity α is needed: To determine a time bound to calculate M, is bounded by the inequality in Claim 2. The inequality is derived from fixing randomness of strings whose length is less than . If α is slow enough, then the initial segment is decidable in . However, since the definition of uses time-efficient universality, M must run in at most polynomial time in t. This time limit prevents α from growing slowly because becomes huge as α becomes slow.
In the case of , since a f-efficient universal Turing machine U is allowed to run longer than a time-efficient one, the Turing machine M can also run long enough to calculate the initial segment . Therefore, we can eliminate the slight nonuniformity as stated in Part 2 of Theorem 3.2.
The proof is similar to that of Theorem 3.2.1. We prove only the inclusion .
Suppose that . Let be a time bound to compute L, and let h be a fast growing time bound such that for any constant d and for all large n, .
Fix a p-efficient universal Turing machine for some time bound p. For a time bound t, let . We construct M in exactly the same way with the proof of Theorem 3.2.1, using , t and α. Let r denote the length of output of M. The running time of M is bounded by because the second and third terms of (1) in the proof of Claim 2 are bounded by the first term for all large r. Thus if M outputs x, then M halts in at most steps.
This time, α is so slow that if and only if . The query is decidable in roughly steps for all large t. Thus Claim 1 still holds because is computable in .
Define U to be a Turing machine so that , and on input , U outputs in steps overall if halts in s steps. We can assume that and g is a time bound. Let and . Note that f depends only on .
(Revised Lemma 3.3).
For all but finitely many x,, and U is f-efficient.
The former follows because in steps if and only if in steps, and because . The latter follows because for any Turing machine N and any t. □
For such a time bound f, there exists as in Definition 3.1.2. Let t be a large time bound so that . Then Claim 3 contradicts the assumption. □
A restricted reduction
Our crucial observation is that we do not use the initial segment to prove Theorem 1.1: Its proof relies on the hardness versus randomness framework of Impagliazzo and Wigderson [9]. That is, the set of random strings of length is highly complex, and therefore hard enough to construct the pseudorandom generator that can derandomize completely. Nevertheless, as discussed in Section 3, the initial segment is the very obstacle to proving Theorem 3.2.1 without the slight nonuniformity α.
For the purpose of characterizing using Kolmogorov complexity, we should not allow reductions to query any short string of length less than . This motivates the notion of α-restricted truth-table reductions:
For languages , we write if there exists a polynomial time Turing machine M that, on input x, outputs an encoding of a circuit λ and a list of queries , where , so that .
Then let us restrict to by imposing the α-restriction:
is the class of all languages L such that for all large t and for any time-efficient universal Turing machine U, .
At a glance, our requirement, which bounds the query lengths from below by a unbounded function (that could be very slow-growing), may seem to be an atypical, and perhaps rather weak, restriction. Nevertheless, this restriction allows us to eliminate the slight nonuniformity in the upper bound:
for any nondecreasing unbounded computable function.
As discussed above, we still have even if the reduction is restricted. Thus we present only a proof of the inclusion . Note that the proof is almost the same with that of Theorem 3.2.
We change a polynomial-time truth-table reduction into an α-restricted one. Thus, enumerate all polynomial-time Turing machines and regard them as α-restricted polynomial-time truth-table reductions. That is, whenever queries a string of length less than , we ignore the Turing machine because it turns out to be not an α-restricted reduction. More precisely, when M seeks a “witness” x in Algorithm 1, if has turned out to be not an α-restricted reduction, then M goes to the next e.
(Revised Claim 1).
M can find a string x such thatand, orturns out to be not an α-restricted reduction.
We can assume that on input x, does not query a string of length less than because otherwise turns out not an α-restricted reduction. Then, in order to compute , the initial segment is not needed. Thus . □
All α-restricted reductions are enumerated. Thus, in the same way with Claim 3, it follows that any Turing machine does not compute . □
□
In fact, we have another application of α-restriction to ordinary Kolmogorov complexity. Let us define a limited class of :
For a function , contains all the decidable languages L such that for any universal Turing machine, . Let denote .
Although we failed to prove any upper bound for , we show that sits within for any slow-growing function α.
for any nondecreasing unbounded computable function.
We still have in the same way with Theorem 4.3. We show the inclusion .
The inclusion is equivalent to saying that for any decidable language L, if for all universal Turing machine U, then . By taking its contraposition, it is sufficient to show that for any decidable language , there exists a universal Turing machine U such that . Let L be an arbitrary decidable language such that .
Fix a universal Turing machine . Another Turing machine M is constructed later. Define U so that , and . Then , and . In particular, ensures that U is universal.
Enumerate all the polynomial-time Turing machines . All that remains is to construct M so that for each e, does not compute . In order to fool , a game is constructed for some input .
Description of the game. For a string x, the game is constructed by . Regard as a circuit and queries . If for some , then does not compute α-restricted reduction, so is ignored and omitted from the enumeration .
The game is played between the YES player and the NO player. Let R denote the current knowledge of . The current value is defined as . Define S to be .
Each player has penalties for each length of . is first equal to 0. The players choose strings in turn, declare that the elements of Z are not random, and let . They are penalized as for each n, and they must move so that does not exceed .
Since the game is finite, the game ends at the point after which both of the two players always choose . The YES player wins if when the game ends; otherwise the NO player wins.
Let denote the player that has a winning strategy. The player of the side plays the game according to a winning strategy. If the player declares that is not random, then let for some string . There always exists some unused description d because the game rule requires the penalty not to exceed . Note that means the number of used descriptions of length .
On the other hand, is regarded as the opponent of the side. Unlike the time-bounded case, nonrandom strings cannot be decided, but can be enumerated. Therefore, let enumerate all nonrandom strings throughout the computation of M by calculating for each string d in parallel. Suppose that some turns out to be not random by the enumeration. Then the player of the side is considered to declare that z is nonrandom, i.e., choose . The player obeys the penalty rule because there are at most d’s such that for .
Algorithm of M
It is unknown when stops to enumerate nonrandom strings. Thus we maintain the games in parallel. As a result, we must make sure that the games are “independent”: For example, if the player of a game uses a description , then 1 should be added to of another game , which may destroy a winning strategy of a player of . For this reason, by using α-restriction, x is chosen so that the queries of and the queries of satisfy .
The whole algorithm of M is shown in Algorithm 2. In stage s, we construct a new game for some , and maintain the games for all .
Analysis.
plays the games legally.
Because , the penalties of do not exceed . □
Theside can play a game according to a winning strategy. (That is, after the construction of the game, the winning strategy is not disturbed by the other games.)
In order to declare that a string is not random in a game , some unused description is needed. Because of l in the algorithm of M, the games are independent of each other, and hence a description of length is not used by the other games. Thus, the number of overall needed descriptions of length is at most due to the penalty rule of . Therefore, there is some unused description of length whenever a winning strategy suggests that is not random. □
M can find a stringsuch thatand, orturns out to be not an α-restricted reduction.
After the player of the side chooses some Z in the game , holds since the player behaves optimally. Moreover, is not changed by the other games because of independence of the games. Thus, after the game , holds, and hence does not compute . □
□
Using the set of short random strings as advice
As discussed in Section 4, all we need to show the lower bound of is the hardness of , where the function log is regarded as . Thus, even if we further restrict the α-restricted reduction into one that can query only strings of length , we still have the same lower bound. It appears natural to present a class defined by the restricted reduction as advice:
A language L is in if for some constant , for all large time bounds t and each time-efficient universal Turing machine U, there exists a polynomial-time Turing machine M such that for any , for all , , where denotes a standard encoding of .
The notation gives some intuition, but does not express all. In particular, random strings in Definition 5.1 are defined by time-bounded Kolmogorov complexity.
The class can be regarded as a limited class of . Therefore, by Theorem 4.3, we have . In fact, we can get a better uniform upper bound:
.
It is obvious that since the length of advice is at most polynomial in n. To prove , one can show that, in order to construct a pseudorandom generator, the constant c in Definition 5.1 can be chosen so that it depends on neither a time-efficient universal machine U nor sufficiently large time bounds t (since the constant c depends on the running time of an algorithm that decides a language in ; see [7] for details).
The proof of the inclusion is based on that of Theorem 4.3. Assume, by way of contradiction, that . Then there exists a constant c given in Definition 5.1. Define α as . The advice of random strings can be regarded as an α-restricted truth-table reduction to the set of random strings. Therefore, we can prove the upper bound in the same way with Theorem 4.3.
There is one change in the rule of the games. Recall that R denotes the current knowledge of . The current value of the game is equal to , where . In the proof of Theorem 4.3, for , there is the rth round, where r corresponds to the length of queries. Now that there are only queries of length , there is only the th round. Thus the player who has a winning strategy, , can be computed in as shown in the proof of the next claim.
(Revised Claim 5).
M can find a string x such thatand.
First, the YES player decides a subset , where . Next, the NO player decides a subset , where . Then the winning player corresponds to . Therefore, if and only if , , , which is decidable in . If the claim is false, then , and it is decidable in , which contradicts the assumption. □
For any polynomial-time Turing machine, there exists a string x such that.
By Claim 11, M can find a string x such that . Since the side plays the games according to a winning strategy, after the game, . □
The claim above shows that , which contradicts the assumption that . □
While we focused on the case of plain Kolmogorov complexity, one can show an analogue of Theorem 5.2 in the case of prefix-free complexity by combining our proof idea and that of [2].
Footnotes
Acknowledgements
The authors thank Eric Allender for introducing the second author to the relevant line of research and thank anonymous reviewers of Computability for helpful comments. They are also grateful for the advice and suggestions provided by Hiroshi Imai and the members of his group at the University of Tokyo. This work was supported in part by the Kayamori Foundation of Informational Science Advancement and the KAKENHI projects 24106002 and 26700001.
References
1.
E.Allender, Curiouser and curiouser: The link between incompressibility and complexity, in: Proceedings of the 8th Turing Centenary Conference on Computability in Europe: How the World Computes, CiE’12, 2012, pp. 11–16.
2.
E.Allender, H.Buhrman, L.Friedman and B.Loff, Reductions to the set of random strings: The resource-bounded case, in: Proceedings of the 37th International Conference on Mathematical Foundations of Computer Science, MFCS’12, 2012, pp. 88–99.
3.
E.Allender, H.Buhrman and M.Koucký, What can be efficiently reduced to the Kolmogorov-random strings?, Annals of Pure and Applied Logic138 (2006), 2–19.
4.
E.Allender, H.Buhrman, M.Koucký, D.van Melkebeek and D.Ronneburger, Power from random strings, SIAM J. Comput.35(6) (2006), 1467–1493. doi:10.1137/050628994.
5.
E.Allender, L.Friedman and W.Gasarch, Limits on the computational power of random strings, Information and Computation222 (2013), 80–92. doi:10.1016/j.ic.2011.09.008.
6.
S.Arora and B.Barak, Computational Complexity: A Modern Approach, 1st edn, Cambridge University Press, 2009.
7.
H.Buhrman, L.Fortnow, M.Koucký and B.Loff, Derandomizing from random strings, in: Proceedings of the 25th Annual Conference on Computational Complexity, CCC ’10, 2010, pp. 58–63.
8.
M.Cai, R.G.Downey, R.Epstein, S.Lempp and J.S.Miller, Random strings and tt-degrees of Turing complete C.E. sets, Logical Methods in Computer Science10(3) (2014), Paper 15. doi:10.2168/LMCS-10(3:15)2014.
9.
R.Impagliazzo and A.Wigderson, P = BPP if E requires exponential circuits: Derandomizing the XOR lemma, in: Proceedings of the 29th Annual ACM Symposium on Theory of Computing, STOC ’97, 1997, pp. 220–229. doi:10.1145/258533.258590.
10.
M.Li and P.Vitányi, An Introduction to Kolmogorov Complexity and Its Applications, 3rd edn, Springer Publishing Company, 2008.