The notion of ‘modulus of regularity’, as recently studied in [Moduli of regularity and rates of convergence for Fejér monotone sequences, 2017, Preprint], unifies a number of different concepts used in convex optimization to establish rates of convergence for Fejér monotone iterative procedures. It generalizes the notion of ‘modulus of uniqueness’ to the nonunique case. In this paper, we investigate both notions in terms of reverse mathematics and calibrate their Weihrauch complexity.
In [19], the concept of modulus of regularity is introduced as a central tool to construct rates of convergence for classes of Fejér monotone sequences which appear in fixed point theory, monotone operator theory and convex optimization. The concept of modulus of regularity gives a unified account of various notions such as metric subregularity [10,21,22], Hölder regularity [4], error bounds [21] and weak sharp minima [9] which play a prominent role in nonlinear optimization. It is general enough to cover many equilibrium, convex feasibility, fixed point and minimization problems involving set-valued operators.
In the case where the solution in question is unique, the concept coincides with the notion of modulus of uniqueness [13,18,19] and can be understood as its generalization to the nonunique case.
While most problems in nonlinear analysis deal with classes of abstract, in general not necessarily separable, metric or normed structures, we restrict ourselves in this paper to the compact metric case. In this situation a modulus of regularity always exists for continuous functions F ([19], Proposition 3.2) and the concept has been anticipated already in [1]. We calibrate the strength of its existence in the sense of reverse mathematics as well as its Weihrauch complexity and compare this with the case of a modulus of (uniform) uniqueness in the unique case. We also consider the weaker -version of regularity (without stating the existence of a modulus function) and show that in this form, regularity is equivalent to while the existence of a modulus is equivalent to . This differs from the unique case where both the -form as well as the modulus version of uniform uniqueness are equivalent to . The difference also shows up in the Weihrauch complexity: the many-valued modulus-of-uniqueness operator MUNI is computable while the many-valued modulus-of-regularity operator MREG is Weihrauch equivalent to . Both phenomena are due to the fact that the proof for the -form of regularity already makes substantial use of classical logic (, where M denotes the Markov principle) while in the unique case only M is used.1
The latter is already implicit in [13] and [3] since to convert uniform uniqueness to prenex form as in [13], resp. reformulating it in the strong form stated on p. 714 in [3], just amounts to applying M.
Let be a metric space and let be a mapping. Let and . We say that F is regular w.r.t. and for if
If this holds with ‘’ replaced by ‘’ we say that F is regular w.r.t. .
A function providing given n a number satisfying the above is called a modulus of regularity of F w.r.t. and resp. w.r.t. .
In [19] the conclusion ‘’ is conveniently written using the metric distance functions as ‘’ but, of course, the concept does not presuppose the existence of ‘dist’, i.e. the locatedness of .
Again for convenience and to follow the style used in analysis, the concept of a modulus of regularity in [19] is written in -form, i.e. as a function . All results can be, however, easily re-casted in terms of the modulus as defined above which is more appropriate for the context of reverse mathematics and Weihrauch reducibility.
When is not a singleton set, effective moduli of regularity can only be expected to exist in rather restricted situations (due to their strong consequences on rates of convergence for numerous iterative procedures used in nonlinear analysis). However, [19] describes important cases where such moduli can be explicitly computed.
The concept of modulus of regularity generalizes that of a ‘modulus of uniqueness’ to the nonunique case:
We say that is uniformly unique w.r.t. if
If this holds with ‘’ we say that is uniformly unique.
is a modulus of uniqueness for F w.r.t. and for if
If this holds with ‘’ we say that ω is a modulus of uniqueness for F w.r.t. .
The concept of modulus of uniqueness can also be considered without assuming that in the form
Clearly, if , then any ω with (∗) is a modulus of uniqueness in the sense of definition 2.3 and conversely, if ω is a modulus of uniqueness, then satisfies (∗). Suppose that one has an algorithmic way to construct -approximate zeros , i.e. of F and ω satisfies (∗), then is a -Cauchy sequence whose limit (for complete X and continuous F) is a zero of F. In this way, moduli of uniqueness give rates of convergence for algorithms computing approximate solutions towards the actual solution and have been used in fixed point theory to prove even new existence results (see [7] and the literature cited there). As shown in [13] (for the case of compact metric spaces), explicit moduli of uniqueness can be extracted by proof-theoretic methods from given, even nonconstructive, proofs for the uniqueness of the zero of F. This has been carried out in the context of best Chebycheff approximation in [13 ,14] and best -approximation in [20] (see [18] for a comprehensive treatment of all this). Using the logical bound extraction theorems for abstract metric and normed structures (without separability or compactness assumptions) from [12,17] such extractions of moduli of uniqueness are also possible for abstract spaces in the absence of compactness (see [18], pp. 377–381) and have been used in metric fixed point theory e.g. in [7,11].
While the existence of a modulus of uniqueness is a uniform quantitative version of the plain uniqueness property
and can be extracted from a given proof of the latter, the existence of a modulus of regularity is a uniform quantitative version of the following trivially true (but logically more complex than (1)) property:
So in this generality, there is no meaningful property such that from a proof of this property a modulus of regularity can be extracted. Thus unless one is in the unique case (where the concept of a modulus of regularity coincides with that of modulus of uniqueness), one has to exploit rather specific features of the situation at hand in order to get an effective modulus of regularity (see also the comments in the introduction and [19]).
Reverse mathematics
In the following, is the usual base system used in reverse mathematics, i.e. the fragment of second order arithmetic with recursive comprehension and -induction only. and are its extension by the weak König’s lemma WKL for -trees and the schema of arithmetic comprehension ACA, respectively. For details we refer to [25].
We refer to the definition of compact metric spaces X as used in reverse mathematics ([25], Definition III.2.3), where X is given as the completion of a countable pseudometric space A which, additionally, possesses a sequence of finite ε-nets. We recall some crucial results from [25]:
The following is provable in. Let X be a compact metric space. Letbe a sequence of coverings of X by open sets. Then there exists a sequence of finite subcoverings.
The following is provable in. Let X be a compact metric space. Let C be a code for a closed subset in X. Then the nonemptyness of C can be expressed by a-formula.
The following is provable in. Let X, Y be complete separable metric spaces anda continuous function. Letbe (a code of) an open set, thenis open (with a code computable from a code for V).
The following is probable in. Let X be a compact space andbe continuous, then the property that F has a zero on X can be expressed by a-formula.
Clearly, proves that is open. Hence by Proposition 3.3, has a code as an open set and so has a code as a closed set. So by Theorem 3.2, provably in , the nonemptyness of can be expressed by a -formula. □
The proofs of the results above establish that even if we have sequences and uniformly given as sequences of codes that then proves that is a sequence of open sets and proves that the nonemptyness of can expressed as a -formula with n as parameter. The latter is particularly easy to see in our instances below, where the functions are defined in terms of a single function Φ and any modulus of uniform continuity for Φ can be modified into one for all uniformly in n (see also the explicit construction of the -formula in the proof of Lemma 5.5 below).
Reverse mathematics of moduli of uniqueness and regularity
proves that for every compact metric spaceany continuous mappinghaving at most one zero has a modulus ω such thatObviously, if, then ω is a modulus of uniqueness of F w.r.t..
Already for Lipschitz continuous functionswhich have exactly one zero, the uniform uniqueness of the zero impliesover.
1) Let F possess at most one zero and define
is a sequence of coverings of (w.r.t. the product metric, [25], Example II.5.4) by open sets. Here one uses Proposition 3.3 and the fact that for the continuous functions
one has
By Theorem 3.1 it follows that, provably in , there is a sequence of finite subcoverings. Clearly, is a modulus of uniqueness.
2) Assume ¬-WKL. Then there exists an infinite -tree T with no path. Consider the subtrees and of T consisting of all finite -sequences in T which start with 0 or which start with 1 resp. One of those subtrees must be infinite as well. W.l.o.g. assume that is infinite. Define as augmented by the constant-1 path. With this (playing the role of T) now define as in [25] (p. 129) a sequence of nonempty open intervals with rational endpoints. Adapting the reasoning there to our situation (using that for with and ) one gets
Now we define (see [28], p.309):
where
Then, clearly, () by (i), by (ii), and by (iii). F is nonexpansive but obviously is not uniformly unique w.r.t. .2
Alternatively, we could have modified the mapping in the proof of Theorem IV.2.3.5 in [25] by using ‘’ instead of ‘’ to achieve the Lipschitz property. However, we find the construction above more elementary. One can also adapt Specker’s [27] construction or the Lipschitz functions defined in [2].
□
proves that for every compact metric spaceany continuous mappinghaving a zero is regular w.r.t..
Already for Lipschitz continuous functionswith, the regularity of F w.r.t.impliesover.
1) Take and consider the finite cover of X by closed balls provided by the representation of X as a compact metric space (in the sense of [25], Definition III.2.3), where are in the countable set A whose completion the space X is defined to be. Using , the predicate
is in . Here we use that can be written as for the continuous function defined by
and Corollary 3.4. Hence by bounded -comprehension (provable in , see [24], Theorem 1, or [25], Theorems II.3.9 and II.2.5) one gets the existence of a code σ of a finite -sequence of length such that
Consider now an i with . Then, again by and using [25], Theorem IV.2.2, one gets the existence of an with
and hence
In (needed to show the existence of a modulus of uniform continuity for F and hence for ) one can show that the sequence with exists. So by -bounded collection (provable in ) we can prove
Now assume that for some . Then, by the definition of l, x must be in one of the balls (with ) for which , i.e. which contains a zero z of F. Since , the conclusion follows.
2) is an immediate corollary to Theorem 4.1(2) as in the unique case the concepts of regularity and uniform uniqueness coincide. □
The proof of Theorem 4.2(1) uses classical logic in the form of -LEM (implicit in the bounded -CA and M (implicit in concluding from .
proves that ifis a compact metric space andis continuous and has a zero, then F possesses a modulus of regularity w.r.t..
Over, the statement that every Lipschitz continuous functionwith a zero has a modulus of regularity implies.
1) Let . By Theorem 4.2(1), already suffices to prove:
We now show (in ) that
Define (uniformly in n, l) continuous functions by
Then
Hence, by Corollary 3.4, proves (see also the proof of Lemma 5.5) that .
(3) and (4) imply that
and so by (a single use of) , and thus, in particular, in , we get
Using the continuity of F and that is dense in X we get
and so is a modulus of regularity for F w.r.t. .
2) We use a construction from the proofs of Remarks 4.9 and 4.10 in [19] which in turn adapt a construction due to [23]. Let be a nondecreasing sequence of rational numbers in and define
f is nonexpansive and, therefore, T is nonexpansive too (even firmly nonexpansive) and 1 is a fixed point of T. By primitive recursion one can easily show in that the sequence defined by exists. By the comment after Corollary 1 in [16], is a rate of asymptotic regularity for , i.e.
All this can easily be established in . Suppose now that the 2-Lipschitz function , has a modulus of regularity (note that ) Then (reasoning as in the proof of Theorem 4.1 in [19]; note that obviously is Fejér monotone w.r.t. since T is nonexpansive) one can easily show in that is a rate of convergence for . So can be shown in to exist and is a fixed point of T, i.e. a zero of F. Since , it is clear that . Suppose that there would exist a with for all . Then by -induction (and hence in ) also for all in contradiction to : clearly . Assume that . Then
Hence . The claim now follows from the well-known fact that the convergence of increasing sequences of rational numbers in is equivalent to over ([25], Theorem III.2.2, and note that the sequence constructed in the relevant part of the proof of this theorem is an increasing sequence of rational numbers in so that we may take ; alternatively one can also adapt Specker’s [26] construction). □
Weihrauch complexity of moduli of uniqueness and regularity
We recall the standard concepts used in the notion of Weihrauch reducibility which can be found e.g. in [5].
A represented space is a pair , where X is a set and is a partial surjective function.
Let be represented spaces and let be a multi-valued function. Then is a realizer of f if
Let f, g be multi-functions on represented spaces. Then f is said to be Weihrauch reducible to g, in symbols , if there are computable functions such that is a realizer of f whenever G is a realizer of g. We say that f and g are Weihrauch equivalent, in symbols , if both and .
The parallelization of the ‘omniscience principle’ LPO (i.e. -LEM) is defined as3
The official definition is slightly different but modulo Currying trivially equivalent to this, see [5], Lemma 6.3, where our formulation is called C.
The formulation of the convergence principle for bounded monotone sequences of reals is formulated in the framework of Weihrauch complexity as follows
with . is the restriction of MCT to .
It is well-known (see e.g. [6], Facts 3.5 and 11.26; the result essentially is also already in [15], Proposition 5.5) that
Let us define
with , and
with .
We will show that is computable and so this a fortiori holds for its restriction to those F which have exactly one zero.
In the proofs below we refer to the standard representations of and but suppress explicitly mentioning them.
.
By the comments above, it suffices to show that
As the proof of Theorem 4.4(2) shows, uniformly in a given increasing sequence of rational numbers in one can compute a (2-Lipschitz) function such that if , then can be computed uniformly as a functional using that is a rate of convergence for whose limit is a (note that in , the mapping T, and hence the sequence , is computable). □
.
Let with . From a name of F in the usual standard representation of one can compute a modulus of uniform continuity for F, i.e.
and from this a (common) modulus of uniform continuity for the function from the proof of Theorem 4.4(1) for all . Using compactness (WKL) the statement
can be written equivalently as
where for (a name of) , ‘’ denotes the -rational approximation to x provided by that name. Note that WKL is only needed to verify the above equivalence but not to construct the -formula from a name of F.
With being some standard enumeration of the dyadic rational numbers in , the property in the proof of Theorem 4.4(1) can now be written (coding three universal quantifiers into a single one) as
for a function which can be uniformly computed as a function in (a name of) F. Now let . Then the statement
established in the proof of Theorem 4.4(1) has the form
so that an with
can be uniformly computed in p as
As in the proof of Theorem 4.4(1) it follows that is a modulus of regularity for F w.r.t. , i.e. . □
The above also holds for general computably compact computable Polish spaces instead of .
In contrast to this, we have that is computable (again this holds for every computably compact computable Polish space K instead of but for the sake of simplicity we treat here only the case ):
is computable.
We use the representation of from [18] (Chapter 4) by which
each represents a unique element in ,
primitive recursively in one can define such that represents the same real in as f does (here ).
On the level of names f, g for and a given name for one can express
as a -formula
where Φ is a (primitive recursively) computable functional .
If F has at most one zero, then
defines (computably in ) a total function . With also is computable in . Hence the restriction of to functions has (uniformly in n) a modulus of uniform continuity which is computable in , i.e.
Using ω one can compute (uniformly in )
Clearly, α is a modulus of uniform uniqueness in the form (∗) for . □
The proof above uses an unbounded search which terminates by the assumption of the uniqueness of the solution but which does not provide any complexity information. This is in contrast to the situation where one has a proof (even if that is prima facie noneffective) for the uniqueness from which – as discussed briefly in Section 2 – one can then extract a modulus of uniqueness which reflects the numerical content of that uniqueness proof.
Footnotes
Acknowledgement
The author has been supported by the German Science Foundation (DFG Project KO 1737/6-1).
M.Beeson, Foundations of Constructive Mathematics, Springer Verlag, Berlin, Heidelberg, New York, 1985.
3.
J.Berger, D.Bridges and P.Schuster, The fan theorem and unique existence of maxima, J. Symb. Logic71 (2006), 713–720. doi:10.2178/jsl/1146620167.
4.
J.M.Borwein, G.Li and M.K.Tam, Convergence rate analysis for averaged fixed point iterations in common fixed point problems, SIAM J. Optim.27 (2017), 1–33. doi:10.1137/15M1045223.
5.
V.Brattka and G.Gherardi, Weihrauch degrees, omniscience principles and weak computability, J. Symb. Logic76 (2011), 143–176. doi:10.2178/jsl/1294170993.
6.
V.Brattka, G.Gherardi and A.Marcone, The Bolzano–Weierstraß theorem is the jump of weak König’s lemma, Ann. Pure Appl. Logic163 (2012), 623–655. doi:10.1016/j.apal.2011.10.006.
7.
E.M.Briseid, Logical aspects of rates of convergence in metric spaces, J. Symb. Logic74 (2009), 1401–1428. doi:10.2178/jsl/1254748697.
8.
D.K.Brown, Functional analysis in weak subsystems of second order arithmetic, PhD thesis, Pennsylvania State University, 1987.
9.
J.V.Burke and M.C.Ferris, Weak sharp minima in mathematical programming, SIAM J. Control Optim.31 (1993), 1340–1359. doi:10.1137/0331063.
10.
A.Dontchev and R.Rockafellar, Implicit Functions and Solution Mappings. A View from Variational Analysis, Springer Monographs in Mathematics, Springer, Dordrecht, 2009.
11.
P.Gerhardy, A quantitative version of Kirk’s fixed point theorem for asymptotic contractions, J. Math. Anal. Appl.316 (2006), 339–345. doi:10.1016/j.jmaa.2005.04.039.
12.
P.Gerhardy and U.Kohlenbach, General logical metatheorems for functional analysis, Trans. Amer. Math. Soc.360 (2008), 2615–2660. doi:10.1090/S0002-9947-07-04429-7.
13.
U.Kohlenbach, Effective moduli from ineffective uniqueness proofs. An unwinding of de La Vallée Poussin’s proof for Chebycheff approximation, Ann. Pure Appl. Logic64 (1993), 27–94. doi:10.1016/0168-0072(93)90213-W.
14.
U.Kohlenbach, New effective moduli of uniqueness and uniform a-priori estimates for constants of strong unicity by logical analysis of known proofs in best approximation theory, Numer. Funct. Anal. Optim.14 (1993), 581–606. doi:10.1080/01630569308816541.
15.
U.Kohlenbach, Things that can and things that cannot be done in PRA, Ann. Pure Applied Logic102 (2000), 223–245. doi:10.1016/S0168-0072(99)00036-6.
16.
U.Kohlenbach, On the computational content of the Krasnoselski and Ishikawa fixed point theorems, in: Proceedings of the Fourth Workshop on Computability and Complexity in Analysis, J.Blanck, V.Brattka and P.Hertling, eds, LNCS, Vol. 2064, Springer, 2001, pp. 119–145. doi:10.1007/3-540-45335-0_9.
17.
U.Kohlenbach, Some logical metatheorems with applications in functional analysis, Trans. Amer. Math. Soc.357(1) (2005), 89–128. doi:10.1090/S0002-9947-04-03515-9.
18.
U.Kohlenbach, Applied Proof Theory: Proof Interpretations and Their Use in Mathematics, Springer Monographs in Mathematics, 2008, xx+536pp.
19.
U.Kohlenbach, G.López-Acedo and A.Nicolae, Moduli of regularity and rates of convergence for Fejér monotone sequences, Submitted, arXiv:1711.02130.
20.
U.Kohlenbach and P.Oliva, Proof mining in -approximation, Ann. Pure Appl. Logic121 (2003), 1–38. doi:10.1016/S0168-0072(02)00081-7.
21.
A.Y.Kruger, Error bounds and metric subregularity, Optimization64 (2015), 49–79. doi:10.1080/02331934.2014.938074.
22.
D.Leventhal, Metric subregularity and the proximal point method, J. Math. Anal. Appl.360 (2009), 681–688. doi:10.1016/j.jmaa.2009.07.012.
23.
E.Neumann, Computational problems in metric fixed point theory and their Weihrauch degrees, Log. Method. Comput. Sci.11 (2015), 44pp.
24.
C.Parsons, On n-quantifier induction, J. Symbolic Logic37 (1972), 466–482. doi:10.2307/2272731.
25.
S.G.Simpson, Subsystems of Second Order Arithmetic, 2nd edn, ASL Perspectives in Mathematical Logic, Cambridge University Press, 2009, xvi+444.
26.
E.Specker, Nicht konstruktiv beweisbare Sätze der Analysis, J. Symb. Logic14 (1949), 145–158. doi:10.2307/2267043.
27.
E.Specker, Der Satz vom Maximum in der rekursiven Analysis, in: Constructivity in Mathematics, A.Heyting, ed., Proceedings of the Colloquium Held at Amsterdam 1957, North-Holland, 1959, pp. 254–265.
28.
A.S.Troelstra and D.van Dalen, Constructivism in Mathematics, Vol. I, North-Holland, Amsterdam, 1988.