In this paper, we generalize the classical Freidlin-Wentzell’s theorem for random perturbations of Hamiltonian systems. In (Probability Theory and Related Fields128 (2004) 441–466), M.Freidlin and M.Weber generalized the original result in the sense that the coefficient for the noise term is no longer the identity matrix but a state-dependent matrix and taking the drift term into consideration. In this paper, We generalize the result by adding a state-dependent matrix that converges uniformly to 0 on any compact sets as ϵ tends to 0 to a state-dependent noise and considering the drift term which contains two parts, the state-dependent mapping and a state-dependent mapping that converges uniformly to 0 on any compact sets as ϵ tends to 0. In the proof, we adapt a new way to prove the weak convergence inside the edge by constructing an auxiliary process and modify the proof in (Probability Theory and Related Fields128 (2004) 441–466) when proving gluing condition.
Consider the following system
where ϵ is a small positive constant and is a standard two-dimensional Brownian motion defined on the stochastic basis . We assume that and are differentiable mappings having bounded derivatives and the mappings and converge to zero, as ϵ goes to zero, uniformly on any compact subset of .
In what follows, we shall assume that there exists a function such that
This means in particular that if we denote by the solution of the unperturbated system
then , for every and . This means that remains on the same level set of H, for every . Moreover, if we define
then it can be easily proved that is the density of the invariant measure for .
Now, for every , we denote by the x-level set of H, that is
The set may consist of several connected components
and for every we will denote by the connected component of the level set , to which the point z belongs. If we identify all points in belonging to the same connected component of a given level set of the Hamiltonian H, we obtain a graph Γ, given by several intervals and vertices . In what follows, we shall denote by the identification map, that associates to every point the corresponding point on the graph Γ. We have , where denotes the number of the interval on the graph Γ, containing the point .
In the present paper we are interested in the asymptotic behavior of the Γ-valued process . Namely, we want to show that converges in distribution in the space , as , to a Markov process in Γ, whose generator is explicitly described in terms of suitable differential operators in the interior of every edge and suitable gluing conditions at each interior vertex.
If we define , as an immediate consequence of the Itô’s formula we have
where
and
Next, for every , we define
and
where
Moreover, we define
With these notations, we can introduce the following operator acting on functions defined on the graph Γ.
For , let
where
We denote by the set consisting all functions f defined on the graph Γ such that is well defined in the interior of the edge and for every there exists finite
and the limit is independent of the edge . Moreover, for each interior vertex
where denotes the derivative of f with respect to the local coordinate λ, along the edge and the signs ± are taken if or .
Next, for every , we define
The main result of this paper is given by the following theorem.
Supposesatisfies the following stochastic differential equationwith initial condition. Assume the coefficients satisfy Hypothesis
1
and the Hamiltonianintroduced in (
1.2
) satisfies Hypothesis
2
. Then the processconverges weakly into the Markov process Y generated by the operator, as described in Definition
1.1
.
In the present paper, we generalize the well known result by Freidlin–Wentcell on the validity of an averaging principle for Hamiltonian systems (see [3, Chapter 8])to a more general case and introduce a new method which simplifies some steps in the proof. Compared with the original Freidlin–Wentcell theorem, here we can cover the case of a state dependent diffusion coefficient and we can also deal with a the drift term. Moreover, both the diffusion coefficient and the drift term are given by the sum of a term of order one and a term of order ϵ. For the terms of order ϵ, we assume that, as ϵ goes to zero, they converge uniformly to zero over any compact sets in .
In the proof of the weak convergence in the interior of every edge and the analysis of the behavior of the process near the exterior vertices, we introduce a new proof, based on the construction of a suitable auxiliary process. What is remarkable is that, unlike the original proof, this new method unifies the two cases together. In the proof of the svalidity of the gluing conditions, when dealing with the extra terms of order ϵ in the drift and in the diffusion coefficient, we first introduce an auxiliary vector field in order to be able to apply the classical results based on generalized differential operators. Then we apply Girsanov’s theorem to get rid of the compensated drift term.
Finally, we would like to mention that our main motivation in studying this type of problem is provided by the paper [1], where together with Cerrai and Wehr we prove the validity of the Smoluchowskii-Kramer approximation for a system with a finite number of degrees of freedom, in the presence of a state dependent magnetic field λ. In this case, the problem is regularized by adding a small friction of intensity . After the small mass limit for the regularized problem is taken, we obtain a Hamiltonian system (with Hamiltonian λ) perturbed by a deterministic and a stochastic term. Both perturbations are given by the sum of two terms of different order, that with the notations of the present paper correspond to the drifts b and and the diffusions σ and . Theorem 1.2 allows us to obtain the limiting behavior, as ϵ goes to zero, for the slow component of the solution of the Hamiltonian system obtained from the small mass limit of the regularized problem. As shown in Theorem 1.2, the limiting process is given by a suitable Markov process on the graph associated with the Hamiltonian λ.
Some preliminaries
Hypotheses and notations
Concerning the coefficients in equation (1.1), we assume the following conditions
The mappingsandare all continuously differentiable with uniformly bounded derivatives.
The functionsandconverge to zero as ϵ goes to zero, uniformly on any compact set in.
The Hamiltonian H satisfies the following conditions.
H belongs toand has bounded second derivatives. Moreover
H has finite many critical points and for any two critical pointsand,.
For any critical point of H, the matrix σ is invertible in some neighbor of it.
The matrix of second order derivative is non-degenerate at any critical point of H.
There exists a constantsuch that,, and, for allsuch thatlarge enough.
As we mentioned in the Introduction, if we identify all points in belonging to the same connected component of a given level set of the Hamiltonian H, we obtain a graph Γ, given by several intervals and vertices . The vertices will be of two different types, external and internal vertices. External vertices correspond to local extrema of H, while internal vertices correspond to saddle points of H. Among external vertices, we will also include , the endpoint of the interval in the graph corresponding to the point at infinity.
We have seen that the identification map associates to every point the corresponding point on the graph Γ. Thus, if , then denotes the number of the edge on the graph Γ, containing the point . If is one of the interior vertices, the second coordinate cannot be chosen in a unique way, as there are three edges having as their endpoint.
On the graph Γ, a distance can be introduced in the following way. If and belong to the same edge , then . In the case and belong to different edges, then
where the minimum is taken over all possible paths from to , through every possible sequence of vertices , connecting to .
If x is not a critical value of H, then each consists of one periodic trajectory of the vector field . If x is a local extremum of , then, among the components of there is a set consisting of one point, the rest point of the flow. If has a saddle point at some point and , then consists of three trajectories, the equilibrium point and the two trajectories that have as their limiting point, as .
We introduce some other notations that will be used through out the paper. Letter D is used to denote domain, while letter C is for the level set of the Hamiltonian system. For any , is the interior of
and
Finally, we write if and only if one end of the edge is .
Throughout this paper, we shall denote
Generalized differential operator
In the proof of Theorem 1.2, we will need to rewrite each operator in the form of generalized differential operator. That is we want to find two measures and such that
Let and be the Radon–Nikodym derivative of and with respect to the Lebesgue measure respectively. For a reason which will be clear later, we want to choose
and
However, with this choice of and , does not generally equal to . In fact
To calculate
by equation (1.3),
where is the unit normal vector of . Apply Divergence Theorem, we have
To deal with
we have the following lemma
If we assume that, for any, we have
Let be the solution to the following ordinary differential equation
Then
which means H can be served as the time and
Let denote the unit length on the level set , and notice that is orthogonal to the normal vector of the curve , is the Lebesgue measure on .
The Lemma follows easily by
□
Now we apply equation (2.1) and we get
Moreover,
Therefore
If we apply the Divergence Theorem again, we get
where is the formal adjoint of the operator . Therefore
and the following theorem follows.
Let H and f satisfy the condition in Lemma
2.1
, then
Apriori estimate
Consider the stopping time
We have the following lemma.
Under Hypothesis
1
and
2
, for any fixed, and arbitrary, there exists a constantsuch that
Recall that satisfies the following stochastic differential equation
where , . By our assumption that is of linear growth, b, , σ, are all Lipschitz continuous. There exists a constant C, such that
Therefore
and since for x large enough, there exists a constant C such that
If we apply the Gronwall’s inequality, this implies
Also,
So that,
Now pick R such that , and
Due to (2.4), the second term above is smaller than by our choice of R. For the first term, we have
Now,
so that
Therefore, we can pick large enough in (2.5) so that
□
Lipschitz continuity
Under Hypothesis
1
and
2
, for any continuous function f,,andare Liptichitz continuous inwhereandare inside the interior of some edge.
The tool we use here Lemma 1.1 in Chapter 8 of [3]. We first calculate the derivative of .
Let
By equation (2.2)
Next, we apply Lemma 1.1 in Chapter 8 of [3] again to calculate
Also we apply Lemma 2.1 to calculate the derivative of the residue in (2.7)
By our assumptions, it can be easily checked that is both bounded and bounded below above 0, and both and are bounded. Therefore and are bounded in , which implies their Lipschitz continuity. The Lipschitz continuity for is obvious. □
We first define the following sequence of stopping times. In the definition below, δ and are to be determined later.
Let S be the set of integers , such that is a saddle point. For any and , let ... be a sequence of stopping times with
and
Where is the stopping time defined in (2.3). Moreover, we define
Recall that we denote by L the infinitesimal operator of the process on Γ. If the Poisson problem has a unique solution then this solution has the representation
Replacing u by , gives
If we can prove that for all ,
then the tightness of the family , Prokhorov theorem and its corollary, and the fact that the range of the operator uniquely determines a measure guarantee that converges weakly to as in .
By Freidlin and Wenztell’s procedure, the tightness of the measure on follows easily from apriori estimate proved in Lemma 2.3. Hence it is sufficient to prove that, for any and , there exists such that for all
which contains two parts: the part where remains in the same edge and the part where approaches an inner vertex of the graph. We have
If we define
and
We have
Follow the same procedure in [3] and get, for sufficiently small ϵ,
so that for sufficiently small ϵ,
In order to study the term , we need the following result.
Letbe a one dimensional standard Brownian motion, and letbe the solution to the following equationAssume, and letbe the stopping time whenleaves the region. If either one of these three cases holds,
is an edge such that both of its vertex are interior vertex, andare any fixed values belonging to the interior of the interval.
is an edge such that one of its vertex is an interior vertex while the other vertex is an exterior vertex, where the Hamiltonian takes the value,belonging to the interior of the interval.
is an edge that has only one vertex, andare any fixed values belonging to the interior of the interval.
Then under Hypothesis
1
and
2
, for every function f onthat is three times continuously differentiable and for every,uniformly with respect to.
In order to study the term , we need the following result.
Letbe an interior vertex, then for every positive α and κ, there existssuch that for alland sufficiently small ϵ,and there exists constant C such that(see remark on page 310 in [
3
])
We can use Lemma 3.5 in [3] and apply the change of random time to prove this Lemma easily. Now if we take
in Lemma 3.3, we have
We will prove the following Lemma in Section 5 to handle the term
The following lemma will be proved in Section 5.
For everythere exists a positivesuch that forthere existssuch that for sufficiently small ϵ,whereis the constant defined in (
1.5
).
If we take , then for sufficiently small ϵ,
To this point, we first pick and then determine δ and according to Lemma 3.4. After that, we let ϵ sufficiently small so that Lemma 3.4 holds for and Lemma 3.2 holds for
Therefore
which proves the result.
Weak convergence inside the edge
We first prove the weak convergence of inside an edge of finite length. For , recall that is the stopping time when the process leaves the region . Noticed that the coefficients are all bounded in . Let be the period of starting at z. Suppose for all . Where we should remark that the constant and depends on the constant and . In this section, we consider the process starting from and denote . We will first prove a weaker version of Lemma 3.2.
Letandbe defined as in Lemma
3.2
. Under the same condition as in Lemma
3.2
, for every function f onthat is three times continuously differentiable and for every,uniformly with respect to.
In what follows we shall define a sequence of stopping times defined by , . We first define an auxiliary process trajectory by trajectory.
Let , and
For every , we define and for ,
We should notice that is a standard Brownian motion on .
We should also define the auxiliary process which has the same distribution as the limiting process of the slow motion.
For , is the solution of the problem
with the initial condition .
Closeness of the and
Let satisfies the equation with the initial condition . It can be easily seen that
Therefore, for ,
Due to equation (2.6), , we have
Apply Gronwall’s inequality, we have
and
In this subsection, we will introduce two critical term in the estimation of the closeness of the trajectories. For , let
and
It is easy to see that , if . Otherwise,
Where satisfies the equation , . Notice that the last two terms in the summation appears only once in all . For the first term, we have
Thus, if we set, for and every k except one, we get
Similarlly, for , let
and
Due to the inequality that , for , we have for
By perceeding as for , we have for and every k except one,
Closeness of and
Let
then as we assumed in Hypothesis 1. For , we have
If , then
So we consider the case when ,
below.
Therefore, by Kolmogorov’s inequality
Therefore, thanks to (4.1)
Finally, since there are at most ’s before time T, we get
So that
Therefore, we have proved the following Lemma
Under Hypothesis
1
and
2
, for any initial conditionon edgeandsuch that, we have
Closeness of and
Let . For , we have
Let . For ,
Since , we define to be the extension of in the interval such that , and , . The process is defined similarly. Then for ,
and
So
Let , then and
Since
we get
and
Therefore
If we apply Gronwall’s inequality, we have
and thus
All this implies,
Now, let . The calculation above implies the following recursive,
By induction, we can easily deduce that
Finally, let ,
Therefore, we have proved the following Lemma
Under Hypothesis
1
and
2
, for any initial conditionon edgeandsuch that, we have
By Ito’s formula, for any ,
Since and have the same distribution,
Moreover,
Apply Lemma 4.5 and 4.6, converges to 0 uniformly with respect to . And
Similarly, converges to 0 uniformly with respect to . So
Under Hypothesis
1
and
2
, and let the stopping timebe defined as before, where,are in the interior of, then
Let f satisfies and the boundary conditions . Then apply Lemma 3.2 with f and have
Since the second order differential operator is uniformly elliptic on , we can apply the maximum principle, and we get
where C is a constant depending on the coefficients in and . In particular, it is independent of T. Therefore, by Fatou’s Lemma
Since this inequality holds for all and . □
Under Hypothesis
1
and
2
, for any initial conditionon the edgeandsuch that,are inside the interior of, we have
Unlike the its previous counterpart, the number of intervals here is random. Let N be the largest k such that . For simplicity, we write as , eliminate the edge coordinate i and let in this proof. Thanks to (4.2) and apply Fatou’s Lemma and strong Markov property,
Moreover,
and thus prove for all ,
Finally, we take the sup over T and prove the result. □
Under Hypothesis
1
and
2
, for any initial conditionon edgeandsuch that,inside the interior of, we have
Similarly, we have
Now by the estimation (4.3), and process as the previous Lemma, the result is obvious. □
Now if we apply exactly the same proof as in Lemma 4.1, Lemma 3.2 is proved.
Properties near the saddle point
To prove the gluing condition, we follow a similar method used in [2]. Since the gluing condition is given by local property, we can assume without loss of generality in this section that
The coefficients,,,and,in equation (
1.1
) are uniformly bounded.
Averaging the measure
We first remark that: the probability for , starting from an initial point to reach the level set is approximately the same. The following Lemma is a consequence of Krylov and Safonov’s theory (see [5]).
There exists asuch that for everyand any, the following estimate holds for sufficiently small ϵ,
If the function f in the Lemma is , we have . Now we can average the probability by a measure μ on the level set .
Let μ be a measure on the level set. For anyand sufficiently small ϵ,
The measure μ will be determined by the long term behavior of as to be shown in the next subsection.
Representation of the invariant measure
Let
The sequence is a Markov chain and for all , . Moreover if , is a Markov chain on . Then every invariant measure of the process can be represented in the form
where and are measures on and such that
See [4]. Notice that and are the invariant measures for the Markov chains and on and respectively. Now we pick the measure μ in equation (5.1) to be and we get
Hence, we apply equation (5.3) with and since , for all , we have . Therefore
Finally, apply equation (5.2) with a bounded measurable function G and get
In this section we first assume that the following Hypothesis holds,
Let functionbe the function defined by (
1.3
). We assume that a satisfies the following relation for all.
Then is the invariant measure for the system as and is a generalized differential operator. Apply Lemma 2.1 again, the left hand side of equation (5.4) becomes
Let the window function depend only on the Hamiltonian,
where with support in and , , are the two end points of an edge . Then
Finally, we apply Lemma 3.2 to the right hand side of equation (5.4),
where and is the solution to
It can be easily seen that can be solved by the following formula,
Notice that f has support in , so if , then
and if , then
Now we combine equation (5.4) and equation (5.5),
which is an equation holds for all f, therefore we have:
Finally, letting and gives us Lemma 3.4 under Hypothesis 4.
Now we consider the case without Hypothesis 4. Instead of considering the process whose invariant measure is unknown when , we consider the system by compensating a smooth vector filed and satisfying
and , are the corresponding formal adjoint operator. We first claim that both such and exists under Hypothesis 1, 2, and 3. Indeed
Let and satisfy the first-order partial differential equation
Then
By Hypothesis 3, is well-defined, bounded and globally Lipschitz continuous. Similarly, such also exists. Therefore the stochastic process is well-defined for every fixed :
and satisfies Hypothesis 4. Therefore, the gluing condition holds for .
For the process , the drift term and need to be killed from . I apply Girsanov Theorem up to any fixed time T.
The measure induced by the process on is absolutely continuous to the measure induced by on with the density
where
Compare the two measures of a set , where the expectation, without specification, is taken with respect to the measure .
For simplicity, we denote
Apply Ito’s formula to :
By Hypothesis 3 and 4,
and
Then apply Girsanov theorem again to the last equation
This implies
Then let and for as in Lemma 3.4,
and thanks to Lemma 3.3, we have
Therefore, for sufficiently small δ,
Therefore, we proved the general case.
References
1.
S.Cerrai, J.Wehr and Y.Zhu, An averaging approach to the Smoluchowski–Kramers approximation in the presence of a varying magnetic field, Journal of Statistical Physics (2020), to appear.
2.
M.Freidlin and M.Weber, Random perturbations of dynamical systems and diffusion processes with conservation laws, Probability Theory and Related Fields128 (2004), 441–466. doi:10.1007/s00440-003-0312-0.
3.
M.Freidlin and A.Wentzell, Random Perturbations of Dynamical Systems, 3rd edn, Springer Verlag, 2012.
4.
R.Z.Khas’minskii, Ergodic properties of recurrent diffusion processed and stabilization of the solutions of the Cauchy problem for parabolic equations, Teor. Veroatn. Ee Primen.5 (1960), 179–196.
5.
N.Y.Krylov and M.V.Safonov, On a problem suggested by A.D. Wentzell, in: The Dynkin Festschrift. Markov Processes and Their Applications, M.I.Freidlin, ed. Birkhauser, Boston, 1994, pp. 209–220. doi:10.1007/978-1-4612-0279-0_12.