We consider a cubic nonlinear wave equation with highly oscillating cubic coefficient and wave packet initial data. Using a regularization step of the initial data, we give a low regularity justification of the Nonlinear Schrödinger equation as the envelope equation.
Amplitude equations are an ubiquitous tool to describe complicated physical systems modeled by partial differential equations. A phenomenon of particular interest is the propagation of wave packets, e.g. light pulses in dispersive media. This has been addressed in various physical settings, i.e. in linear and nonlinear media with or without oscillating coefficients. A non-exhaustive list of work on amplitude equations for wave packet propagation is [7,10–12,15–21]. In this amplitude formalism (AF), first by a multiple scaling ansatz the amplitude equation is derived, which typically takes the form of a (nonlinear) Schrödinger equation with constant so called effective coefficients. Then, in a second step, the amplitude equation is justified by estimates for the error between solutions of the full system and the approximation by the AF. This is also related to certain WKB methods (in geometric and diffractive optics, see, e.g., [2,8,13]), but since we restrict to one space dimension and to almost monochromatic waves (no explicit leading order phase modulation in the initial conditions) we prefer the more specific name amplitude formalism.
Our purpose here is to improve the AF to regularity assumptions on the data that are lower than in previous results. We consider the cubic Klein–Gordon equation
where , , , and where is highly oscillating with period ε, i.e., with small ε and extended via for . This can be generalized in various ways, for instance to oscillating coefficients for the linear terms, but for simplicity we stick to (1). On the other hand, the oscillating nonlinear coefficient gives some averaging effects which go beyond the case of constant c; see Remark 1.1 for further comments.
The conserved energy for (1) is
If for simplicity we restrict to the case for all , then we immediately obtain the uniform a priori estimate
in the energy norm , where here and in the following always , i.e.,
From (3) we obtain global existence of solutions of (1). However, without the sign condition on , which we do not assume in this paper, (3) is wrong, and good estimates for depend on the class of initial conditions (IC).
We consider IC in the form of wave-packets
with , , , , , and where is related to in a certain way, specified in Theorems 2.3 and 2.5 below. Here denotes the complex conjugate, from now on we will use the shorthand notation to denote the complex conjugate of the preceding terms. The spatial wave number and the temporal wavenumber are related by the dispersion relation, which for (1) takes the simple form
On a the level of formal asymptotic expansions it is well known, e.g., [9], that (1) has approximate solutions of the form
where (the group velocity) and fulfills the Nonlinear Schrödinger equation (NLS)
As our main results we give justifications of (7) with low regularity requirements on . For , letting we prove, on an time scale, the error estimate
For , , letting we show that
with the scaled norm . See Theorems 2.3 and 2.5 for the precise results. In particular, these also yield long time existence for (1) for the specific IC without assumptions on the sign of c. The reduced regularity assumptions are achieved by smooth approximations of the initial conditions via cut-off of Fourier modes of wave number k with , and balancing of errors: the approximation error is small for large n, while the error estimates of the amplitude formalism will grow in n. We balance the growth in n with some power of ε, track the errors and choose an appropriate to achieve the desired estimates.
a) As already mentioned, many related results are known in the literature, see, e.g., [7,11,19,21] and the references therein. Our model (1), (4) is a simple extension of the constant coefficient cubic Klein–Gordon equation, and the use of instead of a constant allows to illustrate some averaging effects in the effective coefficients. Moreover, it allows to illustrate a slight technical improvement (see Remark 2.1) which avoids the diagonalization of linear operators for instance used in [7] for the definition of improved approximations. On the other hand, contrary to, e.g., [3,4,7], we do not consider periodic coefficients in the linear part because we want to avoid the Bloch wave machinery needed in this case; the results given here can be extended to this case, but this requires some technical effort.
b) A more complicated version of (1) contains quadratic terms, e.g.,
where , b sufficiently smooth with . A priori estimates and justification results for this quadratic case are typically obtained via normal form transforms [7,17,18,22], and require certain non-resonance conditions and a careful handling of the regularity loss in the normal form transforms. We believe that the methods from this paper can be transferred to the quadratic case, but for now restrict to the simpler cubic case.
c) Other standard versions of (1), (4) are obtained from setting
which yields
with . Note that (12) does not explicitly depend on ε, which makes this scaling somewhat more natural, but the ε-dependence of the initial conditions justifies the subscript ε in . The advantage of (1), (4) is that it is somewhat closer to the underlying physics: x, t and , are called the macroscopic and microscopic scales, respectively, and consequently (7) is called the macroscopic equation.
d) A related class of problem, see, e.g., [14,15] and the references therein, is given by the case of spatially non-oscillatory initial data, corresponding to in (4), for (typically) constant coefficient and gauge invariant nonlinear Klein–Gordon equations with . In this case, the solutions formally decompose into fast oscillations in time modulated by slow envelopes in time and space, which again can be described via solutions of NLS equations, and [14,15] give low regularity approximation results for this situation, which would essentially correspond to (and ) in (4). While our proofs below use pointwise in time a priori estimates on the difference between the formal approximation and the solution in some energy norms, the proofs in [14,15] are based on Strichartz estimates, which are not easily available in our case of highly oscillatory initial data () and non gauge invariant nonlinearity.
Another approach to derive and justify effective equations for problems with rapidly varying data is the method of two-scale convergence (2SC). The 2SC method for wave equations is based on the same formal calculation as the amplitude formalism, and on an a priori estimate like (3). On the linear level, terms involving the solution then have good convergence properties within the 2SC, and this allows to derive and justify a limiting equation in one step, for initial data . See, e.g., [3], where this has been carried out for a linear wave equation with , which moreover on the linear level is much more general than (1).
A justification of (7) for (1) for using methods from 2SC would be an important achievement, and an improvement of, or at least an alternative to, our results (8) and (9). However, 2SC is considerably harder for nonlinear equations, as so called strong 2SC is needed to obtain some information on nonlinear expressions involving . In [23], 2SC methods have been combined with multiple scales expansions to prove justification results for the homogenization of nonlinear Schrödinger equations with a large rapidly varying potential and highly regular initial data. The linear Schrödinger case with is considered in [2,5], and in [1, Remark 7.5] it is claimed that the results of [1,5] for the linear case in some special cases generalize to the nonlinear case. However, it appears that sufficient information to treat nonlinear terms in the framework of 2SC is not easily available for solutions of (1), and we failed to make 2SC methods work for (1), (4) without directly using the results (8) or (9), which in particular means that again we cannot deal with , .
The justification results
Our derivation and justification of (7) proceeds in two steps. First we plug the ansatz
into (1), where , , and denotes at this point unspecified higher order (in ε) terms. Sorting with respect to ε then yields a hierarchy of equations, to be successively solved. All terms vanish due to the dispersion relation , all terms vanish by the choice , and at we obtain
Clearly, the splitting into harmonics and in (15) is not self-evident, unless c is constant. However, based on the separation of scales between the arguments of v and of c, the standard procedure is to average the coefficients of in y over the periodicity cell of c, i.e., to require
which is the NLS (7) for v. Thus, the terms vanish in an averaged sense, i.e., at the so called residual has the form
The complete residual is
and it is easy to see that under mild conditions on v the terms are bounded in natural norms (see below).
Given a solution v of the NLS, the question is whether the ansatz (14) gives an approximation of a solution of (1) on an time-scale, which is the natural time-scale for the NLS evolution. In the context of the AF this problem of justification is usually dealt with as follows. Defining the (scaled) error via
we want to use a-priori estimates on the error equation
to show that ρ stays bounded in a suitable norm, on an time scale. In the energy-norm we obtain
and using a Gronwall argument it would be sufficient to have
to have bounded. However, already from (17) we see that (20) does not hold in general if v is in , say.
The improved residual
Our first main idea is to use an improved approximation in the form
with to be determined. First we note that if is bounded in and , then we have
such that by the triangle inequality it is sufficient to estimate .
Plugging into (1) we require, at , using (16),
where and are elliptic operators on . Thus, we need to solve
where . Even though has the one dimensional kernel spanned by constants, here we obtain with since
Similarly using a Fourier representation, we see that is invertible due to the non-vanishing imaginary part of the symbol for non-constant modes and due to the nonresonance condition for the constant term, hence we have .
If c is constant, then and , and such relations have been used since at least [11,12] to achieve small residuals. On the other hand, for non-constant c, equations related to (24) have often been solved via diagonalization in Fourier (or Bloch) space, see, e.g., [7]. The ansatz (21) instead of the less specific version gives the splitting (24) and thus avoids this diagonalization.
Our main idea is to carefully keep track of terms of low and high orders of derivatives in the residual, and the associated ε orders. This is aimed at approximating , , by sequences with , and trading some powers of ε for regularity.
Let,. There exists asuch that for all solutionsof (
7
) and allwe have
In the following we drop the notation of complex conjugate terms and suppress the explicit t dependence of v in the estimates, i.e., for instance, . The term involving the highest derivatives in is
where we replaced by the rhs of (7). Doing this once more we obtain a fourth derivative as the highest spatial derivative of v, i.e.,
Thus, by using and we have
The other terms involving are , and these can be estimated in a similar fashion to obtain
The remaining terms involving are of the form , such that their -norm can be estimated by
The lowest order terms in ε are due to residuals of the form
which can be estimated by
All other terms involving are of order and in a similar fashion can be estimated by
All further terms do not involve any derivatives of v and are of order or higher. Their -norm can be estimated by . Collecting the above estimates and estimating the linear terms in by yields (25). □
The error estimates
We now give two theorems that estimate the error under weak regularity conditions on . The first one, with yields the expected scaling with .
Let. For allandthere existsuch that for all solutionsof (
7
) with, and allthe following holds. Ifwhere,,,are bounded byin, then there exist a unique mild solutionof (
1
) with initial conditionsand, which can be written aswith
a) In 1D, given and , , the associated solutions of the NLS fulfill with . The constant is independent of for : In the defocussing case this follows directly from the conservation of the coercive Hamiltonian . For the focussing case we can use the mass conversation and the Gagliardo–Nierenberg estimate (see (37) below) to bound the negative part from below by , which is enough to give a uniform bound for . The general statement then follows with a result on the preservation of regularity, e.g. [24, Prop. 3.11]. The result is applicable as the bound implies uniform estimates. Note that the constant will in general grow exponentially in for . From [24, Prop. 3.8] we obtain local Lipschitz continuity with respect to initial conditions, i.e., for two solutions with IC and .
b) Given Lemma 2.2, a naive condition to close the a-priori estimate (19) would be , and our main contribution is to improve this by trading powers of ε in (25) for lower regularity of v. A similar idea has been used in [7] for the justification of the NLS for a generalization of (1) (in its rescaled form (12)) to periodic coefficients also for the linear terms, using a somewhat heavy machinery of Bloch wave transform and diagonalization of the linear part. Transferring back the result from [7] for (12), (13) to (1), (4) we obtain Theorem 2.3 under the condition , and thus a somewhat weaker result, with a significantly more complicated proof.
c) By requiring (31) we consider a special case of initial data (4), i.e., IC for wave-packets that move to the right, and which thus can be described by a single NLS equation for v. In general, e.g., for , the solution will decompose into two wave-packets, one moving left, and described by a NLS equation for , and one moving right described by a NLS equation for . As this is merely a question of book keeping, here we restrict to a single NLS.
d) Our choice of perturbations , ensures that , where local existence and uniqueness holds by, e.g., semigroup methods, or Duhamel’s formula. However, our theorem also provides the existence of a long-term solution.
Using (22) we work with the improved ansatz (21). First we want to estimate , but for we cannot use Lemma 2.2 directly. Instead we approximate by smooth using a cutoff in Fourier space at wave-number n, i.e.,
Here is the characteristic function of , and and denote the Fourier transform and its inverse, respectively, which is an isomorphism of and
Then in by Lebesgue dominated convergence, for all m, and for all there exists a such that .
We set where is the solution of the NLS (7) with IC , and start with the triangle estimate
From Remark 2.4(a) we have and hence
where the second estimate is obtained by choosing , since
It remains to estimate . Setting , the equation for reads
such that . Next,
and we need to estimate . By the (1D) Gagliardo–Nirenberg inequality
see, e.g., [6, Section 8.6.1], we obtain, with , and recalling that ,
as long as with a constant which depends on R, determined below, but not on ε.
As is bounded, from Lemma 2.2 we obtain , where again we used for . Thus,
Now we use that for any there exists an such that
Then, by Gronwall’s inequality, for ,
and we are done, i.e., for this R we find in (38) and then from (40). Combining (34) and (41) yields (32). □
We now state and prove a justification theorem when the envelope v is only in with . For solutions of (7) with we cannot ensure because one time derivative of v corresponds to two space derivatives and hence is equivalent to . Thus, we will bound the error in the scaled norm
The error is smallest for s close to 2 and the proof shows that a larger s also yields larger . The case cannot be treated in the same way due to the lack of a bound of the nonlinearity as in (47).
Letand. For alland, there existsuch that for all solutionsof (
7
) with, and allthe following holds. Ifwhere,,,are bounded byin, then there exist a unique mild solutionof (
1
) with initial conditionsand, which can be written aswith
We follow the same strategy as in the proof of Theorem 2.3, and approximate by smooth using the same cutoff in Fourier space at wave-number n. The difference in the proofs lies in the norm instead of , yielding first (45), and subsequently in (47), and after (49), which together yield the condition . We again set where is the unique solution of (7) with initial data , and start with the triangle inequality
By Lipschitz continuity with respect to initial conditions for (7) (see Remark 2.4(a)) we have , and hence
since .
To bound we again use the energy norm and the estimate (36). Letting , the equation for becomes
hence . With we obtain
as long as with a constant which depends on R, determined below, but not on ε.
As is bounded, from Lemma 2.2 we obtain
if and . Thus,
with , and
With we obtain that
is also bounded for . As above we use that for any fixed there exists an such that
Then, by Gronwall’s inequality, for ,
and the remainder of the proof works exactly as in the proof of Theorem 2.3. □
Footnotes
Acknowledgement
The authors are grateful for the support by The Royal Society (International Exchange reference IE131675).
References
1.
G.Allaire, Periodic homogenization and effective mass theorems for the Schrödinger equation, in: Quantum Transport, Lecture Notes in Math., Vol. 1946, Springer, Berlin, 2008, pp. 1–44. doi:10.1007/978-3-540-79574-2_1.
2.
G.Allaire and M.Palombaro, Localization for the Schrödinger equation in a locally periodic medium, SIAM J. Math. Anal.38(1) (2006), 127–142. doi:10.1137/050635572.
3.
G.Allaire, M.Palombaro and J.Rauch, Diffractive behavior of the wave equation in periodic media: Weak convergence analysis, Annali di Matematica188 (2009), 561–589. doi:10.1007/s10231-008-0089-y.
4.
G.Allaire, M.Palombaro and J.Rauch, Diffractive geometric optics for Bloch wave packets, Arch. Ration. Mech. Anal.202(2) (2011), 373–426. doi:10.1007/s00205-011-0452-9.
5.
G.Allaire and A.Piatnitski, Homogenization of the Schrödinger equation and effective mass theorems, Commun. Math. Phys.258(1) (2005), 1–22. doi:10.1007/s00220-005-1329-2.
6.
H.Brezis, Functional Analysis, Sobolev Spaces and Partial Differential Equations, Universitext, Springer, New York, 2011.
7.
K.Busch, G.Schneider, L.Tkeshelashvili and H.Uecker, Justification of the nonlinear Schrödinger equation in spatially periodic media, ZAMP57 (2006), 1–35.
8.
R.Carles, WKB analysis for nonlinear Schrödinger equations with potential, Comm. Math. Phys.269 (2007), 195–221. doi:10.1007/s00220-006-0077-2.
9.
C.M.de Sterke and J.E.Sipe, Envelope-function approach for the electrodynamics of nonlinear periodic structures, Phys. Rev. A38(10) (1988), 5149–5165. doi:10.1103/PhysRevA.38.5149.
10.
M.D.Groves and G.Schneider, Modulating pulse solutions for a class of nonlinear wave equations, Comm. Math. Phys.219(3) (2001), 489–522. doi:10.1007/s002200100423.
11.
L.A.Kalyakin, Asymptotic decay of a one-dimensional wave packet in a nonlinear dispersive medium, Math. USSR Sbornik60(2) (1988), 457–483. doi:10.1070/SM1988v060n02ABEH003181.
12.
P.Kirrmann, G.Schneider and A.Mielke, The validity of modulation equations for extended systems with cubic nonlinearities, Proc. of the Royal Society of Edinburgh122A (1992), 85–91. doi:10.1017/S0308210500020989.
13.
D.Lannes, Dispersion effects for nonlinear geometrical optics with rectification, Asymptotic Analysis18 (1998), 111–146.
14.
S.Machihara, K.Nakanishi and T.Ozawa, Nonrelativistic limit in the energy space for nonlinear Klein–Gordon equations, Math. Ann.322(3) (2002), 603–621. doi:10.1007/s002080200008.
15.
N.Masmoudi and K.Nakanishi, From nonlinear Klein–Gordon equation to a system of coupled nonlinear Schrödinger equations, Math. Ann.324(2) (2002), 359–389. doi:10.1007/s00208-002-0342-4.
16.
T.Schäfer and C.E.Wayne, Propagation of ultra-short optical pulses in cubic nonlinear media, Phys. D196(1–2) (2004), 90–105. doi:10.1016/j.physd.2004.04.007.
17.
G.Schneider, Justification of modulation equations for hyperbolic systems via normal forms, NoDEA5 (1998), 69–82. doi:10.1007/s000300050034.
18.
G.Schneider, Justification and failure of the nonlinear Schrödinger equation in case of non-trivial quadratic resonances, J. Diff. Eq.216(2) (2005), 354–386. doi:10.1016/j.jde.2005.04.018.
19.
G.Schneider and H.Uecker, Nonlinear coupled mode dynamics in hyperbolic and parabolic periodically structured spatially extended systems, Asymptot. Anal.28(2) (2001), 163–180.
20.
G.Schneider and H.Uecker, Existence and stability of exact pulse solutions for Maxwell’s equations from nonlinear optics, ZAMP54 (2003), 677–712. doi:10.1007/s00033-003-1048-5.
21.
G.Schneider and H.Uecker, The mathematics of light pulses in dispersive media, Jahresberichte der DMV109 (2007), 139–161.
22.
J.Shatah, Normal forms and quadratic nonlinear Klein–Gordon equations, Comm. Pure Appl. Math.38 (1985), 685–696. doi:10.1002/cpa.3160380516.
23.
C.Sparber, Effective mass theorems for nonlinear Schrödinger equations, SIAM J. Appl. Math.66(3) (2006), 820–842. doi:10.1137/050623759.
24.
T.Tao, Nonlinear Dispersive Equations, CBMS Regional Conference Series in Mathematics, Vol. 106, American Mathematical Society, Providence, RI, 2006.