In this work we consider a finite dimensional stochastic differential equation(SDE) driven by a Lévy noise , . The transition probability density , of the semigroup associated to the solution , of the SDE is given by a power series expansion. The series expansion of can be re-expressed in terms of Feynman graphs and rules. We will also prove that , has an asymptotic expansion in power of a parameter , and it can be given by a convergent integral.
A remark on some applications will be given in this work.
During last years a growing attention has been put on stochastic modeling techniques in a heterogeneous ensemble of applications. In a continuous interplay between theory, simulation, and concrete experiments, significant steps have been made reaching a better understanding of surrounding nature by means of mathematical models enriched by random components. Following this line of research, our work aims at reaching new frontiers of comprehension the asymptotic behaviour of the probability transition density , of the semigroup associated to the solution of a stochastic differential equation at first. We will discuss then possible applications to one of the most significant and challenging areas of the contemporary scientific research, namely: Neural networks, which is characterized by a huge amount of interconnected agents (neurons) acting in a rapidly changing environment, where informations (electrical signals) can be affected by a number of random influences.
It is essential to develop stochastic versions of deterministic models used in neural networks, see, e.g. [42]. For these purposes Stochastic Differential Equations (SDEs) driven by Gaussian noises have been studied intensively over the past 60 years and recently have been extended to Lévy type noises and even more general jump-diffusion processes, see, e.g., [1,2,4–7,9,10,12,15,18,27].
In the case of SDEs driven by Gaussian noise, transition densities are known and given by the Fokker–Planck equations, see, e.g. [8]. However, and to the best knowledge of the author, explicit formula for the transition densities for the Lévy processes, is not yet available and only upper and lower bounds for the densities are shown, see, e.g., [25,26,41].
In the current work we consider the following SDE:
where the stochastic process , takes values in and , is a Lévy process in generated by the triplet (more details about will be given in next section).
α, σ are globally Lipschitz continuous mappings, respectively from into itself and into the space of symmetric positive definite matrices.
Under suitable regularity on α and σ and assumptions on , , the solution , , , of the SDE (1) is known and is a time homogeneous Markov process, see, e.g. [20,33]. Moreover , has a transition density , , and we assume for , the mapping is continuous for . (For the existence of we refer to [1,16].)
To , we can associate the transition semigroup on by setting
( being the space of continuous functions on vanishing at infinity and equipped with supnorm.)
The transition semigroup is Markov and Feller, i.e., it maps into itself, see, [1]. We can associate a transition density , , to (for the existence of see, [34]).
Let G be the infinitesimal generator corresponding to defined by:
(such that the limit in (3) exists).
For , satisfies the jump-diffusion Kolmogorov equation and the generator G satisfies:
with and ν a Lévy measure on satisfying and .
Asymptotic expansions of transition probability density associated to a solution of stochastic differential equations driven by Gaussian noise are discussed in several papers, see, e.g., [8], their usefulness in many contexts including analysis, finance, quantum mechanics and statistical mechanics are pointed out in several papers, see, e.g. [8,19]. Transition probability densities associated to solutions , of the SDEs (1) are less studied, and to the best of our knowledge, no results gives an asymptotic expansion of the kernel , as proved in this work.
This paper is devoted to the search of series expansion of the transition probability density , . The perturbative expansion of will be given in the sense of formal power series in terms of Feynman graphs and rules, more details about such graphs can be found in [21,22,36,38,39]. We intend to prove an asymptotic expansion of in power of a parameter , the series will be given by a convergent integral. For this aim we rely essentially on Watson’s lemma and Borel summability, see, e.g., [40], and other recent results in [1,6–8].
The structure of this paper is as follows:
Section 2 is reserved to the basic concepts, technical assumptions and definitions useful for this work.
In Section 3 we recall an analytic expression of the transition probability density , , of the semigroup associated to the solution , of the SDE (1). The given power series expansion will be re-expressed in terms of Feynman graphs and rules.
Section 4 will be reserved to the main result of this paper, we prove an asymptotic expansion of the transition probability density , in power of a parameter .
Section 5 is devoted to some applications on neural networks.
Basic results, assumptions and definitions
During past years, Lévy processes are intensively studied due to their variety of applications. In the following we state some essential results and technical assumptions useful for this work.
Denote by a compensated jump measure, then will be defined as:
here π is the Lévy measure satisfying . More detail about Lévy measure can be found in [13,30,34].
The Lévy process , , is represented, see, [34], by
where m and k are constants and , is a standard Brownian motion on .
For the particular case , the Lévy process , is called a pure jump Lévy process denoted by , and defined on . is characterized by its characteristic function:
Moreover, the process can be represented as follows:
where B is a Borel set, i.e., .
A transition kernel on , is a family of mappings , , and , with values in and satisfying:
is a probability measure as a function of B for any fixed x;
is measurable in x for any fixed B;
;
(called semi-group or Chapman–Kolmogorov property).
Let be a probability space with a filtration . A stochastic process adapted to is said to posses the Markov property if for each , ,
A Markov process is a stochastic process which satisfies the Markov property with respect to its filtration.
Markov processes are characterized by transition probability functions, moreover if , is a Markov process then for ,
If , , is the transition semigroup of a homogeneous Markov process , , then by a result in [34] we have:
where Ψ is the Lévy characteristic given by:
Here and ν is a positive Lévy measure satisfying:
The transition probability density , of the solution , can be characterized by the Fourier transform:
Following Hartman and Wintner [24], existence of the transition probability density , associated to , is guaranteed in terms of the characteristic exponent of Ψ. To guarantee the smoothness of , we assume for :
Define the functional , by:
Assume , then by Kasminski lemma, see, e.g. [14], there exists constants such that
The construction of as given in (17) is not essential and we consider instead representation with respect to some signed measure μ, which is in Kato class , and the transition probability density by assuming .
The family of operators forms, since is Markov and Feller, the well known Feynman–Kac semigroup, given by
Following [25] and under some assumptions on the measure , it is shown that
Our aim is to show an asymptotic expansion of the transition probability density , as . We shall estimate both the transition probability of the solution of the SDE (1) and . Estimation of , has been discussed in several papers, see, e.g. [26,41]. However, and to the best of our knowledge, estimation of the expectation seems non-trivial and not yet available at least for the SDE of type (1). In the next section we will construct new approach through the Feynman graphs and algorithms to show an asymptotic expansion of as .
Expansion of the transition probability density
In this section we develop a formal power series expansion in the coefficient of the transition probability density , . The perturbative expansion will be re-expressed in terms of Feynman graphs and rules.
Let ψ be a continuous function on , then the following holds:
Let , , be n-real valued random variables defined on a probability space . Denote by the moments of ν under any partition of the set . The truncated moments functions are recursively defined by
where is the set of all partitions I of a finite set A into nonempty disjoint subsets .
Let , be the solution of the SDE (1), then there exists two independent processes, , and , , such that , . The process is stationary and , , , satisfying
The transition probability , of the semigroup is expanded in a formal power series as:
Expanding formally equation (20), we get
where in the last equality we used Lemma 3.1. □
Let S be a subset of , , . Assume that the measure ν has moments of all orders, then the transition probability density , of the semigroup is given by
where and , , are the Kronecker symbol.
If S is a subset of , from Lemmas 3.3 and 3.4 we have
where, in the last equality we used the correspondence between the ordinary and truncated moments w.r.t the measure ν as given in Definition 3.2. □
The expression of the transition probability density, , given by Proposition 3.5 is messy! Since the terms in the perturbative series becomes rapidly unwieldy and the series diverges, by means of the developed graph techniques we prove that it gives the meaning of asymptotic expansion in power of the parameter .
Our graph formalism consists of Feynman graphs and algorithms in which the graphs are formed by sets of edges connected by vertices that represents each terms in the expansion of , given by Proposition 3.5.
The truncated moments have also a graphical interpretation in our developed diagram.
A Feynman graph is a graph with inner, empty and outer vertices that are directed and with distinguishable edges. The set of all vertices(resp. edges) is denoted by V (resp. E).
For a vertex , the degree of v, noted by is defined as
where stands for the cardinal of a given set S.
The order n of a Feynman graph G is just the number of edges that are connected to the inner vertices. The set of such graphs will be denoted by , see, Fig. 1.
Example of Feynman graph with 5 inner vertices, one empty and one outer vertex.
For each Feynman graph we assign an algorithm which gives the analytic value of G, denoted by , , .
Let , and . The number is defined as follows:
For each inner vertex , , assign an index to each edge connected to .
For , multiply by , each outer vertex connected to the l-th edge.
Multiply by , each empty vertex with m legs, where .
Sum up over .
The transition probability density , of the semigroup associated to the solution , of the stochastic differential equation (1) is given by:
From Proposition 3.5 we have:
Now application of Definitions 3.7 and 3.8 concludes the argument. □
The truncated moments for the stationary process is given by:
where ;
Where δ is the Dirac distribution and Ψ the Lévy characteristic function given by equation (13).
Topologically, the Feynman algorithm, which gives the value of a given Feynman graph, have two forms: the form given by Definition 3.8 and another form evaluating the so called connected graphs. For the later, the truncated moments are simplified by a combinatoric factor (see, Definition 3.12, this is known in quantum field theory under the name of vacuum graphs which consists of closed graphs).
The graphical interpretation of Theorem 3.10, is that each empty vertex together with the edges connected to it is replaced by one edge ! (see Fig. 2(b)).
We denote the collections of connected Feynman graphs of n-th order by . We construct this collections of graphs as follows:
For a given Feynman graph . Let , we say that the graph connects q and j in notation , if the inner vertices and are connected in G. Clearly is an equivalence relation and if is the equivalence classes of then . For we denote by the connected Feynman graph with inner vertices such that , and we assign an analytic value to each connected graph denoted by .
Let and , we define the number as follows:
For each inner vertex , , assign an index to each edge connected to .
For , multiply by , each outer vertex connected to the l-th edge.
For m distinct ways of connecting edges to vertices that yields the same diagram, i.e., same topology, multiply by the combinatoric factor , where and are the coefficients given by Theorem 3.10.
Sum up over .
We have established the following result:
The transition probability density , of the semigroup associated to the solution , of the stochastic differential equation (1) is expressed in terms of connected Feynman graphs whose analytic values , are fixed in Definition 3.12 i.e.
Figures 2(a) and (b) give a construction of a connected Feynman graph and a graphical interpretation of Theorem 3.10.
(a) Example of a Feynman graph with 5 inner vertices, one empty vertex and one outer vertex. (b) Construction of a connected Feynman graph (from the graph in figure (a)) where the empty vertex together with the edges connected to it is replaced by one edge.
Asymptotic expansion of the transition probability density
The expansion of the transition probability density , of the semigroup associated to the solution , of the stochastic differential equation (1) given by Theorem 3.13 is divergent in general since the corresponding terms are rapidly unwieldy, however it can be given the meaning of an asymptotic expansion in the sense of Borel, see, e.g., [40]. In this section we recall the main result of this paper which proves the asymptotic behaviour of the transition probability density , as .
For f, g two functions defined on , we have: , as , if , such that .
The following result, due to George Neville Watson (1886–1965) is one of the significant result in the theory of asymptotic expansion:
(Watson’s Lemma).
Suppose , as and in some neighbourhood of , can be expanded as
where , for and . Then
has the asymptotic expansion
where Γ is the gamma function given by .
Let , be a given function. The series is said to be asymptotic to , if
i.e.,
Under the assumption given by equation (16) and for , there exists such that:
The proof can be found in [41], however we will develop other techniques by means of Fourier transform:
Since , is characterized by the Fourier transform, see, equation (15), and from assumption (16), the following holds:
the result is immediate by taking , ( is given by equation (16)). □
The transition probability density , given by Theorem 3.13 has an asymptotic expansion in power of β.
From Theorem 3.13 we have , ,it suffices then to prove:
But
where in the last inequality we used the result given by Lemma 4.4, the factor is a real number associated with the graphs , , is a partition of , and is the cardinal of .
Now if , then is nothing else then the “Bell numbers”, which counts in this case, the number of partitions of a set of size .
In the other hand from a result in [17], the authors proved that
From another result in [28], it has been shown that the Bell numbers are asymptotic to:
with defined by
where is the Lambert W-function, see, e.g. [23]. is also defined implicitly by
Putting together equation (37) and the properties of , we have , therefore
Hence
Where in the last inequality C is a positive constant and we used Stirling’s formula, see, e.g., [29]. □
The integral has the asymptotic expansion:
By Theorem 4.5, the transition probability density , has an asymptotic expansion in power of β, moreover it is expanded as:
where and for . Hence by Watson’s lemma, see, Theorem 4.2, we get:
The result is then immediate. □
Applications to stochastic and probabilistic neural networks
Artificial neural network (ANN) is an information processing that is inspired by the biological nervous system such as brain process information. In this section we will focus on two types of (ANN), namely stochastic and probabilistic neural networks.
Stochastic neural networks (SNN)
For SNN, the neurons will be modified either by stochastic transfer functions or given them stochastic weight.
To analyse the behaviour of models describing a network of interacting neurons, we apply our graph formalism, the graphs consists of m vertices (inner/outer and empty) and n edges where every vertex will be identified as a soma (body of the neuron cell) and every edge will represent a cylindrical which models the interaction between different neurons.
More precisely, we are concerned with a finite connected network, represented by a finite graph G with m edges and n vertices . Following, e.g. [31], we normalize and parametrize the edges to be identified with the interval and we denote again by the parametrization of the edge (for more details about the above notation we refer to Mugnolo and all., see, e.g. [31,32]).
The valency of each vertex is denoted by ; precisely:
The graph G will be described then by the incidence matrix , where and are given by
For functions , with an abuse of notation, we use the abbreviations to denote their values at 0 or 1, if or , respectively.
On G we consider the following system of stochastic diffusion equations:
In the above equations, and () are suitable smooth functions, while , are suitable polynomials with odd degree. Moreover, , represents the stochastic perturbation acting on each edges due to external surrounding, and is the distributional derivative of the process . Biological motivations lead to model this term by a Gaussian process. In fact, the evolution of the electrical potential on the molecular level can be perturbed by different types of random terms, each modeling the influence, at different time scales,of the surrounding medium. The generality of the above diffusion is motivated by the discussion in the biological literature.
The above equations shall be endowed with suitable boundary and initial conditions. Since we are dealing with a diffusion in a network, we require, following for example [31,32], a continuity assumption and a Kirchhoff’s law on every nodes:
Initial conditions at time are taken for simplicity to be of the form
Applying now the results recalled in Section 3, we can find the correlation functions which plays a central role in the Gaussian or Lévy process picture of wide networks (see, e.g. [11,42]). For instance if , is Gaussian process centered at , , , „ the correlation function of linear networks are expressed in terms of covariances:
As an example, for a linear networks with 2 hidden layers, and following Proposition 3.5 and Theorem 3.10 we will have:
Where the last equality holds by taking . Here being the Kronecker delta functions.
For multilayer network, if , is the probability density, where x is the network input, the out put will be then the estimate of .
For and from the results given in Section 3, the probability density , will be given by the following expansion:
Since the estimation of the probability density is done in discrete case (through estimation of graphs), the neural networks methods will be more realistic.
From the above example we notice that correlation functions of linear network are simplified and consequently the expansion of the transition probability density , can be approximated through the new simplified Feynman graphs and rules as follows:
For a correlation function of a network with d hidden layers. We define the set of all Feynman graph as follows:
For each inner vertex assign an index corresponding to each edge connected to it.
Multiply with each empty vertex connected to the m-th edge, where .
Probabilistic neural networks (PNN)
For the case of (PNN), the probability distribution of each class is approximated by a distribution. Our current work can be also applied, since from Definitions 2.1, 3.7 and Remark 2.3, a Markov process , can be given as a graph , where V (resp. E) is the set of vertices (resp. edges) and , the transition probabilities along the edges between the vertices. (See Theorem 3.13 for the expression of .) Therefore the PNN can be defined as follows:
Given an input values (vectors product and bias value), which passes through a linear function to produce output values, in our approach, the input can be connected to several vertices (neurons) yields a hidden layer, the later can be also connected to other layers to produce out put layer all together leads a PNN. (See Fig. 3.)
Example of probabilistic neural network.
If an input layer is connected to hidden layers and leads an output , the transition probabilities can be approximated, as done in this work, and consequently one can generate arbitrary number of input/output, see, e.g. [42]. A simple example arise when we consider the Gaussian case, the transition probabilities densities are given by:
Now application of Theorem 3.13, the graphical representation of the probabilistic neural network given by Fig. 3 becomes as in Fig. 4, and the transition probabilities densities will be then given by:
Example of PNN where the hidden layer 1 is connected directly to the output layer.
Footnotes
Acknowledgements
This work was supported by King Fahd University of Petroleum and Minerals under the project ♯ SB181034. The author gratefully acknowledges this support.
The author would like to thank Sergio Albeverio for many stimulating discussions.
References
1.
S.Albeverio, L.Dipersio, E.Mastrogiacomo and B.Smii, A class of Lévy driven SDEs and their explicit invariant measures, Pot. Anal.45(2) (2016), 229–259. doi:10.1007/s11118-016-9544-3.
2.
S.Albeverio, L.Dipersio, E.Mastrogiacomo and B.Smii, Explicit invariant measures for infinite dimensional SDE driven by Lévy noise with dissipative nonlinear drift I, Comm. Math. Sci.15(4) (2017), 957–983. doi:10.4310/CMS.2017.v15.n4.a3.
3.
S.Albeverio, H.Gottschalk and J.-L.Wu, Convoluted generalized white noise, Schwinger functions and their analytic continuation to Wightman functions, Rev. Math. Phys8(6) (1996), 763–817. doi:10.1142/S0129055X96000287.
4.
S.Albeverio, E.Lytvynov and A.Mahnig, A model of the term structure of interest rates based on Lévy fields, Stoch. Proc. Appl.114(2) (2004), 251–263. doi:10.1016/j.spa.2004.06.006.
5.
S.Albeverio, V.Mandrekar and B.Rüdiger, Existence of mild solutions for stochastic differential equations and semilinear equations with non-Gaussian Lévy noise, Stochastic Process. Appl.119(3) (2009), 835–863. doi:10.1016/j.spa.2008.03.006.
6.
S.Albeverio, E.Mastrogiacomo and B.Smii, Small noise asymptotic expansions for stochastic PDE’s driven by dissipative nonlinearity and Lévy noise, Stoch. Proc. Appl.123 (2013), 2084–2109. doi:10.1016/j.spa.2013.01.013.
7.
S.Albeverio and B.Smii, Asymptotic expansions for SDE’s with small multiplicative noise, Stoch. Process. App.125(3) (2015), 1009–1031. doi:10.1016/j.spa.2014.09.009.
8.
S.Albeverio and B.Smii, Borel summation of the small time expansion of some SDE’s driven by Gaussian white noise, Asymp. Anal.114 (2019), 211–223. doi:10.3233/ASY-191525.
9.
S.Albeverio, J.-L.Wu and T.S.Zhang, Parabolic SPDEs driven by Poisson white noise, Stoch. Proc. Appl.74 (1998), 21–36. doi:10.1016/S0304-4149(97)00112-9.
10.
O.E.Barndorff-Nielsen and N.Shephard, Non-Gaussian Ornstein–Uhlenbeck-based models and some of their uses in financial economics, J. R. Stat. Soc. Ser. B Stat. Methodol.63(2) (2001), 167–241. doi:10.1111/1467-9868.00282.
11.
A.Blake, P.Kohli and C.Rother, Markov Random Fields for Vision and Image Processing, The MIT Press, 2011.
12.
S.Boyarchenko, I.Svetlana and Z.Sergi, Non-Gaussian Merton–Black–Scholes Theory, Vol. 9, World Scientific, Singapore, 2002.
13.
Z.Brzeźniak and E.Hausenblas, Uniqueness in law of the Itô integral with respect to Lévy noise, in: Seminar Stoch. Anal., Random Fields and Appl., Vol. VI, Birkhauser, Basel, 2011, pp. 37–57.
14.
K.L.Chung and M.Rao, General gauge theorem for multiplicative functional, Trans. Am. Math. Soc.306 (1988), 819–836. doi:10.1090/S0002-9947-1988-0933320-1.
15.
R.Cont and P.Tankov, Financial Modelling with Jump Processes, Vol. 2, CRC Press, 2004.
16.
Ph.Courrège, Sur la forme intégro-différentielle des opérateurs de dans satisfaisant au principe du maximum”, Sém. Théorie du potentiel (1965/66) Exposé 2.
17.
N.G.De Bruijn, Asymptotic Methods in Analysis, North-Holland, Amsterdam, 1958, Zbl. 82,42 (2nd ed. 1975, 3rd ed., New York: Dover 1981).
18.
E.Eberlein and K.Glau, Variational solutions of the pricing PIDEs for European options in Lévy models, Appl. Math. Finance (2014).
I.I.Gihman and A.V.Skorohod, Stochastic Differential Equations, Springer-Verlag, Berlin–Heidelberg. New York, 1972.
21.
H.Gottschalk and B.Smii, How to determine the law of the solution to a SPDE driven by a Lévy space–time noise, Journal of Mathematical Physics43 (2007), 1–22.
22.
H.Gottschalk, B.Smii and H.Thaler, The Feynman graph representation of general convolution semigroups and its applications to Lévy statistics, Journal of Bernoulli Society14(2) (2008), 322–351. doi:10.3150/07-BEJ106.
23.
R.L.Graham, D.E.Knth and O.Patashnik, Concrete Mathematics, Addison-Wesley, 1994.
24.
P.Hartman and A.Wintner, On the infinitisimal generators of integral convolutions, Am. J. Math.64 (1942), 273–298. doi:10.2307/2371683.
25.
V.Knopova, On the Feynman–Kac semigroup for some Markov processes, Mod. Stoch: Theor. App.2 (2015), 107–129. doi:10.15559/15-VMSTA26.
26.
V.Knopova and R.L.Schilling, A note on the existence of transition probability densities of Lévy processes, Foru. Math.25 (2013), 125–149.
27.
A.Løkka, B.Øksendal and F.Proske, Stochastic partial differential equations driven by Lévy space–time white noise, Ann. Appl. Prob.14 (2004), 1506–1528. doi:10.1214/105051604000000413.
28.
L.Lovász, Combinatorial Problems and Exercises, 2nd edn, North Holland, Amsterdam, Netherlands, 1993.
29.
G.Marsaglia and J.C.W.Marsaglia, A new derivation of Stirling’s approximation of , Amer. Math. Monthly97 (1990), 826–829. doi:10.1080/00029890.1990.11995666.
30.
T.Meyer-Brandis and F.Proske, Explicit representation of strong solutions of SDEs driven by infinite dimensional Lévy processes, J. Theor. Prob.23 (2010), 301–314. doi:10.1007/s10959-009-0226-6.
31.
D.Mugnolo, Gaussian estimates for a heat equation on a network, Netw. Heter. Media2 (2007), 55–79. doi:10.3934/nhm.2007.2.55.
32.
D.Mugnolo and S.Romanelli, Dynamic and generalized Wentzell node conditions for network equations, Math. Meth. Appl. Sciences30 (2007), 681–706. doi:10.1002/mma.805.
33.
B.Øksendal, Stochastic Differential Equations: An Introduction with Applications, 6th edn, Springer-Verlag, Berlin Heidelberg, 2010.
34.
K.I.Sato, Lévy Processes and Infinite Divisibility, Cambridge University Press, 1999.
35.
L.S.Schulman, Techniques and Applications of Path Integrations, John Wiley and Sons, New York, 1981.
36.
J.-P.Serre, Trees, Springer, Berlin, 1980.
37.
T.Shiga, A recurrence criterion for Markov processes of Ornstein–Uhlenbeck type, Prob. Th. Rel. Fields.85 (1990), 425–447. doi:10.1007/BF01203163.
38.
B.Smii, A linked cluster theorem of the solution of the generalized Burger equation, Applied Mathematical Sciences. (Ruse)6(1) (2012), 21–38.
39.
B.Smii, A large diffusion expansion for the transition function of Lévy Ornstein–Uhlenbeck processes, Appl. Math. Inf. Sci.10(4) (2016), 1–8. doi:10.18576/amis/100434.
40.
A.D.Sokal, An improvement of Watson’s theorem on Borel summability, J. Math. Phys.21(2) (1980), 261–263. doi:10.1063/1.524408.
41.
R.Song, Two-sided estimates on the density of the Feynman–Kac semigroups of stable-like processes, Elec. J. Prob.11 (2006), 146–161. doi:10.1214/EJP.v11-308.
42.
C.Turchetti, Stochastic Models of Neural Networks, IOS Press, 2004.