Quantitative remote sensing is an appropriate way to estimate atmospheric parameters and structural parameters and spectral component signatures of Earth surface cover type. Since the real physical system that couples the atmosphere, water and the land surface is complicated, its description requires a comprehensive set of parameters, so any practical physical model can only be approximated by a limited mathematical model. The pivotal problem for quantitative remote sensing is inversion. Inverse problems are typically ill-posed; they are characterized by: (C1) the solution may not exist; (C2) the dimension of the solution space may be infinite; (C3) the solution is not continuous with variations of the observations. These issues exist for nearly all inverse problems in geosciences and quantitative remote sensing. For example, when the observation system is band-limited or sampling is poor, i.e. few observations are available or directions are poorly located, the inversion process would be underdetermined, which leads to a multiplicity of the solutions, the large condition number of the normalized system, and significant noise propagation. Hence (C2) and (C3) would be the difficulties for quantitative remote sensing inversion. This paper will address the theory and methods from the viewpoint that the quantitative remote sensing inverse problems can be represented by kernel-based operator equations and solved by coupling regularization and optimization methods. In particular, I propose sparse and non-smooth regularization and optimization techniques for solving inverse problems in remote sensing. Numerical experiments are also made to demonstrate the applicability of our algorithms.
Model-based inversion is quite important for quantitative remote sensing. Here, model-based inversion mainly refers to using physical or empirically physical models to infer unknown but relevant parameters. Hundreds of models related to atmosphere, vegetation and radiation have been established during recent decades (Liang, 2004), and model-based inversion in geophysical and atmospheric sciences is well understood. However, model-based inverse problems for earth surface studies have received attention only in recent years. Compared to modelling, model-based inversion is still in the exploration stage (Wang et al., 2009c) because intrinsic difficulties exist in the application of a priori information, inverse strategy and inverse algorithms. The development of hyperspectral and multiangular remote sensors has enhanced exploration and provided us more spectral and spatial information than before. However, to utilize the information to solve problems faced in quantitative remote sensing is still a challenge. Remote sensing inversion for problems in different areas is being paid more attention. In a series of international study projections, such as International Geosphere-Biosphere Programme (IGBP), World Climate Research Programme (WCRP) and NASA’s Earth Observing System (EOS), remote sensing inversion has become a focal point of study.
Model-based remote sensing inversion usually requires solving optimization problems with different constraints. Therefore, knowledge of how to incorporate the methods developed in operational research field into the remote sensing inversion field is needed. In quantitative remote sensing, since the real physical system that couples the atmosphere and the land surface is very complicated, sometimes it requires a comprehensive set of parameters to describe such a system, so any practical physical model can only be approximated by a model which includes only a limited number of the most important parameters that capture the major variation of the real system. Generally speaking, a discrete forward model to describe such a system is in the form:
where is single measurement, is a vector of controllable measurement conditions such as wave band, viewing direction, time, Sun position, polarization, and so forth, is a vector of state parameters of the system approximation, and is a function which relates with , which is generally non-linear and continuous.
With the ability of receiver sensors to acquire multiple bands, multiple viewing directions and so on, while keeping essentially the same, we obtain the following inhomogeneous equation:
where is a vector in , which is an M dimensional measurement space with M values corresponding to M different measurement conditions, is the vector of random noise with same vector length M. Assume that there are m undetermined parameters that need to be recovered. Clearly, if M = m and the system is linearly independent, (1.2) is a determined system, so it is not difficult to develop some suitable algorithms to solve it. If more observations can be collected than the existing parameters in the model (Verstraete et al., 1996), i.e. M > m, the system (1.2) is overdetermined. In this situation, the traditional solution does not exist. We must define its solution in some other meanings, for example, the least squares error (LSE) solution. However, for physical models with complete parameters for a single band, it is questionable whether remote sensing inversion can be overdetermined in the foreseeable future (Li et al., 1998). Therefore, inversion problems in geosciences seem to be always ill-posed (underdetermined) in some sense. Moreover, faced with an underdetermined system, the solution will be non-unique. In other words, there will be infinite solutions in the null space of solutions. To tackle the ill-posedness, proper a priori information must be found and involved into the inversion procedure to provide a useful approximate solution (Li et al., 2001).
Developed methods in literature for quantitative remote sensing inversion are mainly statistical methods with several variations from Bayesian inference (Combal et al., 2003; Pokrovsky and Roujean, 2002, 2003; Pokrovsky et al., 2003; Quaife and Lewis, 2010) and regularization methods (Phillips, 1962; Twomey, 1975; Wang et al., 2007a, 2008). According to Combal et al. (2003), there are three classes of algorithms for solving the inverse problems in remote sensing, including minimization algorithms, lookup tables (LUT) and neural networks (NNT). Many efforts have been made to invert the radiative transfer models using these algorithms (Goel and Strebel, 1983; Jacquemoud and Baret, 1993; Kuusk, 1991; Weiss et al., 2000) and to validate these algorithms (Combal et al., 2003; Knyazikhin et al., 1998a, 1998b; Privette et al., 1996; Weiss and Baret, 1999). We pay attention to the minimization algorithms in this paper. Minimization algorithms based on filtering techniques have been extensively studied (Cohn et al., 1994; Samain et al., 2008; Sarkka et al., 2004). We will show in this paper that the Tikhonov regularization can be also expressed as filtering methods. Using kernel expression, we analyse solution theory and methods for quantitative remote sensing inverse problems from an algebraic point of view. We first review regularization methods for retrieval of parameters, and then propose sparse/non-smooth inversion by total variation and sparse inversion in () spaces and introduce advanced optimization techniques. These methods, as far as we know, are novel to literature in the earth sciences.
The outline of the paper is as follows: in section II.1, we formulate the inverse problems by operator equations of the first kind and define the ill-posed nature of the problems. Sections II.2 and II.3 introduce two typical inverse problems in remote sensing; one is the linear kernel-based bidirectional reflectance distribution function (BRDF) model inversion, and another is the atmospheric aerosol particle size distribution function retrieval problem. Both problems are of great importance for calibration and for parameters retrieval. The ill-posed nature of both problems is explained in the respective sections. In section III, a simple linear algebraic system is introduced to explain the ill-posedness of the finite linear inverse problems. In section IV, the regularization theory and solution techniques for ill-posed quantitative remote sensing inverse problems are described. Beginning from the Bayesian inference, section IV.1 discusses constrained optimization; section IV.2 fully extends the Tikhonov regularization; section IV.3 discusses the conceptual regularization scheme formulated in the Bayesian statistical inference; in section IV.4, the direct regularization method based on spectrum decomposition for equality-constrained problem is introduced, and equivalence to the standard Tikhonov regularization is established by introducing filtering functions. In section V, the sparse regularization and optimization theory and solving methods are discussed for finding an optimized solution of a minimization model. Section V.1 develops a total variation method for the ill-posed inverse problem; section V.2 discusses sparse and non-smooth inversion in space; section V.3 discusses some computational issues in inversion. In section VI, the applications of regularization and optimization solution methods for retrieval of land surface parameters and aerosol particle size distribution functions are presented. Finally, in section VII, some concluding remarks are given.
II Linear kernel-based models and ill-posedness
1 General model of the first kind operator equation
Inverse problems are usually formulated as operator equations of the first kind. We consider the first kind operator equation in the general form:
which is an appropriate expression for an observing system, with the response function (linear or non-linear) from Hilbert space to Hilbert space , the unknown input and y the observed data. Particularly, if is a linear mapping, we will denote the response system as:
which is clearly a special case of (2.1). We will also use as an operator in infinite spaces sometimes, and a matrix sometimes. We assume that readers can readily recognize them.
The problem (2.1) is said to be properly posed or well-posed in the sense that it has the following three properties:
(C1) There exists a solution of the problem, i.e. existence;
(C2) There is at most one solution of the problem, i.e. uniqueness;
(C3) The solution depends continuously on the variations of the right-hand side (data), i.e. stability.
The condition (C1) can be easily fulfilled if we enlarge the solution space of the problem (2.1). The condition (C2) is seldom satisfied for many indirect measurement problems. More than one solution may be found for the problem (2.1) and information about the model is missing. In this case, a priori knowledge about the solution must be incorporated in the model. Stability is the most important requirement. If the problem (2.1) lacks the property of stability, then the computed solution will be far from the true solution since the practically obtained solution is contaminated by unavoidable errors. Therefore, there is no way to overcome this difficulty unless additional information about the solution is available. Again, a priori knowledge about the solution should be involved.
If equation (2.2) is well-posed, then has a well-defined, continuous inverse operator . In particular, for any and . In this case both the algebraic nature of the spaces and the topologies of the spaces are ready to be employed.
2 Linear kernel-based BRDF model
As an example of linear kernel-based models in geosciences, we consider a well-refereed model. It is based on the assumption that the anisotropy of the land surface can be described by the bidirectional reflectance distribution function (BRDF). With the progress of the multiangular remote sensing, it seems that the BRDF models can be inverted to estimate structural parameters and spectral component signatures of Earth surface cover type (Roujean et al., 1992; Strahler et al., 1994). The state-of-the-art of BRDF is the use of the linear kernel-based models, mathematically described as the linear combination of the isotropic kernel, volume scattering kernel and geometric optics kernel. Information extraction on the terrestrial biosphere and other problems for retrieval of land surface albedos from satellite remote sensing have been considered by many authors in recent years; see, for instance, the survey papers on the kernel-based bidirectional reflectance distribution function (BRDF) models (Combal et al., 2003; Pokrovsky and Roujean, 2002, 2003; Pokrovsky et al., 2003; Quaife and Lewis, 2010) and references therein. Computational stability is characterized by the algebraic operator spectrum of the kernel-matrix and the observation errors. Therefore, the retrieval of the model coefficients is of great importance for computation of the land surface albedos.
The linear kernel-based BRDF model can be described as follows (Roujean et al., 1992):
where is the bidirectional reflectance; the kernels and are so-called kernels, i.e. known functions of illumination and of viewing geometry which describe volume and geometric scattering, respectively; and are the zenith angle of the solar direction and the zenith angle of the view direction, respectively; is the relative azimuth of sun and view direction; and , and are three unknown parameters to be adjusted to fit observations. Physically, , and are closely related to the biomass such as leaf area index (LAI), Lambertian reflectance, sunlit crown reflectance, and viewing and solar angles. The vital task then is to retrieve appropriate values of the three parameters.
Generally speaking, the BRDF model includes kernels of many types. However, it was demonstrated that the combination of RossThick () and LiSparse () kernels had the best overall ability to fit BRDF measurements and to extrapolate BRDF and albedo (e.g. Li et al., 1999; Privette et al., 1997; Wanner et al., 1995). A suitable expression for the RossThick kernel was derived by Roujean et al. (1992). It is reported that the LiTransit kernel , instead of the kernel , is more robust and stable than LiSparse non-reciprocal kernel and the reciprocal LiSparse kernel (LiSparseR) where the LiTransit kernel and the LiSparse kernel are related by:
To use the combined linear kernel model, a key issue is to numerically solve the inverse model in a stable way. However, it is difficult to do in practical applications due to the ill-posed nature of the inverse problem.
a Ill-posedness
Note that (2.3) is a linear model in finite spaces, therefore it is easy to rewrite it into a finite rank operator equation:
by setting and with the entries , where y is the measurement data. The inverse problem is to recover the model parameters given the limited measurement data y.
Numerically, the discrete ill-posedness lies in that the operator may be inaccurate (can only be approximated), and the model is usually underdetermined if there are few observations or poor directional range, or the observations are highly linearly dependent and noisy. For example, a single angular observation may lead to an underdetermined system whose solutions are infinite (the null space of the kernel operator contains non-zero vectors) or the system has no solution (the rank of the coefficient matrix is not equal to the augmented matrix). In practice, random uncertainty in the reflectances sampled translates into uncertainty in the BRDF and albedo. We note that noise inflation depends on the sampling geometry alone. For example, for MODIS and MISR sampling they vary with latitude and time of year; but for kernel-based models they do not depend on wavelength or the type of BRDF viewed. Therefore, the random noise in the observation (BRDF) and the small singular values of control the error propagation.
3 Aerosol particle size distribution function inversion
It is well known that the characteristics of the aerosol particle size, which can be represented as a size distribution function in the mathematical formulation, say , play an important role in climate modelling due to their uncertainty (Houghton et al., 1996). So, the determination of particle size distribution function becomes a basic task in aerosol research (Bockmann, 2001; Bockmann and Kirsche, 2006; Bohren and Huffman, 1983; Davies, 1974; McCartney, 1976; Twomey, 1977). The particle size distribution is usually retrieved by extinction measurements using a sun-photometer. The attenuation of aerosols can be written as the integral equation of the first kind:
where is the particle radius; is the columnar aerosol size distribution (i.e. the number of particles per unit area per unit radius interval in a vertical column through the atmosphere); is the complex refractive index of the aerosol particles; is the wavelength; is the aerosol optical thickness (AOT); is the error/noise; and is the extinction efficiency factor from Mie theory. The AOT can be obtained from the measurements of the solar flux density with sun-photometers; therefore, one can retrieve the size distribution by the inversion of AOT measurements through the above equation. Equation (2.5) can be simply expressed by the operator equation:
where the operator is specified by the kernel function .
a Ill-posedness
The particle size distribution model (2.6) is a linear model in infinite spaces. It is clear to see that the kernel function is continuous, differentiable and bounded on ; therefore the operator is compact. According to the definition of the ill-posedness and the operator theory (Tikhonov and Arsenin, 1977; Xiao et al., 2003; Yosida, 1999), the ill-posedness of the aerosol problem is self-evident because at least one of the three items for well-posed problems is violated.
III A simple ill-posed mathematical example
Let us consider a simple linear algebraic system:
where ,. It is clear to see that is positive semi-definite, and the equation has an infinite number of solutions. However, the normal solution can be found, i.e. . If we perturb the matrix a little, i.e. , where is a diagonal matrix in the form of , is a typically small number and , in this case there is a unique solution of (3.1), i.e..
Let us look at another example: consider the same linear system (3.1), where , . It is clear that the equation has no solution at all. However, the normal solution can again be found, i.e..
The simple example indicates that if the nearest two observations are linearly dependent, the number of solutions may be infinite, non-unique or non-existent, i.e. the ill-posedness is fulfilled. In such case, we can only obtain the solution with minimum energy. To tackle the difficulties, we have to resort to regularization.
IV Regularization methods
For ease of notation, from this section to the end of the paper, we use to represent the linear system, where refers to the operator either from kernel-based BRDF model (2.3) or from the aerosol particle attenuation equation (2.5).
Bayesian inference provides a conceptually simple process for updating uncertainty in the light of evidence. Initial beliefs about some unknown quantity are represented by a priori distribution. According to Bayesian rule, the estimated parameter could be expressed as a posteriori probability :
where is the a priori probability distribution about the parameter , is the a priori probability distribution about the data, and describes the probability distribution function about the discrepancy between the model and the data. Using logarithm, we obtain the following equation:
Therefore, the maximum a posteriori probability can be obtained by minimizing the following function:
Information in the data is expressed by the likelihood function . Therefore, minimization of the function is equivalent to maximization of the likelihood function . The a priori distribution and the likelihood function are then combined to obtain the posterior distribution for the quantity of interest. The a posteriori distribution expresses our revised uncertainty in light of the data, in other words, an organized appraisal in the consideration of previous experience. For example, we can choose as:
where is a constant related to the number , is the variance related to data and . In the following context, we will see how the Bayesian inference is related.
1 Applying constraints to solutions
According to Bayesian inference theory, for effective inversion of the ill-posed kernel-driven model, we have to impose an a priori constraint to the interested parameters. This leads to solving a constrained minimization problem:
where denotes an object function, which is a function of , is the constraint to the solution , and and are two constants which specify the bounds of . Usually, is chosen as the norm of with different scales. If the parameter comes from a smooth function, then can be chosen as a smooth function, otherwise can be non-smooth.
The constraint can be smooth (e.g. Sobolev stabilizer) or non-smooth (e.g. total variation or norm () based stabilizer). A generically used constraint is the smoothness. It assumes that physical properties in a neighbourhood of space or in an interval of time present some coherence and generally do not change abruptly. Practically, we can always find regularities of a physical phenomenon with respect to certain properties over a short period of time (Wang et al., 2007b, 2008). The smoothness has been one of the most popular a priori assumptions in applications. The general framework is the so-called regularization which will be explained in the next section.
2 Regularization: incorporate a priori information naturally
Most inverse problems in real environment are generally ill-posed. Regularization methods are widely used to solve such ill-posed problems. The complete theory for regularization was developed by Tikhonov and his colleagues (Tikhonov and Arsenin, 1977; Tikhonov et al., 1995). For the discrete model (2.4), we suppose the right-hand side is the measurement with noise which represents the bidirectional reflectance or the AOT. The Tikhonov regularization method is to solve a regularized minimization problem:
instead of solving:
In (4.6), is the regularization parameter and D is a positively (semi-)definite operator. By a variational process, the minimize of (4.6) satisfies:
or can be written as . It is clear to see that the Tikhonov regularization is a special case of Bayesian inference if we take in equation (4.4) and choose the a priori distribution as the 2-norm. The operator D is a scale matrix which imposes smoothness constraint to the solution. The scale operator D and the regularization parameter can be considered as some kind of a priori information; see Wang et al. (2008, 2009c) for details.
Choosing the regularization parameter can be performed in an a priori way or an a posteriori way. The a priori way means setting fixed values of parameter in calculations, while the a posteriori way means determining an optimal regularization parameter. We recall a well-known discrepancy principle discussed in Wang et al. (2008) that the optimal value of the regularization parameter should be the solution of the non-linear equation:
where indicates that the unknown parameter is related to the regularization parameter, and is the upper bound of error in the observation. Once an optimal value , i.e. the root of the above equation, is found, the optimal parameter is obtained. The root is easy to find through Newton’s iterative method:
where with a pre-assigned scale matrix and . It is clear that Phillips-Twomey’s regularization shares similarity with Tikhonov’s regularization and can be written in consistent form. The form of the operator is determined by the norm of the second differences . However, the matrix is badly conditioned and will cause numerical instability. We consider choosing a proper from Sobolev space, i.e. in (4.6) the matrix is determined from discretization of , where the inner product of two functions and in space is defined by:
According to the Sobolev imbedding theorem, this form of regularization possesses good convergence property and stability and can yield a smooth solution (Tikhonov and Arsenin, 1977; Xiao et al., 2003).
3 Statistical regularization
The role of Bayesian statistics is very similar to the role of regularization (Wang et al., 2007b). Now we establish the relationship between the Bayesian estimation and the regularization. A continuous random vector is said to have a Gaussian distribution if its joint probability distribution function has the form:
where , is an N-by-N symmetric positive definite matrix, and det(·) denotes the matrix determinant. The mean is given by and the covariance matrix is .
Suppose is a Gaussian distribution with mean and covariance , where is the noise covariance of the observation noise and model inaccuracy. Then by (4.13) we obtain:
From (4.13), the prior probability distribution is given by:
By Bayesian statistical inference and the above two equations, we obtain an a posteriori log likelihood function:
where is constant with respect to . The maximum a posteriori estimation is obtained by maximizing (4.15) with respect to :
The easiest way of choosing and is by setting , and then (4.16) becomes:
where , which is the noise-to-signal ratio. It is clear that the solution obtained by maximum a posteriori estimation has the same form as the solution of the Tikhonov regularization when the regularization parameter is set to be a priori value.
Remark 4.2: Recently, Quaife and Lewis (2010) considered constrained least squares solutions. They imposed an expectation of the temporal behavior of the parameter in the form:
where specifies the required constraint, is a weighting matrix with , is the observation covariance matrix and satisfies . In a special case, e.g. choosing as an identity matrix , to be zero, the above expression leads to:
If choose , we have:
which is equivalent to (4.8) if we set , and . However, in this formation, the Lagrangian parameter has to be considered in an a priori way. In Quaife and Lewis (2010), the authors considered setting in the amplitude of . And it is clear that it requires many numerical simulations to give statistical information of a quasi-optimal value of the Lagrangian parameter .
Remark 4.3: Combal et al. (2003) developed a quasi-Newton algorithm for searching for the maximum likelihood estimation of the probability density function of the canopy biophysical variables through the model:
where is the simulated BRDF, is the estimated a priori value to the retrieval, is the covariance matrix of the variable values. In calculation, the authors preferred and diagonal matrices and a sum of the root mean square error of the radiometric measurements and the canopy variables a priori information. It is reported that the quasi-Newton algorithm can yield good convergence. According to our experience, the quasi-Newton method relies on the initial guess value of . Improper choice of the guess value of may lead to divergence.
4 Direct regularization: spectrum decomposition method
We now discuss a special Tikhonov regularization method: direct regularization. Instead of standard Tikhonov regularization, the spectrum decomposition method aims to solve an equality constrained problem:
As mentioned already, the ill-posedness is largely due to the small singular values of the linear operator. Let us denote the singular value decomposition (SVD) of as , where both and are orthonormal matrices, i.e. the products of with its transpose and with its transpose are both identity matrices; is a diagonal matrix whose non-zero entries consist of the singular values of . The traditional least-squares error (LSE) solution of the constrained optimization system (4.22) can be expressed by the singular values and singular vectors in the form . If the rank of is , then the above solution form inevitably encounters numerical difficulties, since the denominator contains numerically infinitesimal values. Therefore, to solve the problem by the SVD, we must impose a priori information. We consider another way of incorporating a priori information to the solution. The idea is quite simple: instead of filtering the small singular values by replacing the small singular values with small positive numbers, we just make a truncation of the summation, i.e. the terms containing small singular values are replaced by zeroes. Note that in practice may not be exactly rank deficient, but instead be numerically rank deficient, i.e. it has one or more small but non-zero singular values such that . Here, refers to the numerical rank of a matrix (for details, see Wang et al., 2006b, 2007b). In this way, we obtain a regularized solution of the least squares problem:
It is easy to see that the numerically truncated singular value decomposition method is a special case of the model (4.5) by choosing special functions and . If we define the filter function of the standard Tikhonov regularization (4.6) with an identity as , and the filter function of the singular value decomposition as , then the regularized solution can be written as and , respectively. Therefore, the singular value decomposition method corresponds to the standard Tikhonov regularization with an identity and a priori value of the parameter .
V Sparse and non-smooth regularization with optimization
1 Total variation regularization
We assume that the variations of the parameters are sparse; this leads to the total variation inversion. The total variation of a continuous function defined on the interval [0,1] is given by:
or:
where . This corresponds to a sparse representation of variations of the function in L1 space. Note that the unknowns may be sparse and non-smooth; therefore, we consider choosing the stabilizer as . Considering the non-smoothness of the stabilizer and practical computer simulations, we obtain an approximate regularization model as follows (Wang, 2007):
where is the discrete form of
is an approximation to the total variation of the surface parameter x, and is the regularization parameter. Note that and in (5.3) have been in the discrete form, hence in computation the remaining problem is the discretization of . The discretization of can be realized by setting , then the parameter can be discretized as and for all i form an matrix. For solving the above minimization problem, we employ the Gauss-Newton method (Yuan, 1993). For simplicity of notation, we set . Denote the gradient of as , as and the gradient of as , and denote the Hessian of , and as , and , respectively, then the solution in each iteration can be written as:
where is the inverse of
, and and can be written as:
and:
respectively. Details about calculation of and are given in the Appendix. Note that the derivative of the function is greater than zero for any , must be symmetric and definite, hence there is a unique solution in the above equation.
The assumptions about the variations of the parameters are sparse are difficult to check. Therefore, this kind of regularization method needs further examination before commercial development.
2 Sparse inversion in lp−lq spaces
Ill-posedness is the intrinsic feature of the inverse problems. Unless some additional information/knowledge such as monotonicity, smoothness, boundedness or the error bound of the raw data are imposed, the difficulty may be insuperable. A more general regularization model is recently proposed in Wang et al. (2009a); where the authors considered a regularization model in general form:
where which are specified by users, is the regularization parameter, D is the scale operator, and is an a priori solution of the original model. It is clear to see that the regularization model (5.5) is also corresponding to Bayesian inference if we take -norm in equation (4.4) and choose the a priori distribution as the -norm. This formulation includes most of the developed methods. Particularly, for and or , the model represents non-smooth and sparse regularization, which is relevant to the important topic of compressive sensing for decoding in signal processing. A regularizing active set method was proposed both for quadratic programming and for non-convex problems; see Wang et al. (2009a) for details. In the following, we consider the case of and .
Generally speaking, the kernel-driven BRDF model is semi-empirical, and the retrieved parameters are mostly considered as a kind of weight function though it is a function of biophysical parameters. Therefore, is not necessarily positive. However, since it is a weight function, an appropriate arrangement of the components of can yield the same results. That is to say, can be artificially fixed as non-negative. The remaining problem is to develop some proper methods to solve the ‘artificial’ problem, whereas for aerosol particle size distribution function, the non-negativity of x (corresponding to n(r)) is self-evident. We recall that the -norm minimization method, sometimes referred to as sparse-spiky regularization, is an important issue in geophysics community. This is because the -norm could penalize outliers and large amplitude anomalies. However, -norm has a singular problem when the values of residual vanish. Even if the values of residuals are not zero, the numerical inversion process goes to failure at a very small residual. To tackle this difficulty, our new meaning to the solution is related to the -norm problem:
which automatically imposes a priori information by considering the solution in space. Because of the limitations of the observation system, one may readily see that the recovered land surface parameters are discrete and sparse. Therefore, if an inversion algorithm is not robust, the outliers far from the true solution may occur. In this situation, the a priori constrained minimization may work better than the conventional regularization techniques. The model (5.6) can be reduced to a linear programming problem (Wang et al., 2005, 2009b; Ye, 1997; Yuan, 2001), hence linear programming methods can be used for solving (5.6).
The -norm solution method is seeking a feasible solution within the feasible set . So it is actually searching for an interior point within the feasible set , hence is called the interior point method. The dual standard form of (5.6) is in the form:
where is a new variable and is a vector with all components equalling 1. Therefore, the optimality conditions for to be a primal-dual solution triplet are that:
where , and , are components of vectors s and , respectively. The notation denotes the diagonal matrix whose only non-zero components are the main diagonal line.
The interior point method generates iterates such that and . As the iteration index k approaches infinity, the equality-constraint violations and and the duality gap are driven to zero, yielding a limiting point that solves the primal and dual linear problems. Since the primal-dual interior point methods uses the central path to find a solution of the -norm problem, therefore, the solution is just picked one from the solution space, which sometimes may not be the wanted solution for specific problems (Wang et al., 2007a); in that case, the -norm solution served as an a priori estimate of the true solution. However, for the parameter retrieval from kernel-driven BRDF model, the method always works. For the implementation procedures and examples about using the algorithm, please refer to Wang et al. (2007a, 2009b) for details.
For general values of and , algorithms are ready to be made, this can be easily recognized that the gradient and Hessian of can be evaluated as:
where is defined by ; diag (·) again denotes the vector diagonalization; is the symbol function, defined by .
3 Discussion
Another problem for the sparse/non-smooth regularization of ill-posed parameter retrieval problems is the proper choice of the regularization parameters. Although a posteriori choice of the regularization is favourable, it needs to know the noise levels of the data and solve a smoothed non-linear equation (Wang, 2007; Wang and Xiao, 2001). For the sparse/non-smooth regularization, we set the values of the regularization parameter to be a priori. Many optimization methods can be used for solving the minimization problems (5.3), (5.5) and (5.6) except for the Gauss-Newton method and the linear programming method, e.g. the quasi-Newton method and its variations, gradient descent methods (Wang, 2007; Wang and Yuan, 2005). For the optimizing algorithms, a technical problem is the stopping principle for the iterations. Ideally, it requires as many iterative steps as possible to get an optimal approximation. However, there is a saturation state of errors for inverse problems, i.e. there exists a fixed iterative step, beyond which further iterations will bring less useful information to the solution (Xiao et al., 2003). Therefore, for different optimizing algorithms, stopping rules must be properly studied; we expect that the stopping rules are related with uncertainties in the observations.
VI Experiments
1 Retrieval of linear BRDF model parameters
We use the combination of RossThick kernel and LiTransit kernel in the numerical tests. In practice, the coefficient matrix cannot be determined accurately, and a perturbed version is obtained instead. Also, instead of the true measurement , the observed measurement is the addition of the true measurement and the noise , which for simplicity is assumed to be additive Gaussian random noise. Therefore it suffices to solve the following operator equation with perturbation:
where for some perturbation matrix and denotes the noise level (upper bound) of in (0,1). In our numerical simulation, we assume that is a Gaussian random matrix, and also that . The above assumption about the noise can be interpreted as that the signal-to-noise ratio (SNR) should be greater than 1. We make such an assumption as we believe that observations (BRDF) are not trustworthy otherwise. It is clear that (6.1) is an underdetermined system if and an overdetermined system if . Note that for satellite remote sensing, because of the restrictions in view and illumination geometries, does not need to have bounded inverse (Li et al., 2001; Verstraete et al., 1996; Wang et al., 2007b, 2008). We believe that the proposed optimization and regularization method can be employed to find an approximate solution satisfying .
We choose the widely used 73 data sets referred to by Li et al. (2001). These data sets cover a large variety of vegetative cover types, and are fairly well representative of the natural and cultivated vegetation. Table 1 summarizes the basic properties of the data sets used in this paper. To show the robustness of the developed methods introduced in this paper, we consider both the extreme case, i.e. only one observation is considered as limited number of observations and full data, and compare the retrieval results by different regularization methods. Comparison results are given in Tables 2 and 3. In these tables, NTSVD refers to numerically truncated singular value decomposition; Tikhonov refers to standard Tikhonov regularization in space; l1 sparse refers to regularization in spaces and TV regularization refers to total variation regularization method. The true white-sky albedo (WSA) is calculated from well-posed situations using AMBRALS (Algorithm for MODIS (Moderate Resolution Imaging Spectroradiometer) Bidirectional Reflectance Anisotropies of the Land Surface) (see Strahler et al., 1999), i.e. full observation data. It should be pointed out that the standard operational algorithm used in AMBRALS does not work for such severely ill-posed situations. If we regard WSA>1 or WSA<0 as failed inversion, it is clear that our proposed methods work for all of the cases.
Data sets used in the simulation
Data
Cover type
LAI
ranson soy.827
Soy
2.9
kimes.orchgrass
Orchard grass
1
parabola.1994.asp-ifc2
Aspen
5.5
Comparison of computational values of the WSAs from data sets in Table 1 for single observation with the true WSAs values (multiangular observations) for VisRed band
Data
Methods
Single observation
True WSAs
ranson soy.827
NTSVD
0.0449047
Tikhonov
0.0405936
Tikhonov
0.0401528
l1 sparse
0.0347181
TV regularization
0.0314538
kimes.orchgrass
NTSVD
0.1082957
0.0783379
Tikhonov
0.0753925
TV regularization
0.0771534
parabola.1994.asp-ifc2
NTSVD
0.0364620
0.0227972
Tikhonov
0.0262501
l1 sparse
0.0354550
TV regularization
0.0341585
Comparison of computational values of the WSAs from the data sets in Table 1 for single observation with the true WSAs values (multiangular observations) for Nir band
Data
Methods
Single observation
True WSAs
ranson soy.827
NTSVD
0.4469763
0.3653728
Tikhonov
0.3996775
l1 sparse
0.3543681
TV regularization
0.3510839
kimes.orchgrass
NTSVD
0.3890207
0.2963261
Tikhonov
0.2708260
l1sparse
0.2804019
TV regularization
0.2834896
parabola.1994.asp-ifc2
NTSVD
0.5517209
0.4240376
Tikhonov
0.3972022
l1 sparse
0.5434159
TV regularization
0.5421131
To show how the parameter estimation evolves with increasing number of data, it is sufficient to show the performance of the Tikhonov regularization method. We partition the data sets of each type into six parts, and then accumulate the data with increasing number of observations. Certainly, we can partition the data sets piece by piece into small parts, but for high-volume data it costs time and is unnecessary. The error bars of the retrieved WSAs for VisRed band and Nir band for Kimes orchgrass are illustrated in Figures 1 and 2, respectively (the patterns are similar for Ranson soy and Parabola aspen data). The figures reveal that the retrieved results more closely approximate the true solutions with increasing number of observations. This phenomenon indicates that our algorithm is stable and applicable for parameter inversion, and at the same time it indicates that the accumulation of data (information) is quite important for land surface parameter estimation.
Error bars of the WSAs for VisRed band with increasing number of Kimes orchgrass data
Error bars of the WSAs for Nir band with increasing number of Kimes orchgrass data
Next, we show that the regularization method described herein works for satellite data even with limited observations. As an example, we use atmospherically corrected moderate resolution imaging spectroradiometer (MODIS) 1B product acquired on a single day as an example of single observation BRDF at certain viewing direction. Each pixel has different view zenith angle and relative azimuth angle. The data MOD021KM.A2001135–150 with horizontal tile number (26) and vertical tile number (4) were measured in Shunyi county of Beijing, China. The three parameters are retrieved by using this 1B product. Figure 3 plots the reflectance for band 1 of a certain day DOY=137. In MODIS AMBRALS algorithm, when insufficient reflectances or a poorly representative sampling of high-quality reflectances are available for a full inversion, a database of archetypal BRDF parameters is used to supplement the data and a magnitude inversion is performed (Strahler et al., 1999; Verstraete et al., 1996). We note that the standard MODIS AMBRALS algorithm cannot work for such an extreme case, even for MODIS magnitude inversion, since it is hard to obtain seasonal data associated with a dynamic land cover in a particular site. But our method still works for such an extreme case because smoothness constraint is implanted into the model already. We plot the white-sky albedos (WSAs) retrieved by NTSVD, Tikhonov regularization, l1 sparse and total variation regularization for band 1 of one observation (DOY=137) (e.g. Figure 4; the other images are similar). From these images we see that the albedo retrieved from insufficient observations can preserve the albedo details. We conclude that these developed regularization methods can be considered useful methods for retrieval of land surface parameters and for computing land surface albedos, and can be considered as supplement algorithms for the robust estimation of the land surface BRDF/albedos.
Reflectance for band 1 of MOD021KM.A2001137
White-sky albedo retrieved by total variation regularization method
We want to emphasize that our method can generate smoothing data for helping retrieval of parameters once sufficient observations are unavailable. As we have pointed out in Wang et al. (2007b, 2008) we do not suggest discarding the useful history information (e.g. data that is not too old) and the multiangular data. Instead, we should fully employ such information if it is available. The key to why our new algorithms outperform previous algorithms is because our algorithms are adaptive, accurate and very stable, which solves kernel-based BRDF model of any order, which may be a supplement for BRDF/albedo retrieval product.
For the MODIS sensor this is not a strict restriction, since it aims at global exploration. For other sensors, the period for their detection of the same area will be longer than 20 days or more. Therefore, for vegetation in the growing season, the reflectance and albedos will change significantly. Hence robust algorithms to estimate BRDF and albedos in such cases are highly desired. Our algorithms provide a proper choice, since they can generate retrieval results which approximate the true values of different vegetation type of land surfaces by capturing just few observation times.
Moreover, for some sensors with high spatial resolution, the quasi-multiangular data are impossible to obtain, but with our algorithms can achieve results. To show this advantage, we test our regularizing algorithms to the Landsat Thematic Mapper (TM) data measured in Shunyi county of Beijing, China. Figure 5 plots the reflectance for band 5 on 17 May 2001. The spatial resolution for the TM sensor on band 5 is 30 m. We only list the results calculated using the sparse and non-smooth regularization. Figure 6 illustrates the white-sky albedo using the total variation regularization method. The retrieved results show that our algorithm works for satellite data with high spatial resolutions.
Reflectance for band 5 of Landsat Thematic Mapper Data (TM) on May 17, 2001
White-sky albedo retrieved by total variation regularization method
2 Aerosol particle size distribution function retrieval
In this subsection, we consider retrieving aerosol partile size distribution function from the attenuation equation (2.5). It is an infinite dimensional problem with only a finite set of observations, so it is improbable to implement such a system by computer to get a continuous expression of the size distribution function . Numerically, we solve the discrete problem of operator equation (2.5). Using collocation (Wang et al., 2006a), the infinite problem can be written in a finite dimensional form by sampling some grids in the interval [a, b]. Denoting by , where , , and , the aforementioned regularization methods can be applied to solve the distribution function. We consider the Tikhonov regularization in Sobolev space, i.e. the solution is written in the form of with solved by root-finding of the non-linear equation (4.9) and determined by discretization of the Sobolev norm , which is formed as the tridiagonal matrix:
where is the gridding distance between the gridding points. This is a well-behaved matrix with condition number for gridding nodes and interval μm. Compared to Phillips-Twomey’s formulation of regularization, the form of the matrix by the norm of the second differences results in a badly conditioned matrix . For example, with , the largest singular value is 15.998012, the smallest singular value is . This indicates that the condition number of the matrix defined by the ratio of the largest singular value to the smallest singular value equals . Hence, for small singular values of the discrete kernel matrix , the scale matrix cannot have them filtered even with large regularization parameter .
The size distribution function is used to generate synthetic data. The particle size radius interval of interest is μm. This aerosol particle size distribution function can be written as , where is a rapidly varying function of , while is more slowly varying. Since most measurements of the continental aerosol particle size distribution reveal that these functions follow a Junge distribution (Junge, 1955), , where is a shaping constant with typical values in the range 2.0–4.0, therefore it is reasonable to use of Junge type as the weighting factor to . In this paper, we choose and . The form of this size distribution function is similar to the one given by Twomey (1975), where a rapidly changing function can be identified, but it is more similar to a Junge distribution for μm.
We perform Tikhonov regularization method with in the form of (6.2) and determined by root finding method (4.7). In our numerical simulations, the complex refractive indices are assumed to be 0.005, 0.01, 0.05 and 0.1, respectively. The precision of the approximation is characterized by the root mean-square error (rmse):
where refers to the retrieved signals and refers to the measured signals.
Retrieved aerosol particle size distributions for noise level equaling 0.005 and different complex refractive indices are shown in Figure 7(noise at 0.05 is nearly identical). Note that the regularization parameter is iteratively chosen through root finding; their behaviours are illustrated in Figures 8 and 9 for noise level equaling 0.005 and 0.05, respectively. The root mean square errors (rmses) are shown in Table 4. It reveals that for noise level less than 0.1 the rmses are small, which indicates that our method is quite stable for retrieval of aerosol particle size distribution functions.
True and retrieved results with our inversion method in the case of error level δ = 0.005 and different complex refractive indices
Iterative determining regularization parameters using root finding methods when the error level δ = 0.005
Iterative determining regularization parameters using root finding methods when the error level δ = 0.05
The rmses for different noise levels and different complex refractive indices
Refractive indices
Noise levels
VII Conclusion
We study the regularization and optimization methods for solving the inverse problems in remote sensing. Two problems are considered as examples. One is the estimation of land surface parameters; another is the retrieval of aerosol particle size distribution function. The problems are formulated in functional space by introducing the operator equations of the first kind. Bayesian inference theory is introduced. The general regularization model for parameter retrieval problems in (for ) spaces, which can be convex or non-convex, is proposed. The mathematical models and solution methods in and spaces are developed. The total variation regularization methods are proposed. The regularization strategies and optimization solution techniques are fully described. We emphasize that although the methods introduced in this paper are for land surface parameter retrieval and aerosol particle size distribution function retrieval problems, they can be employed in other inverse problems in geosciences and quantitative remote sensing. Numerical simulations for these problems are performed and illustrated.
Footnotes
Appendix
Gradient and Hessian computation of the total variation regularization model
In discrete form, the can be written as:
The gradient of can be obtained by for any :
that:
where denotes the diagonal matrix whose i-th diagonal entry is , is the matrix whose i-th row is and:
Now we go back to the gradient of . Straightforward calculation yields:
Therefore, the gradient of becomes:
The Hessian of would be complicated in computation, it consists of the Hessians of and . One may readily see that:
To obtain the Hessian of, we need to calculate:
Straightforward calculation yields that:
where is given in (A3) and is an operator defined by:
where denotes the diagonal matrix whose i-th diagonal entry is .
Note that for , hence non-negativity may be lost in . Therefore, we drop the term in and obtain an approximate Hessian of :
Hence an approximate Hessian of would be:
This research is supported by National ‘973’ Key Basic Research Developments Program of China under grant numbers 2007CB714400 and National Natural Science Foundation of China under grant numbers 10871191.
Acknowledgements
We are grateful for reviewers’ helpful comments and suggestions on the paper.
References
1.
BockmannC (2001) Hybrid regularization method for the ill-posed inversion of multiwavelength lidar data in the retrieval of aerosol size distributions. Applied Optics40(9): 1329–1342.
BohrenGFHuffmanDR (1983) Absorption and Scattering of Light by Small Particles. New York: Wiley.
4.
CohnSESivakumaranNTodlingR (1994) A fixed-lag Kalman smoother for retrospective data assimilation. Monthly Weather Review122(12): 2838–2867.
5.
CombalBBaretFWeissM. (2003) Retrieval of canopy biophysical variables from bidirectional reflectance-using prior information to solve the ill-posed inverse problem. Remote Sensing of Environment84(1): 1–15.
6.
DaviesCN (1974) Size distribution of atmospheric aerosol. Journal of Aerosol Science5(3): 293–300.
7.
GoelNStrebelD (1983) Inversion of vegetation canopy reflectance for estimating agronomic variables: I. Problem definition and initial results using the suits model. Remote Sensing of Environment13(6): 487–507.
8.
HoughtonJTMeira FilhoLGCallanderBA. (1996) Climate Change 1995: The Science of Climate Change. Cambridge: Cambridge University Press.
9.
JacquemoudSBaretF (1993) Estimating vegetation biophysical parameters by inversion of a reflectance model on high spectral resolution data. In: Varlet-GrancherCBonhommeRSinoquetH (eds) Crop Structure and Light Microclimate: Characterizations and Applications. Paris: INRA, 339–350.
10.
JungeCE (1955) The size distribution and aging of natural aerosols as determined from electrical and optical data on the atmosphere. Journal of Meteorology12(1): 13–25.
11.
KnyazikhinYMartonchikJDinerD. (1998a) Estimation of vegetation canopy leaf area index and fraction of absorbed photosynthetically active radiation from atmosphere-corrected MISR data. Journal of Geophysical Research103(D24): 32239–32256.
12.
KnyazikhinYMartonchikJMyneniR. (1998b) Synergistic algorithm for estimating vegetation canopy leaf are index and fraction of absorbed photosynthetically active radiation from MODIS and MISR data. Journal of Geophysical Research103(D24): 32257–32276.
13.
KuuskA (1991) The inversion of the Nilson-Kuusk canopy reflectance model, a test case. In: Proceedings of the 11th Annual International Geoscience and Remote Sensing Symposium IGARSS’91. Finland: HelsinkiUniversity of Technology, 1547–1550.
14.
LiXGaoFLiuQ. (2000) Validation of a new GO kernel and inversion of land surface albedo by kernel-driven model. Journal of Remote Sensing4(S1): 1–7.
15.
LiXGaoFWangJ. (2001) A priori knowledge accumulation and its application to linear BRDF model inversion. Journal of Geophysical Research106(D11): 11925–11935.
16.
LiXWangJHuB. (1998) On utilization of a priori knowledge in inversion of remote sensing models. Science in China D41(6): 580–585.
17.
LiXWangJStrahlerAH (1999) Apparent reciprocal failure in BRDF of structured surfaces. Progress of Natural Sciences9(10): 747–750.
18.
LiangSL (2004) Quantitative Remote Sensing of Land Surfaces. New York: Wiley.
19.
McCartneyGJ (1976) Optics of Atmosphere. New York: Wiley.
20.
PhillipsDL (1962) A technique for the numerical solution of certain integral equations of the first kind. Journal of the Association for Computing Machinery9(1): 84–97.
21.
PokrovskyOMRoujeanJL (2002) Land surface albedo retrieval via kernel-based BRDF modeling: I. Statistical inversion method and model comparison. Remote Sensing of Environment84(1): 100–119.
22.
PokrovskyOMRoujeanJL (2003) Land surface albedo retrieval via kernel-based BRDF modeling: II. An optimal design scheme for the angular sampling. Remote Sensing of Environment84(1): 120–142.
23.
PokrovskyIOPokrovskyOMRoujeanJL (2003) Development of an operational procedure to estimate surface albedo from the SEVIRI/MSG observing system by using POLDER BRDF measurements: II. Comparison of several inversion techniques and uncertainty in albedo estimates. Remote Sensing of Environment87(2–3): 215–242.
24.
PrivetteJLEckTFDeeringDW (1997) Estimating spectral albedo and nadir reflectance through inversion of simple bidirectional reflectance distribution models with AVH{\open R}/MODIS-like data. Journal of Geophysical Research102(D24): 29529–29542.
25.
PrivetteJMyneniREmeryW. (1996) Optimal sampling conditions for estimating grassland parameters via reflectance model inversions. IEEE Transactions on Geoscience and Remote Sensing34(1): 272–284.
26.
QuaifeTLewisP (2010) Temporal constraints on linear BRDF model parameters. IEEE Transactions on Geoscience and Remote Sensing48(5): 2445–2450.
27.
RoujeanJLLeroyMDeschampsPY (1992) A bidirectional reflectance model of the Earth’s surface for the correction of remote sensing data. Journal of Geophysical Research97(D18): 20455–20468.
28.
SamainORoujeanJLGeigerB (2008) Use of a Kalman filter for the retrieval of surface BRDF coefficients with a time-evolving model based on the ECOCLIMAP land cover classification. Remote Sensing of Environment112(4): 1337–1346.
29.
SarkkaSVehtariALampinenJ (2004) Time series prediction by Kalman smoother with cross-validated noise density. Proceedings of the international Joint Conference on Neural Networks2: 1653–1657.
30.
StrahlerAHLiXWLiangS. (1994) MODIS BRDF/Albedo Product: Algorithm Technical Basis Document. NASA EOS-MODIS Doc. 2.1.
31.
StrahlerAHLuchtWSchaafCB. (1999) MODIS BRDF/Albedo Product: Algorithm Theoretical Basis Document. NASA EOS-MODIS Doc. 5.0.
32.
TikhonovANArseninVY (1977) Solutions of Ill-posed Problems. New York: Wiley.
33.
TikhonovANGoncharskyAVStepanovVV. (1995) Numerical Methods for the Solution of Ill-Posed Problems. Dordrecht: Kluwer.
34.
TwomeyS (1975) Comparison of constrained linear inversion and an iterative nonlinear algorithm applied to the indirect estimation of particle size distributions. Journal of Computational Physics18(2): 188–200.
VerstraeteMMPintyBMynenyRB (1996) Potential and limitations of information extraction on the terrestrial biosphere from satellite remote sensing. Remote Sensing Environment58(2): 201–214.
37.
VoutilainenandAKaipioJP (2000) Statistical inversion of aerosol size distribution data. Journal of Aerosol Science31(S1): 767–768.
38.
WannerWLiXStrahlerAH (1995) On the derivation of kernels for kernel-driven models of bidirectional reflectance. Journal of Geophysical Research100(D10): 21077–21090.
39.
WangYF (2007) Computational Methods for Inverse Problems and Their Applications. Beijing: Higher Education Press.
40.
WangYFXiaoTY (2001) Fast realization algorithms for determining regularization parameters in linear inverse problems. Inverse Problems17(2): 281–291.
41.
WangYFYuanYX (2005) Convergence and regularity of trust region methods for nonlinear ill-posed inverse problems. Inverse Problems21(3): 821–838.
42.
WangYFCaoJJYuanYX. (2009a) Regularizing active set method for nonnegatively constrained ill-posed multichannel image restoration problem. Applied Optics48(7): 1389–1401.
43.
WangYFFanSFFengX (2007a) Retrieval of the aerosol particle size distribution function by incorporating a priori information. Journal of Aerosol Science38(8): 885–901.
44.
WangYFFanSFFengX. (2006a) Regularised inversion method for retrieval of aerosol particle size distribution function in W1,2 space. Applied Optics45(28): 7456–7467.
45.
WangYFLiXWMaSQ. (2005) BRDF model inversion of multiangular remote sensing: Ill-posedness and interior point solution method. Proceedings of the 9th International Symposium on Physical Measurements and Signature in Remote Sensing (ISPMSRS)XXXVI: 328–330.
46.
WangYFLiXWNashedZ. (2007b) Regularised kernel-based brdf model inversion method for ill-posed land surface parameter retrieval. Remote Sensing of Environment111(1): 36–50.
47.
WangYFMaSQYangH. (2009b) On the effective inversion by imposing a priori information for retrieval of land surface parameters. Science in China D52(4): 540–549.
48.
WangYFWenZNashedZ. (2006b) Direct fast method for time-limited signal reconstruction. Applied Optics45(13): 3111–3126.
49.
WangYFYangCCLiXW (2008) Regularizing kernel-based brdf model inversion method for ill-posed land surface parameter retrieval using smoothness constraint. Journal of Geophysical Research113(D13): D13101.1–D13101.11.
50.
WangYFYangCCLiXW (2009c) Kernel-based quantitative remote sensing inversion. In: Camps-VallsGBruzzoneL (eds) Kernel Methods for Remote Sensing Data Analysis. Chichester: Wiley.
51.
WeissMBaretF (1999) Evaluation of canopy biophysical variable retrieval performance from the accumulation of large swath satellite data. Remote Sensing of Environment70(3): 293–306.
52.
WeissMBaretFMyneniR. (2000) Investigation of a model inversion technique to estimate canopy biophysical variables from spectral and directional reflectance data. Agronomie20(1): 3–22.
53.
XiaoTYYuSGWangYF (2003) Numerical Methods for the Solution of Inverse Problems. Beijing: Science Press.
54.
YeYY (1997) Interior Point Algorithms: Theory and Analysis. Chichester: Wiley.
55.
YosidaK (1999) Functional Analysis. Beijing: World Publishing Corporation.
56.
YuanYX (1993) Numerical Methods for Nonlinear Programming. Shanghai: Shanghai Science and Technology Publications.
57.
YuanYX (2001) A scaled central path for linear programming. Journal of Computational Mathematics19(1): 35–40.