Bayesian statistical inference is an important method of mathematical statistics in which both sample information and prior information are employed. Traditionally, it is often assumed that the sample observations from the population are observed precisely and characterized by crisp values. However, in many cases, the sample observations are collected in an imprecise way and characterized by uncertain values. In this paper, based on uncertain theory, we propose three kinds of uncertain Bayesian statistical inference including Bayesian point estimation, Bayesian interval estimation and Bayesian hypothesis test. Some numerical examples of uncertain Bayesian inference are presented to illustrate the proposed methods.
Bayesian statistical inference is an important technique in mathematical statistics in which Bayes’ theorem (Albert [1]) is used to update the probability for a hypothesis as more evidence or information becomes available. Bayesian statistics was named after Bayes [3], who studied how to compute a distribution for the probability parameter of a binomial distribution. Laplace [8] used Bayesian methods to solve a number of statistical problems. Many Bayesian methods were developed by Jeffreys [7], Berger [4], etc. With the advent of powerful computers and new algorithms like Markov chain Monte Carlo (Brooks and Roberts [5]), Bayesian methods have seen increasing use within statistics, such as Bayesian experimental design, Bayesian networks, Bayesian hierarchical modelling. Hierarchical model is used when information is available on several different levels of observational units. The hierarchical form of analysis and organization helps in the understanding of multiparameter problems and also plays an important role in developing computational strategies. Baladandayuthapani et al. [2] employed Bayesian hierarchical model to analyze colon carcinogenesis data. Shen et al. [19] proposed an improved Bayesian dictionary learning algorithm to effectively recover Aqua Moderate Resolution Imaging Spectroradiometer Band 6.
In Bayesian statistics, it is important to obtain the methods for estimating and testing the unknown parameters form the given observations and prior information. Traditionally, the observations of the sample are assumed to be precise and treated as real numbers. However, the observations are often imprecise by virtue of the uncertainty when acquiring the sample, and cannot be treated as crisp values. For example, the data of the factoriesąŕ carbon emission are collected in an imprecise way. It was shown by many surveys that uncertainty theory is more fitted to model the data with imprecise observations given by the experts (Liu [15]). Thus we should take the imprecisely observed data as uncertain variables and describe them by uncertainty distributions (Liu [10]).
The uncertainty theory is a branch of mathematics to model the human uncertainty. In the framework of the uncertainty theory, uncertain variable is used to model an uncertain quantity, and uncertainty distribution was employed to describe an uncertain variable. Gao [6] discussed some mathematical properties of uncertain measure. Peng and Iwamura [18] showed what function is an uncertainty distribution. Liu [13] proposed the operational law of uncertain variables based on the product measure axiom. The expected value of the function of uncertain variables was proposed by Liu and Ha [17]. The variance of an uncertain variable was provided by Yao [21]. The moment of an uncertain variable was proposed by Sheng and Ha [20] and Liu [16]. You [26] and Zhang [27] studied the convergence of a sequence of uncertain variables. After about ten years of development, the uncertainty theory has found a wide range of applications, for example, uncertain risk analysis (Liu [14]), uncertain programming (Liu [12]), uncertain statistics (Yao and Liu [25]), uncertain process (Yao and Li [24]), etc. Uncertain statistics is a method to collect and interpret expertąŕs data. Liu [13] proposed the least squares estimation of the unknown parameters in the uncertainty distribution. Linear uncertain regression analysis was proposed by Yao and Liu [25] and Lio and Liu [9]. Uncertain time series analysis was documented by Yang and Liu [22]. Uncertain statistical inference with the imprecise observations was discussed by Yao [23].
In this paper, we propose the uncertain Bayesian statistics inference with uncertainty variables based on uncertainty theory and Bayesian inference. Three kinds of Bayesian inference methods including the uncertain Bayesian point estimation, the uncertain Bayesian hypothesis test, and the uncertain Bayesian interval estimation are investigated when observations of the sample are imprecise. The rest of this paper is structured as follows. Some basic knowledge about uncertain theory are introduced in Section 2. Uncertain Bayesian point estimation is proposed in Section 3. Uncertain Bayesian hypothesis test is proposed in Section 4. Uncertain Bayesian interval estimation is proposed in Section 5. Finally, some remarks are made in Section 6.
Preliminary
In this section, we review some basic knowledge on uncertainty theory from the related references in order to make it easy for the reader to understand the paper. Definition 2.1. (Liu [10]) Let Ł be a σ-algebra on a nonempty set Γ. A set function M : Ł → [0, 1] is called an uncertain measure if it satisfies the following axioms: Axiom 1. (Normality Axiom) M {Γ} =1 for the universal set Γ;
Axiom 2. (Duality Axiom) M {Λ} + M {Λc} =1 for any event Λ;
Axiom 3. (Subadditivity Axiom) For every countable sequence of events Λ1, Λ2, ⋯ , we have
Besides, in order to provide the operational law, Liu [11] defined the product uncertain measure on the product σ-algebra Ł as follows.
Axiom 4. (Product Axiom) Let (Γk, Ł k, Mk) be uncertainty spaces for k = 1, 2, ⋯ . The product uncertain measure M is an uncertain measure satisfying
where Λk are arbitrarily chosen events from Łk for k = 1, 2, ⋯, respectively.
Definition 2.2. (Liu[10]) An uncertain variable is a function ξ from an uncertainty space (Γ, eulerL, M) to the set of real numbers such that {ξ ∈ B} is an event for any Borel set B.
In order to describe uncertain variable in practice, uncertainty distribution of an uncertain variable ξ is defined as Φ (x) = M{ ξ ≤ x } for any real number x. An uncertainty distribution Φ is called regular if its inverse function Φ-1 exists. In this case, the inverse function Φ-1 is called an inverse uncertainty distribution.
An uncertain variable ξ is called linear if it has a linear uncertainty distribution
denoted by Ł (a, b), where a and b are real numbers with a < b, and the inverse uncertainty distribution of linear uncertain variable Ł (a, b) is
The uncertain variables ξ1, ξ2, ⋯ , ξm are said to be independent (Liu[11]) if
for any Borel sets B1, B2, …, Bm of real numbers.
Let ξ1, ξ2 ⋯ , ξn be independent uncertain variables with regular uncertainty distributions Φ1, Φ2 ⋯ , Φn, respectively. If f is strictly increasing with respect to ξ1, ξ2 ⋯ , ξm and strictly decreasing with respect to ξm+1, ξm+2 ⋯ , ξn, then ξ = f (ξ1, ξ2 ⋯ , ξn) is an uncertain variable, the inverse uncertainty distribution of the ξ = f (ξ1, ξ2 ⋯ , ξn) are given by Liu [10],
As the average value of an uncertain variable in the sense of uncertain measure, expected value can represent the size of the uncertain variable. The expected value and variance of uncertain variable ξ are defined by
provided that at least one of the two integrals is finite,
respectively.
Let ξ be an uncertain variable with an uncertainty distribution Φ. If its kth moment E [ξk] exists, then
Furthermore, if Φ is regular, then
lET ξ be an uncertain variable with uncertainty distribution Φ, and f (x) be a strictly monotone (increasing or decreasing), then we have
Uncertain Bayesian point estimation
In Bayesian statistical inference, Bayes’ theorem can be used to estimate the parameters of a probability distribution or statistical model. Since Bayesian statistics treats probability as a degree of belief, we can directly assign a probability distribution that quantifies the belief to the parameter or set of parameters. The prior distribution π (θ) includes information about unknown parameter θ, and the data x includes the information about θ. The information in the prior and data are integrated in the joint distribution function f (x, θ) = f (x|θ) π (θ). The posterior distribution π (θ|x), which is the conditional distribution of the unknown quantities conditional on the observed data, is obtained easily using condition probability formula,
where f (x, θ) is the marginal distribution of x, f (x|θ) is the distribution of the data conditional on the parameter. The above formula is called Bayesąŕ theorem. It describes the conditional probability of an event based on data as well as prior information or beliefs about the event or conditions related to the event. The posterior probability distribution π (θ|x) is essential to Bayesian statistical inference for θ. When the observations of the sample from the population are imprecise, we propose uncertain Bayesian statistic inference problems such as Bayesian point estimation, Bayesian hypothesis test, and Bayesian interval estimation.
This section introduces uncertain Bayesian expectation estimator and uncertain Bayesian mode estimation when the observations are uncertain variables but not traditional crisp values.
Uncertain Bayesian point estimation
Let uncertain variables ξ1, ⋯ , ξn denote the imprecise observations of sample from the population with a probability density distribution f (x|θ). Suppose an unknown parameter θ is known to have a prior distribution π. For simplicity, we write ξ = (ξ1, ⋯ , ξn). The likelihood function is defined by
By Bayes’ theorem, we have the posterior distribution
Once the posterior distribution is available it can be used to estimation parameter of interest. It is best done using ideal of loss function. Let be an estimator of θ based on uncertainty variables ξ1, ⋯ , ξn. Mean square error (MSE) is the most frequently used risk function. It is defined by
where the subscript θ|ξ denotes the expectation taken under the posterior distribution π (θ|ξ). Using the above MSE risk function, the Bayesian estimate of the unknown parameter θ can be proved simply to be the expectation of the mean of the posterior distribution π (θ|ξ), i.e.
The proof of (3.2) can be found in Berger [4]. Note that observation data ξ = (ξ1, ⋯ , ξn) are not crisp numbers but uncertain variables, thus is also an uncertain variable. We take the numeral characteristic such as the expectation of as the point estimation of the unknown parameter θ. Thus,
where the subscriptM in (3.3) denotes the expectation is taken under the uncertain distribution.
Definition 3.1. Suppose the population is X with an unknown parameter θ, the probability density function of X is f (x|θ), the imprecise observations of sample from the population are denoted by uncertain variables ξ1, ⋯ , ξn. Suppose an unknown parameter θ is known to have a prior distribution π, then the uncertain Bayes expectation estimator of θ is EM [E [θ|ξ]].
Example 3.1: If the population follows a normal distribution x|θ ∼ N (θ, σ2), where σ2 is known. Let ξ1, ⋯ , ξn be the imprecise observations of sample from the population f (x|θ). The likelihood function of this sample is
The prior distribution for the parameter θ is taken to be normal θ ∼ N (μ, τ2),
where μ, τ2 are known. The posterior distribution can be obtained from equation (3.1),
where
and write By (3.2), the Bayes estimator under MSE is given by
Assume Φ (xi) is the uncertainty distribution of imprecise observation ξi. Then, we can estimate the value of parameter θ by
As a numerical example, we consider doing an intelligence test on a child. Suppose the test results follow a normal distribution x ∼ N (θ, 100), where θ is defined as the child’s IQ in psychology. Based on past tests, it can be set θ ∼ N (100, 225). Using the above method, the posterior distribution of the child’s IQ can be obtained when an imprecise observation ξ is obtained. The posterior distribution of child’s IQ is , where
If further suppose the child’s score ξ on this test follows linear uncertainty distribution by some expert, then the Bayes estimate of the child’s IQ by (3.5) is
Example 3.2: If uncertainty variable ξ is the imprecise observation from the Binomial distribution B (n, θ),
and if the prior is Beta distribution θ ∼ Be (a, b),
where a > 0, b > 0 are known and Γ (·) is the gamma function. By (3.1), the posterior is also Beta distributed,
that is θ|ξ ∼ Be (a + ξ, b + n - ξ). The Bayes estimator under MSE is given by
Assume the imprecise observation ξ has the uncertainty distribution Φ (x), then we can estimate the value of parameter θ by
Example 3.3: Assume population distribution is the exponential distribution x|θ ∼ exp(θ),
If the prior is Gamma distributed θ ∼ Ga (α, β),
where α > 0, β > 0 are known. By (3.1), the posterior is also Gamma distributed, .
The Bayes estimator under MSE is given by
Assume there are only n imprecise observations defined ξ1, ξ2, ⋯ , ξn available for us to estimate the unknown parameter θ of exponential distribution. Then, we can estimate the value of parameter θ by
Let ξi are independent uncertain variables with regular uncertainty distribution Φi, according to (2.1) the inverse uncertainty distribution of the sum is
By (2.2), we have
Uncertain Bayesian mode estimation
Suppose an unknown parameter θ is known to have a prior distribution π. Let uncertain variables ξ1, ⋯ , ξn denote the imprecise observations of sample from the population with a probability density distribution f (x|θ). One can derive the posterior distribution
Definition 3.2. If there is , such that
Then, uncertain posterior mode estimation is defined by
If π (θ|ξ) is derivative with respect to θ, then is the solution to the following equation:
Example 3.4: If the population follows a normal distribution x|θ ∼ N (θ, σ2), where σ2 is known. Let ξ1, ⋯ , ξn be the imprecise observations of sample from the population f (x|θ). The prior distribution for the parameter θ is taken to be normal θ ∼ N (μ, τ2), where μ, τ2 are known. The posterior distribution is , as shown in (3.4). By (3.7),
Hence, we have
Posterior mode estimation is
Remark: When the posterior distribution is symmetric, the minimum mean square error estimator and posterior mode estimation are identical.
Example 3.5: If uncertainty variable ξ is the imprecise observation from the Binomial distribution B (n, θ), and the prior is Beta distribution θ ∼ Be (a, b). The posterior is also Beta distributed, θ|ξ ∼ Be (a + ξ, b + n - ξ). By (3.7),
Hence, we have
Posterior mode estimation is
Uncertain Bayesian hypothesis test
This section introduces Bayesian hypothesis test when the observations of the sample are imprecise.
Let Θ be the set of all possible values of θ. Let null hypothesis about θ be H0 : θ ∈ Θ0 and alternative hypothesis be H1 : θ ∈ Θ1, where Θ0 and Θ1 are given subsets of Θ. We consider the following test question:
In the Bayesian framework, hypothesis test is straightforward. When acquiring the posterior distribution, we compute posterior probability of null hypothesis H0 and alternative hypothesis H1, respectively,
and
We make decisions by comparing α0 and α1. Note that α0 (ξ) and α1 (ξ) are uncertainty variables. If
we think uncertainty variable α0 (ξ) > α1 (ξ), and the null hypothesis H0 : θ ∈ Θ0 is accepted. If
we think uncertainty variable α0 (ξ) < α1 (ξ) and H0 is rejected. If
then we do not make decision, and need to furthermore acquire expert’s data or collect prior information. Example 4.1: Assume ξ1, ⋯ , ξn are some imprecise observations from a population with normal distribution x|θ ∼ N (θ, σ2), and the prior is normal, θ ∼ N (μ, τ2). The posterior distribution is by Example 3.1. Consider testing
where θ0 is a number fixed by some experimenter. Firstly, compute α0 (ξ) and α1 (ξ). Since the posterior distribution is , we have
where F (·) is distribution function of standard normal distribution N (0, 1). By duality of probability, we have
Nextly, we compute
As a numerical example, we consider the intelligence test on a child in Example 3.1. Suppose the test results follow a normal distribution x ∼ N (θ, 100), where θ is defined as the child’s IQ in psychology. The prior is θ ∼ N (100, 225). the posterior distribution of the child’s IQ can be obtained when an imprecise observation ξ is obtained. The posterior distribution of child’s IQ is , where
If further suppose the child’s score ξ on this test follows linear uncertainty distribution by some expert. Consider testing
We compute posterior probability of null hypothesis H0 and alternative hypothesis H1, respectively,
Nextly, we compute
Thus, the hypothesis H0 : θ ≤ 100 should be accepted, the child’s IQ scores is not more than 100.
Uncertain Bayesian interval estimation
In Bayesian statistics, a credible interval is a range of values within which an unobserved parameter value falls with a particular subjective probability. It is an interval in the domain of a posterior probability density function. Assume there are n imprecise observations of sample from the population with a probability density function f (x|θ), which we denote by uncertain variables ξ1, ⋯ , ξn. Suppose an unknown parameter θ is known to have a prior distribution π. When posterior distribution π (θ|ξ) is acquired, we can find a and b, such that
Note that a and b are functions of uncertain vector ξ, the definition of uncertain Bayesian credible interval is give as follows.
Definition 5.1. (Uncertain credible interval) Assume posterior distribution of parameter θ is f (θ|ξ), for the uncertainty variables vector ξ and probability 1 - α (0 < α < 1), if the uncertainty variables and , such that
then is called uncertain Bayesain credible interval where credible level is 1 - α . If satisfies
then is called uncertain Bayesian lower credible limit where credible level is 1 - α . If satisfies
then is called uncertain Bayesian upper credible limit where credible level is 1 - α .
Example 5.1: Assume ξ1, ⋯ , ξn are some imprecise observations from a population with normal distribution x|θ ∼ N (θ, σ2), and the prior is normal, θ ∼ N (μ, τ2). The posterior distribution is by Example 3.1. We have
Let , and we have
where zα is α quantile of standard normal distribution. Hence,
and credible interval of 1 - α credible level is
As a numerical example, we consider the intelligence test on a child in Example 3.1. Suppose the test results follow a normal distribution x ∼ N (θ, 100), where θ is defined as the child’s IQ in psychology. The prior is θ ∼ N (100, 225). the posterior distribution of the child’s IQ can be obtained when an imprecise observation ξ is obtained. The posterior distribution of children’s IQ is , where
If further suppose the child scores ξ on this test follows linear uncertainty distribution by some expert. we take credible level 95%, by (5.1),
and
Thus, we have
In a similar way, we can obtain EM [b] =112.80. The 95% credible interval of the child’s IQ is [80.23,112.80].
Conclusion
Uncertain Bayestian statistical inference is a new interdisciplinary to solve Bayestian statistical inference problems when observations of the sample are imprecise. This paper treated the imprecise observations as uncertain variables, and proposed three kinds of Bayesian inference methods such as Bayesian point estimation, Bayesian hypothesis test, and Bayesian interval estimation. When there is no analytic expression for the posterior distribution, the calculations are quite complicated, the proposed methods are challenging or failure and need for further research.
References
1.
AlbertJ., Bayesian Computation with R. Springer, New York, 2007.
2.
BaladandayuthapaniV., MallickB.K., HongM.Y., LuptonJ.R., TurnerN.D., CarrollR.J., Bayesian Hierarchical Spatially Correlated Functional Data Analysis with Application to Colon Carcinogenesis, Biometrics64 (1) (2008), 64–73.
3.
BayesT., Anessay towards solving a problem in the doctrine of chances, Philosophical Transaction of the Royal Society (54) 1764, 296–325.
4.
BergerJ.O., Statistical decision theory and Bayesian analysis. Belin: Springer, 1985.
5.
BrooksS., RobertsG., Assessing convergence of Markov chain Monte Carlo algorithms, Stat Comput8 (1998), 319–335.
6.
GaoX., Some properties of continuous uncertain measure, Int J Uncertain Fuzziness17 (3) (2009), 419–426.
7.
JeffreysH., Theory of probability, 3rd ed. London: Oxford Universtiy Oress1961.
8.
LaplaceP., Théorie Analytique des Probabilités. Paris: Courcier, 1812.
9.
LioW., LiuB., Residual and confidence interval for uncertain regression model with imprecise observations, J Intell fuzzy syst35 (2) (2018), 2573–2583.
LiuY.H., HaM., Expected value of function of uncertain variables, J Uncertain Syst4 (3) (2010), 181–186.
18.
PengZ.X., IwamuraK., A sufficient and necessary condition of uncertainty distribution, J Interdiscip Math13 (3) (2010), 277–285.
19.
ShenH., LiX., ZhangL., TaoD., ZengC., Compressed sensing based inpainting of aqua moderate resolution imaging spectroradiometer band 6 using adaptive spectrum weighted sparse Bayesian dictionary learning, IEEE T Geosci Remote52 (2) (2014), 894–906.
20.
ShengY.H., KarS., Some results of moments of uncertain variable through inverse uncertainty distribution, Fuzzy Optim Decis Making14 (1) (2015), 57–76.
21.
YaoK., A formula to calculate the variance of uncertain variable, Soft Comput19 (10) (2015), 2947–2953.
22.
YangX.F., LiuB., Uncertain time series analysis with imprecise observations, Technical Report, 2017.
23.
YaoK., Uncertain statistical inference models with imprecise observations, IEEE Trans Fuzzy Syst26 (2) (2018), 409–415.
24.
YaoK., LiX., Uncertain alternating renewal process and its application, IEEE T Fuzzy Syst20 (6) (2012), 1154–1160.
25.
YaoK., LiuB., Uncertain regression analysis: An approach for imprecise observations, Soft Comput22 (17) (2018), 5579–5582.
26.
YouC., On the convergence of uncertain sequences, Math Comput Model49 (3) (2009), 482–487.
27.
ZhangZ.M., Some discussions on uncertain measure, Fuzzy Optim Decis Making10 (1) (2011), 31–43.