General classes of bivariate distributions are well-studied in the literature. Most of these classes are proposed via a copula formulation or extensions of some characterization properties in the univariate case. In Kundu (2022), we see one such semi-parametric family useful to model bivariate data with ties. This model is a general semi-parametric model with a baseline. In this paper, we present a characterization property of this class of distributions in terms of a functional equation. The general solution to this equation is explored. Necessary and sufficient conditions under which the solution becomes a bivariate distribution are investigated. An application of the characterization property of the proposed class for generating bivariate pairs of random variables from a member distribution is also discussed.
Analyzing bivariate datasets is very challenging as it involves the association between two variables. Data with ties are also common. The Marshall-Olkin copula, defined by
where and is apt in modelling data with ties (see Nelsen (2007)). Many authors have proposed general classes of bivariate distributions and studied their properties. In Eq. (1), when is exponential () for and and , the copula reduces to the well-studied Marshall-Olkin bivariate exponential distribution with a singular component and is specified as
where . Here is a bivariate random vector having survival function with support . The distribution in Eq. (2) is characterized by the functional equation popularly known as bivariate lack of memory property (BLMP) (Marshall & Olkin, 1967) given by
for all . Kolev and Pinto (2018b) worked on Marshall-Olkin’s bivariate exponential distribution providing a weak version of the BLMP, which can be used to construct bivariate distributions having a singularity component along an arbitrary line through the origin. Lin et al. (2019) studied the moment generating function, product moments and dependence structure of the bivariate distributions satisfying the BLMP. Many authors have studied the BLMP from different perspectives. See Pinto and Kolev (2015) and Kolev and Pinto (2018a) to mention a few. In fact, the BLMP can be translated as a property of arbitrary bivariate continuous distributions (Galambos & Kotz, 2006).
If is Weibull () for , the Marshall-Olkin copula in Eq. (1) gives a bivariate distribution with a singular component whose survival function is
where , characterized by the functional equation,
for all .
If is the one-parameter Pareto () for , the Marshall-Olkin copula in Eq. (1) gives a bivariate distribution with a singular component whose survival function is
where , characterized by the functional equation,
for all .
If the marginals belong to the proportional hazard (PH) class given by for , then Eq. (1) becomes the copula associated with the bivariate proportional hazard class (BPHC) mentioned in Kundu (2022), the survival function of which is given by
where is the baseline distribution with for without loss of generality and . In this paper, the functional equation characterized by the singular family of bivariate distributions in Eq. (5) is generalized. The solution of this equation is a very general class of singular family of bivariate distributions of which Eq. (5) is a member. The rest of the paper is organized as below.
Section 2 proposes a class of distributions as a general solution to the functional equation characterizing Eq. (5). This functional equation characterizes a very general class of distributions, including those in Eqs (1)–(5). It is observed through a counter-example that the general solutions of this functional equation need not necessarily generate a bivariate probability distribution function. However, imposing certain conditions on the marginals ensures the solution is a bivariate probability distribution function. Accordingly, in Section 3, necessary and sufficient conditions are derived for univariate probability distributions to qualify as marginals for the bivariate probability distribution belonging to the proposed class. In Section 4, we translate these in terms of failure rates for the ease of constructing bivariate probability distributions satisfying the functional equation. Section 5 discusses the proposed class of probability distributions and its application in simulating a bivariate pair of random variables from a member distribution of the proposed class.
Functional equation
In this section, we propose a characterization of Eq. (5) in terms of a functional equation, the general solution of which contains many bivariate singular distributions including Eq. (5).
.
For some baseline survival function , the functional equation
where , and . Writing , and for some , Eq. (10) can be expressed equivalently as
Note that . Since Eq. (11) is satisfied for all , we can show from Aczél (1966) (page 38) that every solution is either everywhere or nowhere zero. We can now develop the proof in line with the results in Aczél (1966) (page 38). This is achieved by noting that is a non-negative continuous function. Hence taking logarithms on both sides of Eq. (11) reduces it to the Cauchy functional equation,
where . By induction, it follows directly that, for some positive integer ,
For any two positive integers and , if , then
In particular, for and , where , it follows that
for every positive rational . For , follows from Eq. (12). Moreover since is continuous, taking limits on both sides of Eq. (14) implies for all non-negative . This gives implying
Arguing similarly for , Eq. (7) can be retrieved. The converse is direct. ∎
.
Let and , then the univariate random variables and are independent and the joint function in Eq. (7) can be written as the product of the survival functions of and for .
Proof..
Writing , the survival function of follows from Eq. (15) as,
Also, writing , the survival function of is given by,
The application of this corollary is discussed in detail in Section 5. The next question of interest is whether the general equation in Eq. (7) represents a bivariate probability distribution function. The following counter-example shows that it does not always represent a bivariate probability distribution function for any arbitrary marginal distribution.
.
The LFR-exponential class: For the linear failure rate (LFR) marginals specified by,
On closely examining , it can be shown that the rectangular positivity condition for bivariate distributions is violated. For example, for the choice of and , disqualifying it as a bivariate probability survival function. The functional equation Eq. (16) has been well-studied under this perspective in Kulkarni (2006). Hence not all solutions with arbitrary and give bivariate probability distribution. Imposing some restrictions on the marginals ensures this. Motivated by this, we develop the conditions to be satisfied by the marginals so that is a bivariate survival function.
Necessary and sufficient conditions for generating bivariate distributions
In this section, the conditions on univariate distributions to qualify as marginal in Eq. (7) so that Eq. (7) is a bivariate survival function are derived. Let , where is the vector of parameters involved in the baseline and is the vector of parameters involved in the marginal .
.
Let be a distribution function with absolutely continuous density for which . The necessary and sufficient conditions for in Eq. (7) be a bivariate probability distribution are that
where .
Proof..
will be a bivariate survival function if and only if it can be written as a convex mixture of and where is the absolutely continuous part, is the singular part, and and are survival functions. Observe, for ,
Also,
and
Writing,
we have,
and
so that
Hence the absolutely continuous part has density given by Eqs (18) and (19). Also,
But , so that . Therefore, is a valid survival function if
is a convex mixture of and . From Eq. (19), since , it follows that
is a valid survival function. From Eq. (18), since , for , it follows that
This completes the proof. ∎
Counter-Example 2.1 (contd.) The LFR-exponential class: Consider again the bivariate function given in Eq. (17). Condition (ii) of Theorem 3.1 becomes
which is not satisfied for the choice of and . This again shows that a bivariate probability distribution cannot be formed with marginal being the linear failure rate model and baseline as exponential distribution in Eq. (7). However, any proportional hazard distribution qualifies as marginals in Eq. (7). This is discussed in detail in the next theorem.
.
The survival function with the marginals belonging to the PH class satisfies the functional equation Eq. (6) if and only if it is of the form
for some with .
Proof..
Let Eq. (6) be satisfied. Then, the general solution of is given by Eq. (7). If the marginals belong to the PH class so that , then the bivariate function in Eq. (7) reduces to Eq. (20). Now, it remains to show that is a valid bivariate probability distribution. Proceeding as in Theorem 3.1,
Also,
and
so that . Hence, the absolute continuous part has density given by,
Note that . But, so that . From (i) and (ii), as in Theorem 3.1, we have , which is true since . Also, is a valid survival function implying , which is true for any choice of the baseline distribution . The converse is direct. ∎
Construction of bivariate probability distributions satisfying functional equation Eq. (6)
The functional equation Eq. (6) can be equivalently represented in terms of the hazard gradient. The discussions above translated in terms of the hazard gradient would make it computationally easier to check if marginals qualify for the function Eq. (7) to be a bivariate survival function. Accordingly, in this section, the functional equation in terms of its hazard gradient is studied. The hazard gradient vector (Johnson & Kotz, 1975) is defined as
where .
Note that the functional equation Eq. (6) can be equivalently expressed as
Differentiate with respect to , to get,
which gives a condition in terms of the hazard gradient to satisfy the functional equation Eq. (6) as
where is the baseline hazard function. On integrating Eq. (23), the functional equation Eq. (6) is retrieved.
.
A bivariate random vector satisfy Eq. (6) if and only if
where is the marginal hazard rate of .
Proof..
Suppose satisfy,
Then,
For and , it follows that,
On differentiating w.r.t , it further follows that,
and
Proceed in a similar manner for to obtain Eq. (24). The converse is straight forward through the representation,
or
(Johnson & Kotz, 1975). ∎
On close investigation of Eq. (24), observe that the marginal hazard function should satisfy the condition,
since . Based on these observations, the necessary and sufficient conditions for differentiable to qualify as marginal failure rates for to be a bivariate survival function are discussed in the following theorem.
.
The necessary and sufficient conditions for differentiable functions to qualify as marginal failure rates of in Eq. (7) are that for ,
where . The bivariate function given by Eq. (7) is a survival function satisfying the functional equation in Eq. (6) where
Proof..
Condition (i) follows from Eq. (25), and condition (ii) is necessary for to be a hazard rate function. Conditions (iii) and (iv) imply conditions (i) and (ii) of Theorem 3.1.
Conversely, the hazard gradient associated with is given by Eq. (24), and condition (i) follows from the non-negativity of . Since are univariate failure rates, condition (ii) follows. Condition (iii) follows from the non-negativity of the density function associated with , and condition (iv) follows from Theorem 3.1. ∎
Counter-Example 2.1 (contd.) The LFR-exponential class: The marginal failure rates are given by
Then, condition (i) in Theorem 4.2 becomes , which is not satisfied for the choice of and . Hence, we cannot construct a bivariate survival function with the marginals as linear failure rate and baseline distribution as exponential.
.
The BPHC-Weibull class: For a proportional hazard rate,
with and , it is easily seen that conditions of Theorem 4.2 are satisfied. For,
.
for . From Eq. (26), , where . Substituting in Eq. (7), we get
Discussion
A class of distributions has been proposed as a general solution to the functional equation Eq. (6). This class of distribution is specified by,
for some and , where and is the baseline distribution. A counter-example is given, showing that not all marginals admit a bivariate probability distribution in this class. The conditions to be satisfied by a univariate distribution to be a marginal of the bivariate class are developed. These conditions are translated in terms of the univariate failure rates enabling them to be easily used to generate a bivariate probability distribution belonging to the general class. Many well-known classes of distributions are members of the proposed class in Eq. (7). Some of the well-studied classes are given in the following examples.
Example 1 – Extended Freund’s bivariate distribution (Asha et al., 2016)
If the marginals are specified by
for where and rewriting , the solution in Eq. (7) reduces to
Example 2 – Bivariate semi-parametric singular family of distributions (Kundu, 2022)
If the marginals are specified by
where and rewriting , the solution in Eq. (7) reduces to
Apart from this, the distributional properties and inference procedures of several probability distributions belonging to the class in Eq. (7) are discussed in the literature (see Proschan and Sullo (1976), Pena and Gupta (1990), Asha et al. (2016), Kundu (2022) to mention a few). For these studies, it is necessary to generate pairs of random variables from a continuous bivariate distribution. While specific simulation techniques are available for certain bivariate distributions, such as bivariate gamma, exponential, and normal distributions, simulating from a general bivariate distribution can often present multiple challenges. Theorem 2.1 provides a characterization result that aids in simulating a pair of random variables whose distribution satisfies the functional equation Eq. (6) through Corollary 2.1. Here, we can generate a bivariate pair by simulating and and using the probability for a given bivariate distribution satisfying the functional equation Eq. (6). This method also simplifies testing the goodness of fit of a bivariate data set, as it only requires testing of univariate quantities.
Footnotes
Acknowledgments
The authors thank the referee for the meticulous reading that has tremendously improved the quality of the paper. This work was financially supported by the Council of Scientific & Industrial Research (CSIR), Government of India, through the Senior Research Fellowship scheme vide No 09/239(0551)/2019-EMR-I.
Conflict of interest
The authors have no conflicts of interest to declare.
References
1.
AczélJ. (1966). Lectures on functional equations and their applications. Academic press.
2.
AshaG.Krishna KMJ., & KunduD. (2016). An extension of the Freund’s bivariate distribution to model load-sharing systems. American Journal of Mathematical and Management Sciences, 35(3), 207-226.
3.
GalambosJ., & KotzS. (2006). Characterizations of Probability Distributions.: A Unified Approach with an Emphasis on Exponential and Related Models, Vol. 675. Springer.
4.
JohnsonN.L., & KotzS. (1975). A vector multivariate hazard rate. Journal of Multivariate Analysis, 5(1), 53-66.
KolevN., & PintoJ. (2018b). A weak version of bivariate lack of memory property. Brazilian Journal of Probability and Statistics, 32(4), 873-906.
7.
KulkarniH. (2006). Characterizations and modelling of multivariate lack of memory property. Metrika, 64(2), 167-180.
8.
KunduD. (2022). Bivariate semi-parametric singular family of distributions and its applications. Sankhya B, 1-27.
9.
LinG.D.DouX., & KurikiS. (2019). The bivariate lack-of-memory distributions. Sankhya A, 81(2), 273-297.
10.
MarshallA.W., & OlkinI. (1967). A multivariate exponential distribution. Journal of the American Statistical Association, 62(317), 30-44.
11.
NelsenR.B. (2007). An introduction to copulas. Springer Science & Business Media.
12.
PenaE.A., & GuptaA.K. (1990). Bayes estimation for the Marshall-Olkin exponential distribution. Journal of the Royal Statistical Society: Series B (Methodological), 52(2), 379-389.
13.
PintoJ., & KolevN. (2015). Sibuya-type bivariate lack of memory property. Journal of Multivariate Analysis, 134, 119-128.
14.
ProschanF., & SulloP. (1976). Estimating the parameters of a multivariate exponential distribution. Journal of the American Statistical Association, 71(354), 465-472.