Concept of Ranked Set Sampling is applicable whenever ranking on a set of sampling units can be done easily by a judgment method or based on an auxiliary variable. In this work, we consider a study variable correlated with auxiliary variable which is used to rank the sampling units. Further is assumed to have bivariate Lomax distribution. We obtain an unbiased estimator of scale parameter of study variable based on ranked set sample and censored ranked set sample. Efficiency comparison of these estimators with usual SRS estimator is also performed.
The concept of Ranked Set Sampling (RSS) was first introduced by McIntyre (1952) for improving the precision of the sample mean, an estimator of population mean. This concept is applicable whenever variable of interest is difficult or expensive to measure but ranking on a small set of measurement is easily available. McIntyre (1952) uses judgement method for ranking a set of sample units. For a detailed discussion on the theory and applications of RSS see, Chen et al. (2004). Judgement method is not suitable when there is ambiguity in discriminating the rank of one unit with another. Judgement method is not mathematically much relevant with the problem of study. To remedy this problem Stokes (1977) uses an auxiliary variable for ranking of the sampling units. Stokes (1977) considers a situation where the variable of interest, say is difficult or expensive to measure, but an auxiliary variable correlated with is easily measurable and can be ordered exactly. The procedure of RSS using auxiliary variable described by Stokes (1977) is as follows.
Choose independent bivariate samples each of size . Arrange them randomly into samples each with units and observe the value of the auxiliary variable on each of these units. For the first sample, select that unit for which the measurement on the auxiliary variable is the smallest and measure the variate associated with smallest variate. In the second sample, choose associated to the second smallest . This procedure is repeated until the last sample associated with the largest is measured. The resulting set of units chosen by one from each sample as described above is called RSS. Let be the bivariate random sample selected from the sample, where denote the order statistic of the auxiliary variable in the sample and denote the measurement made on the variate associated with . David and Nagaraja (2003) referred as the concomitant of the order statistic arising from the sample.
Bain (1978) discusses a striking example of RSS where a study variate represents the oil pollution of sea water and the auxiliary variable represents the tar deposit in the nearby sea shore. Clearly collecting sea water sample and measuring the oil pollution in it is strenuous and expensive. However the prevalence of pollution in the sea water is much reflected by the tar deposit in the surrounding terminal sea shore. In this example ranking the pollution level of sea water based on the tar deposit on the sea shore is more natural and scientific than ranking it visually or by judgement method.
Stokes (1977) proposes RSS mean as an estimator for the mean of the study variate, when an auxiliary variable is used for ranking the sample units, under the assumption that follow a bivariate normal distribution. While Barnett and Moore (1997) obtained the Best Linear Unbiased Estimator (BLUE) of the mean of the study variate based on a RSS when follow bivariate Normal distribution. Stokes (1995) studies the estimation of parameters in location-scale family of distributions using RSS. Lam et al. (1994, 1995) have obtained the BLUE of location and scale parameter of exponential and Logistic distribution. Chacko and Thomos (2007) obtained the BLUE of the parameter involved in the study variate based on a RSS when follows bivariate Pareto distribution. Chacko and Thomos (2007) estimated the parameters of Morgenstern type bivariate exponential distribution by RSS and censored RSS. Singh and Mehta (2014) derived linear shrinkage estimator of scale parameter of Morgenstern type bivariate logistic distribution using RSS. Singh and Mehta (2015) considered estimation of scale parameter of a Morgenstern type bivariate uniform distribution by censored RSS. Attia et al. (2014a) estimated the parameters of the bivariate Lomax distribution of Marshall-Olkin based on right censored sample using expectation-maximization (EM) algorithm. Further Attia et al. (2014b) considered generalized bivariate Lomax distribution and maximum likelihood estimation procedure is derived for the estimation its parameters based on censored samples.
The above overview highlights the role of Ranked Set Sampling scheme in estimation of parameters associated with a study variable correlated with auxiliary variable and having different bivariate distributions. In this paper, our main objective is to estimate the parameter associated with (study variable) which is correlated with (auxiliary variable) by RSS and censored RSS scheme where are assumed to have bivariate Lomax distribution. The paper is organized as follows.
In Section 2, we obtain an unbiased estimator of scale parameter of bivariate Lomax distribution based on RSS when shape parameter is known. While in Section 3, we derive its BLUE based on RSS. In Section 4, we obtain censored RSS estimator of scale parameter. We perform efficiency comparison for obtained RSS estimators with that of estimator based on simple ransom sample (SRS). Section 5, concludes the paper with final remarks.
Estimator of scale parameter based on RSS
Consider the bivariate Lomax distribution (Sankaran and Nair (1993)) with parameters and probability density function (pdf) given by,
Each marginal of the above bivariate Lomax distribution , is univariate Lomax (Lomax (1954)). This distribution is also known as Pareto distribution of type II. Particularly, marginal distribution of is , a univariate Lomax distribution and is given by,
The mean and variance of is given by,
Note that the distribution of is free of scale parameter . Our objective is to propose different RSS estimators for scale parameter when shape parameter is known.
Suppose random samples each of size are drawn from . Let be the order statistic of the auxiliary variable in the sample and be the measurement on the variate associated with . Here is referred as the concomitant of order statistics, and is referred as RSS from distribution. The expressions for mean and variance of given by Begum (2003) are as follows.
where
Note that since and are non-negative for all .
Observe that , for , since and are drawn from two independent sets of samples. Some other expressions for the concomitant of order statistic for the distribution are given by Begum (2003) and Nayabuddin (2013).
Based on RSS from distribution, we propose an unbiased estimator of scale parameter under the assumption that the shape parameter is known. The estimator is defined as,
Clearly is an unbiased estimator of . Variance of is,
where and are as defined in Eqs 2 and 3 respectively.
The unbiased estimator of based on a SRS from is given by,
where .
The variance of is given by,
The Efficiency of over is,
Observe that as . Hence trivially Thus is efficient than i.e. an estimator of based on RSS is more efficient than that of estimator based on SRS. This can be seen through a numerical study presented in Table 1 where we evaluate for different values of and Figure 1 shows increase in with respect to for fixed . This supports to the conclusion that RSS always provides more information than SRS even if the ranking is imperfect (Chen et al. (2004)).
Efficiency of over and over
2
2.1
1.5782
2.3434
12
2.1
4.3010
1.5219
2.5
1.2570
1.1466
2.5
1.8439
1.0643
2.9
1.1498
1.0500
2.9
1.4382
1.0238
3.3
1.0991
1.0229
3.3
1.2782
1.0115
4
2.1
2.4603
2.1435
14
2.1
4.5630
1.4579
2.5
1.5202
1.1328
2.5
1.8767
1.0568
2.9
1.2866
1.0471
2.9
1.4528
1.0211
3.3
1.1856
1.0222
3.3
1.2870
1.0103
6
2.1
3.1038
1.8918
16
2.1
4.7837
1.4078
2.5
1.6573
1.1063
2.5
1.9027
1.0508
2.9
1.3527
1.0384
2.9
1.4643
1.0189
3.3
1.2263
1.0184
3.3
1.2940
1.0092
8
2.1
3.5958
1.7231
18
2.1
4.9725
1.3676
2.5
1.7425
1.0876
2.5
1.9238
1.0460
2.9
1.3923
1.0320
2.9
1.4737
1.0172
3.3
1.2505
1.0154
3.3
1.2996
1.0084
10
2.1
3.9850
1.6065
20
2.1
5.1357
1.3346
2.5
1.8010
1.0742
2.5
1.9414
1.0420
2.9
1.4189
1.0273
2.9
1.4814
1.0157
3.3
1.2666
1.0132
3.3
1.3042
1.0077
across when .
BLUE of scale parameter based on RSS
In this section we obtain BLUE of based on RSS under the assumption that is known. Let be the measurement made on the variate corresponding to the observation on the auxiliary variate in the unit of the RSS. Thus we have RSS from distribution. Let . Then the mean vector and the dispersion matrix of is given by,
where , and and are as defined in Eqs 2 and 3 respectively.
The BLUE of based on RSS obtained by using theory of generalized linear model for with (David and Nagaraja (2003)) is given by,
As , Thus is more efficient than . Further it can be seen that,
The for different values of and are reported in Table 1. Observe that approaches to 1 for higher values of when () is fixed. This is presented in Figs 2 and 3.
across when .
across when .
Remark: Observe that for obtaining all estimators of , is assumed to be known. In usual cases is not known. In this situation we suggest to obtain a rough estimator of using correlation coefficient. For the correlation coefficient between and is given by for If is the sample correlation coefficient between and then moment estimator for is given by The estimate will lead to feasible value of when . Specifically, when is negative and for which leads to non-existence of . Other possible choice for can be according to Begum (2003) who proposed an estimator of based on Concomitants of order statistics. There can be other choices for estimator of .
Estimator of scale parameter based on censored RSS
Consider an example of pollution study on sea samples discussed by Bain (1978). If there is no tar deposit at the seashore, then the corresponding sea sample will be treated as censored as these units can not be measured. For ranking on observations in a sample, the censored units are assumed to have distinct and consecutive lower ranks and the remaining units are ranked with the next higher ranks in a natural order. If in this censored scheme of RSS, units are censored, then we may represent the RSS observations on the study variate in RSS as where
such that .
In this case the usual censored RSS mean is,
It may be noted that need not occur in a natural order for . Let the integers be such that and . Then,
where
Therefore, the RSS mean in censored case is not an unbiased estimator for . However we can construct an unbiased estimator based on .
Let be the censored RSS observations on the study variate when the ranking is applied to the auxiliary variate from . When is known, an unbiased estimator of based on censored RSS of size is given by,
and where and .
We now propose the BLUE of based on the censored RSS, resulting out of ranking of observations on . Let be the vector of observations in censored RSS. The mean vector and the dispersion matrix of are given by,
where and .
If involved in and is known, then Eqs 8 and 9 defines a generalized Gauss-Markov set up and hence the BLUE of is obtained as,
and
Simplifying the expression from Eqs 10 and 11, we get,
where
Conclusions
In this paper, we consider a bivariate Lomax distribution and obtain an unbiased estimator and BLUE of a scale parameter associated with a study variable based on RSS. Further, BLUE of the scale parameter in case of censored RSS is also obtained. We perform efficiency comparison of proposed RSS estimators with that of estimator based on SRS. The efficiency performance of proposed estimators is studied. It is observed that, an estimator based on RSS is more efficient than that of estimator based on SRS. Moreover the efficiency based on RSS estimator increases with for fixed over SRS estimator.
Footnotes
Acknowledgments
The authors would like to thank the reviewers for their constructive comments which helped to improve the paper.
References
1.
AttiaA. F.ShabanS. A., & AmerY. M. (2014a). Parameter estimation for the bivariate lomax distribution based on censored samples. Applied Mathematical Sciences, 8(35), 1711-1721.
2.
AttiaA. F.ShabanS. A., & AmerY. M. (2014b). Estimation of the bivariate generalized lomax distribution parameters based on censored samples. Int J Contemp Math Sciences, 9(6), 257-267.
3.
BainL. J. (1978). Statistical analysis of reliability and life testing models: Theory and methods. New York: Marcel Dekker.
4.
BarnettV. & MooreK. (1997). Best linear unbiased estimates in Ranked Set Sampling with particular reference to imperfect ordering. Journal of Applied Statistics, 24, 697-710.
5.
BegumA. A. (2003). Concomitants of order statistics from bivariate pareto II distribution. International Journal of Statistical Sciences, 2, 27-35.
6.
ChenZ.BaiZ., & SinhaB. (2004). Lecture Notes in Statistics, Ranked Set Sampling: Theory and Applications. Springer, New York.
7.
ChackoM., & ThomasP. Y. (2007). Estimation of a parameter of bivariate pareto distribution by Ranked Set Sampling. Journal of Applied Statistics, 34(6), 703-714.
8.
ChackoM., & ThomasP. Y. (2008). Estimation of a parameter of Morgenstern type bivariate exponential distribution by Ranked Set Sampling. Annals of the Institute of Statistical Mathematics, 60(2), 301-318.
9.
DavidH. A. & NagarajaH. N. (2003). Order Statistics. 3rd Edn. John Wiley and Sons, New York.
10.
LamK.SinhaB. K., & WuZ. (1994). Estimation of parameters in a two-parameter exponential distribution using ranked set sample. Annals of the Institute of Statistical Mathematics, 46(4), 723-736.
11.
LamK.SinhaB. K., & WuZ. (1996). Estimation of location and scale parameters of a logistic distribution using a ranked set sample. In Statistical Theory and Applications. Springer New York, pp. 187-197.
12.
LomaxK. S. (1954). Business failures: Another example of the analysis of failure data. Journal of the American Statistical Association, 49(268), 847-852.
13.
McIntyreG. A. (1952). A method for unbiased selective sampling, using ranked sets. Australian Journal of Agricultural Research, 3(4), 385-390.
14.
Nayabuddin. (2013). Concomitants of generalized order statistics from bivariate Lomax distribution. ProbStat Forum, 6, 73-88.
15.
SankaranP. G., & NairN. U. (1993). A bivariate Pareto model and its applications to reliability. Naval Research Logistics (NRL), 40(7), 1013-1020.
16.
SinghH. P., & MehtaV. (2014). Linear shrinkage estimator of scale parameter of Morgenstern type bivariate logistic distribution using Ranked Set Sampling. Model Assisted Statistics and Applications, 9(4), 295-307.
17.
SinghH. P., & MehtaV. (2015). Estimation of scale parameter of a Morgenstern type bivariate uniform distribution using censored ranked set samples. Model Assisted Statistics and Applications, 10(2), 139-153.
18.
Lynne StokesS. (1977). Ranked set sampling with concomitant variables. Communications in Statistics-Theory and Methods, 6(12), 1207-1211.