Abstract
In this paper, an attempt has been made to study the estimation of finite population variance in simple random sampling without replacement (SRSWOR) in presence of non-response by proposing two estimators. The two estimators (dual to ratio and ratio cum dual to ratio type) have been proposed on the basis of availability and non-availability of auxiliary information. The properties such as bias and mean square error of the proposed estimators have also been studied up to first order of approximation. The proposed estimators are compared with some existing estimators and are also mutually compared. An empirical study, based on both vegetable crop data and simulated data, has been performed to find out the best estimator. It has been seen that out of the two proposed estimators, the ratio cum dual to ratio type estimator performed better than dual to ratio estimator.

Introduction
The auxiliary information includes a known variable (auxiliary variable) to which the variable of interest (study variable) is related. For example, production of a crop depends on area under cultivation. It is well known that the proper use of auxiliary information results in substantial increase in the precision of estimators. By applying auxiliary information in different forms, estimators for population parameters generally population mean and variance are made by various statisticians, namely, Cochran (1940), Olkin (1958), Srivastava and Jhajj (1981), Bahl and Tuteja (1991) and Singh et al. (2016).
Sometimes, there also arises a situation where information about some observations which are included in the sample is not available leading to incompleteness in the data. This type of problem is termed as non-response. Hansen and Hurwitz (1946) were the first to deal with the problem of incomplete samples in mail surveys. They proposed a model for mail survey design to provide unbiased estimators of population mean or total. Okafor and Lee (2000) applied Hansen and Hurwitz (1946) technique for ratio and regression estimator. Tracy and Osahan (1994) studied the effect of random non-response on the usual ratio estimator of population mean in two situations: (i) non-response in study as well as the auxiliary variable and (ii) non-response in the study variable only.
Singh and Joarder (1998) considered the problem of estimation of finite population variance in the presence of non-response in survey sampling. Singh (2003) compared the efficiency of estimators proposed by Singh and Joarder (1998) by using the data set on real estate farm loans and non real estate farm loans. After that, various researchers including Ahmed et al. (2005), Dubey and Uprety (2008), Kumar and Bhougal (2011), Kumar (2014), Shahzad et al. (2017), Bhat et al. (2018), Kumar et al. (2018), Irfan et al. (2018), Mittal and Kumar (2021) and Priyanka (2021) have contributed for the estimation of population mean and population variance under random non-response.
In this paper, an attempt has been made to study the estimation of finite population variance in simple random sampling without replacement (SRSWOR) in presence of non-response by proposing two estimators. The two estimators (dual to ratio and ratio cum dual to ratio type) have been proposed on the basis of availability and non-availability of auxiliary information and their bias and mean square error expressions have been derived. The efficiency of the proposed estimators has been compared with existing estimators theoretically as well as numerically.
Distribution of random non response and notations
Consider a finite population
where
We write,
and
are conditionally unbiased estimators of finite population variance of study variable
and auxiliary variable
respectively, and
and
Under (1), the following results are obtained,
and
where
and
For estimating the population variance
The usual unbiased estimator for the population variance
The variance of
Singh and Joarder (1998) proposed following estimators under situation I and II and studied the properties of estimators under non-response.
The bias and mean square error (MSE) of
In this section, two estimators viz. dual to ratio and ratio cum dual to ratio type estimators of population variance under two different situations of random non-response have been proposed by modifying the population variance estimator of Yadav and Kadilar (2013).
Dual to ratio estimator under different situations of random non-response
where,
where,
Taking expected value on both sides of Eq. (14) and using the results on expectations from Section 2, we get
The bias of estimator
From Eq. (14), we have
Squaring on both sides of Eq. (15) and neglecting the higher order terms, we get
Taking expected value on both sides of Eq. (16) and using the results on expectations from Section 2, we get
The mean square error of estimator
Hence the theorem.
Taking expected value on both sides of Eq. (19) and using the results on expectations from Section 2, we get
The bias of estimator
From Eq. (19), we have
Squaring on both sides of Eq. (20) and neglecting the higher order terms, we get
Taking expected value on both sides of Eq. (21) and using the results on expectations from Section 2, we get
The mean square error of estimator
Hence the theorem.
where
where
The estimator
We assume that
Taking expected value on both sides of Eq. (26) and using the results on expectations from Section 2, we get
The bias of estimator
The bias in estimator
Also,
Squaring on both sides of Eq. (28) and neglecting the higher order terms, we get
Taking expected value on both sides of Eq. (29) and using the results on expectations from Section 2, we get
The mean square error of estimator
Differentiating Eq. (38) with respect to
Optimum value of
Putting Eqs (4.2) and (32) in Eqs (27) and (4.2), we get Eqs (24) and (25). Hence the theorem.
where
The estimator
We assume that
Taking expected value on both sides of Eq. (35) and using the results on expectations from Section 2, we get
The bias of estimator
The bias in Eq. (36) is zero, when
Also,
Squaring on both sides of Eq. (37) and neglecting the higher order terms, we get
Taking expected value on both sides of Eq. (38) and using the results on expectations from Section 2, we get
The mean square error of estimator
Differentiating Eq. (39) with respect to
Putting Eqs (4.2) and (32) in Eqs (36) and (39), we get Eqs (33) and (34). Hence the theorem.
For efficiency comparisons following conditions have been obtained by comparing the proposed estimators
From Eqs (40) to ((iii)) it is observed that proposed ratio cum dual to ratio type estimator performed better than the usual unbiased, usual ratio and dual to ratio estimator under both situations of non-response.
In this section, we report the results of an empirical study that investigates the performance of proposed estimators. The empirical study includes application of both real data and simulation.
Real data application
The area and production of vegetable crop for Haryana (Source: Horticulture Department, Government of Haryana) from year 1970-71 to 2019-20 are considered for empirical study of the proposed estimators. The production under vegetable crop is considered as study variable
Percentage relative efficiency (PRE) of the proposed estimators is defined by
Table 1 shows the numerical results of the MSE and PRE of estimators obtained through real data. It can be easily observed that out of the two proposed estimators
Efficiency comparisons of the existing and proposed estimators through real data
Efficiency comparisons of the existing and proposed estimators through real data
Bold values represent the maximum percentage relative efficiency among the proposed and existing estimators.
Efficiency comparisons of the existing and proposed estimators under situation I through simulation study
Bold values represent the maximum percentage relative efficiency among the proposed and existing estimators.
Efficiency comparisons of the existing and proposed estimators under situation II through simulation study
Bold values represent the maximum percentage relative efficiency among the proposed and existing estimators.
Percent relative efficiency of existing and proposed class of estimators under situation I and II for different values of r (or p).
Simulation is performed to calculate the MSE of the estimators of
Select a (SRSWOR) of size 20 from the population of size 50. Select value of Drop Find the value of the estimator based on the Repeat steps (1) to (4) 50,000 times. Thus, we obtain 50,000 values for The MSE of proposed estimator is obtained by
Simulation results from Tables 2 and 3 shows that both the proposed estimators are more efficient than the usual unbiased estimator for different values of r (or p) (see Fig. 1). The critical findings of the simulation are listed below:
For different values of r (or p), the proposed estimator For different values of r (or p), the proposed estimator Both the proposed estimators
The dual to ratio and ratio cum dual to ratio type estimators are proposed for estimation of population variance under non-response considering two situations: i) when random non-response occurs on both
Footnotes
Acknowledgments
The authors are thankful to the reviewers for their valuable suggestions, which helped in improving the quality of this paper.
