Abstract
In this article, we have suggested a new modified mixed randomized response (RR) model and studied its properties. It is shown that the proposed mixed RR model is always more efficient than the Kim and Warde’s mixed RR model. The proposed mixed RR model has also been extended to stratified sampling. Numerical illustrations and graphical representations are also given in support of this study.
Keywords
Introduction
In survey of human population, questions requiring personal or controversial assertions often run into trouble in terms of resistance. It is difficult to collect reliable data from interviewees and hard to raise the quality of responses when the survey topic is sensitive. The purpose is to formulate an effective random device so as to induce each respondent to give truthful answers to sensitive questions without exposing his or her true identity to the interviewer. A large number of rectifications on Warner’s (1965) pioneering technique have been cited in the literature, for instance, see Fox and Tracy (1986), Chaudhuri and Mukherjee (1987, 1988), Hedayat and Singh (1991), Ryu, Hong, and Lee (1993), Tracy and Mangat (1996), and Singh (2003), for the reviews. Some other developments on randomized response (RR) sampling in recent years include Mahmood, Singh, and Horn (1998), Ryu et al. (2005–2006), Hong (2005–2006), Grewal, Bansal, and Sidhu (2005–2006), Perri (2008), Mahajan, Sharma, and Gupta (2007), Chua and Tsui (2000), Singh, Singh, and Mangat (2000), Chang and Huang (2001), Chang, Wang, and Huang (2004), Huang (2004), Land, Singh, and Sedory (2011), Kim and Warde (2004), Kim and Elam (2005), and so on.
To implement the privacy problem with the Moors (1971) model, Mangat, Singh, and Singh (1997) and Singh et al. (2000) have given several strategies as an alternative to Moors (1971) model, but their models may lose a large portion of data information and require a high cost to obtain confidentiality of the respondents. These drawbacks with the previous alternative models for the Moors model motivated Kim and Warde (2005) to envisage a mixed RR model using simple random sampling with replacement that modifies the privacy problem.
In this article, we have suggested a modification of Kim and Warde’s (2005) model to estimate the proportion of a qualitative sensitive variable. It has been demonstrated that the suggested model performs better than the mixed RR model of Kim and Warde (2005). In the fourth section, we introduce the stratified version of the proposed mixed RR model along with its properties. Comparison of the proposed stratified mixed RR model with that of Kim and Warde’s (2005) model has been given in the fifth section. The empirical studies performed show that, the proposed mixed RR model is more efficient than Kim and Warde’s (2005) model in stratified sampling too.
The Suggested Model
A single sample with size n is selected by simple random sampling with replacement (SRSWR) from the population. Each respondent from the sample is instructed to answer the direct question, “I am a member of the innocuous trait group.” If a respondent answers “Yes” to direct question, then he or she is instructed to go to randomization device R1 consisting of the statements (i) “I am a member of the sensitive trait group” and (ii) “I am a member of the innocuous trait group” with probabilities of selection P1 and (1 − P1), respectively. If a respondent answers “No” to the direct question, then the respondent is instructed to use a randomization procedure due to Mangat (1994). In the Mangat’s (1994) RR procedure, each respondent is instructed to say “Yes” if he or she is a member of the sensitive trait group. If he or she is not a member of the sensitive trait group, then the respondent is required to use the Warner’s (1965) randomization device R2 consisting of the statements: (i) “I belong to the sensitive trait group” and (b) “I do not belong to the sensitive trait group” with preassigned probabilities P and (1 − P), respectively. Then he or she is to report “Yes” or “No” according to the outcome of the randomization device R2 and the actual status that he or she has with respect to the sensitive trait group. The survey procedures are performed under the assumption that both the sensitive and the innocuous questions are unrelated and independent in a randomization device R1. To protect the respondent’s privacy, the respondents should not disclose to the interviewer the question they answered from either R1 or R2.
Let n be the sample size confronted with a direct question, and n1 and n2 (= n − n1) denote the number of “Yes” and “No” answers from the sample. Since all the respondents using a randomization device R1 already responded “Yes” from the initial direct innocuous question, the proportion “Y” of getting “Yes” answers from the respondents using randomization device R1 should be
An unbiased estimator of
The proportion of “Yes” answers from the respondents using Mangat’s (1994) randomization procedure is given by:
An unbiased estimator of
Its variance is given by:
Now we shall pool the two unbiased estimators using weights, to formulate an estimator for
Since the randomization device R1 and the used randomization procedure due to Mangat (1994) are independent, therefore,
Since the proposed mixed RR model also use Simmon’s method
Setting
Thus, we established the following theorem.
Efficiency Comparisons
An efficiency comparison of the proposed mixed RR model, under completely truthful reporting case, has been done with Kim and Warde’s (2005) model.
From Kim and Warde’s (2005, eq. 2.10:213), the variance of the Kim and Warde (2005) estimator
From equations (12) and (13), we have
Thus, the proposed estimator
To have the insight over the performance of the proposed mixed RR model, we have computed the percentage relative efficiency (PRE) of the proposed estimator
Percentage Relative Efficiency of the Proposed Estimator

Percentage relative efficiency of the proposed estimator
It is observed from Table 1 and Figure 1 that the values of
We note from Table 1 that the values of the
A Mixed RR Model Using Stratification
Stratified random sampling is generally obtained by dividing the population into nonoverlapping groups called strata and selecting a simple random sample from each stratum. The main advantage of the stratified approach is that the technique overcomes the limitation of the loss of individual characteristics of the respondents. An RR technique using a stratified random sampling gives the group characteristics related to each stratum estimator. Also, stratified sampling protects a researcher from the possibility of obtaining a poor sample. Hong, Yum, and Lee (1994) suggested a stratified RR technique using a proportional allocation. Kim and Warde (2004) presented a stratified RR model based on Warner (1965) model that has an optimal allocation and large gain in precision. Kim and Elam (2005) suggested a two-stage stratified Warner’s RR model using optimal allocation. Further Kim and Warde (2005) suggested a mixed stratified RR model.
In the proposed model, the assumptions for a stratified mixed RR model are similar to Kim and Warde (2004) and Kim and Elam (2005) model. We assume that the population is partitioned into “r” nonoverlapping strata, and a sample is selected by simple random sampling with replacement from each stratum. To get the full benefit from stratification, we assumed that the number of units in each stratum is known. In this proposed model, an individual respondent in a sample from each stratum is instructed to answer a direct question “I am a member of the innocuous trait group.” Respondents answer the direct question by “Yes” or “No.” If a respondent answers “Yes,” then he or she is instructed to go to the randomization device Rk1 consisting of statements: (i) “I am the member of the sensitive trait group” and (ii) “I am a member of the innocuous trait group” with preassigned probabilities Qk and (1 − Qk), respectively. If a respondent answers “No,” then the respondent is instructed to use a randomization procedure due to Mangat (1994). In the Mangat’s (1994) RR procedure, each respondent is instructed to say “Yes” if he or she is a member of the sensitive trait group. If he or she is not a member of the sensitive trait group, then the respondent is required to use the Warner’s (1965) randomization device Rk2 consisting of the statement: (i) “I belong to the sensitive trait group” and (b) “I do not belong to the sensitive trait group” with preassigned probabilities Pk and (1 − Pk), respectively. Then he or she is to report “Yes” or “No” according to the outcome of the randomization device Rk2 and the actual status that he or she has with respect to the sensitive trait group. The survey procedures are performed under the assumption that both the sensitive and the innocuous questions are unrelated and independent in a randomization device Rk1. To protect the respondent’s privacy, the respondents should not disclose to the interviewer the question they answered from either Rk1 or Rk2. Suppose we denote mk as the number of units in the sample from stratum k and n as the total number of units in samples from all strata. Let mk1 be the number of people responding “Yes” when respondents in a sample mk were asked the direct question and mk2 be the number of people responding “No” when respondents in a sample mk were asked the direct question so that
Since the respondent performing a randomization device Rk1 respond “Yes” to the direct question of the innocuous trait, if he or she chooses the same innocuous question from Rk1, then
The proportion of “Yes” answers from the respondents using Mangat (1994) RR technique will be
The unbiased estimator of
Its variance is
Thus, the unbiased estimator of
In order to do the optimal (Neyman) allocation of a sample size n, we need to know
The minimal variance of the estimator
Efficiency Comparison
In this section, we do an efficiency comparison of Kim and Warde’s (2005) stratified mixed RR model with our proposed stratified mixed RR model by way of variance comparison.
The minimal variance of the Kim and Warde’s (2005) estimator
The minimal variance of the proposed estimator
From equations (28) and (29), we have
It follows from equation (30) that our proposed stratified mixed RR model is superior to the one earlier considered by Kim and Warde (2005).
To have tangible idea about the performance of the proposed stratified estimator
We have obtained the values of the PREs for different cases of
Percentage Relative Efficiency of the Proposed Stratified Estimator

Percentage relative efficiency of the proposed stratified estimator
Table 2 exhibits that the PRE of the proposed stratified estimator
Discussion
In this article, we have envisaged a mixed RR model as well as its stratified version to estimate the proportion of qualitative sensitive character. It has been shown both theoretically and empirically that the proposed mixed RR model and its stratified version are always better than the Kim and Warde’s (2005) mixed randomize response models with larger gain in efficiency. Thus, our recommendation is to prefer the proposed mixed RR model and its stratified version in practice.
Footnotes
Acknowledgements
The authors are thankful to the editor in chief—Professor Christopher Winship, and to the three anonymous learned referees for their valuable suggestions regarding improvement of the article.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
