In the literature, an extensive work on sequential fixed-width confidence interval for the parameter of U(q, mq) model, where m > 1 is known, is available. In this article, we propose a two-stage sampling procedure for estimating the parameter q of U(aq, bq) distribution, where a < b are positive and known. Here, the risk of an estimator of q is less than a pre-assigned number w (>0), that is, , 0 < A < ∞ is known. We determine the parameter Bk of stopping variable so that the risk is uniformly bounded by a pre-assigned value w. We have also tabulated the values of the expected stopping time and its standard deviation (SD).
Graybill and Connell,[1] Cooke,[2, 3] Govindarajulu,[4, 5] Akahira and Koike,[6] and Koike[7] have introduced many sequential estimation methods for uniform distribution. The problem of obtaining confidence intervals having a specified width for the parameter in the density U(θ, mθ) distribution, where m > 1 is known and θ > 0, have been considered by Patil and Rattihalli.[8] Bhattacharjee and Mukhopadhyay[9] have discussed the purely sequential procedure for the unknown parameter θ of U(0, θ) distribution. The unknown parameter θ is estimated by four different estimators in stopping rule, and the two different estimators of θ were proposed in the loss function. Patil[10] has considered the two-stage estimation procedure for the parameter of U(θ, mθ) distribution. Bhattacharjee and Mukhopadhyay[11] have proposed the purely sequential minimum risk point estimation procedure for the parameter θ of the U(0,θ) distribution. Patil[12] has considered the purely sequential procedure for the parameter of the U(θ, mθ) distribution.
Various methods of sequential estimation of the scale parameter of an exponential distribution have been introduced by many authors, for example, Zacks and Mukhopadhyay,[13, 14] Mukhopadhyay and Pepe[15], Zacks,[16] etc. Zacks and Khan[17] studied the confidence intervals of the mean and scale parameter of a gamma distribution. Mahmoudi and Roughani[18] have considered bounded risk estimation of the scale parameter of a gamma distribution in a two-stage sampling procedure. For details, see Ghosh et al.[19]
The U(aθ, bθ) distribution is appropriate in a following situation. Consider an agriculture experiment where we want to study the impact of unknown soil fertility gradient θ of a plot on the yield/growth of a certain crop, which is an observable random variable, say X, whose range depend on θ, say aθ and bθ, where a < b are positive and known. It is but natural to assume that both aθ and bθ are increasing functions of θ. Assuming that θ is the only unknown entity, the random variable X has U(aθ, bθ) distribution. The problem of interest is to find a point estimate of soil fertility gradient (θ).
In this article, we propose an efficient two-stage procedure for estimating the parameter θ of U(aθ, bθ) distribution. Section 2 contains the fixed sample size procedure (FSS) solution and estimation problem. In Sections 3, we propose a two-stage procedure and compute value of B = Bk. In Section 4 we give the average sample number (ASN) function and standard deviation (SD) of Nk. In Section 5, some numerical values of ASN function and SD are computed.
Fixed Sample Size Procedure
Let X1, X2, …, .Xn be independent identical distributed (iid) random variables with U(aθ, bθ) distribution, where a < b are positive and known. Let X(1) = min(X1, X2, …, Xn) and X(n) = max(X1, X2, …, Xn). Note that X(n)/b ≤ θ ≤ X(1)/a almost surely (as). Then X(n)/b is the maximum likelihood estimator ofθ. That is = X(n)/b and the loss function for estimating θ by is given by
where A is positive known weight. Our goal is to make the associated risk less than a pre-assigned number w (> 0); that is, .
The risk in estimating by is and this risk will be at most w that is , which implies We know that (n+1) (n+2) > (n+1)2. So, we have that is , where n* is called the “optimal fixed sample size”. When is unknown, FSS procedure fails. In the light of this problem, we propose an efficient two-stage procedure.
Two-stage Procedure
Stage 1: For a fixed k, take an initial sample X1, X2, …, Xk from U(aθ, bθ) distribution. Then determine D = min(X1, X2, …, Xk) and X = max(X1, X2, …, Xk). Take We propose the stopping rule:
where B is a positive coefficient and denotes the largest integer less thanx. The coefficient B will be determined appropriately as the risk is bounded by w. We will see that B is only a function of A, k, a and b. While B is known, if Nk = k, stop and do not take more observation in the second stage, otherwise go to the second stage.
Stage 2: If Nk > k, the initial sample is not large enough, we must gather Nk − k additional observation in the second stage, say . Let . We estimate the parameter θ by . The risk associated with this estimator is given by . If Fk is the σ-field generated by X1, X2, ….Xk then Xk+1, Xk+2, …..are independent of Fk. Now we obtain the value of B. Now, we obtain the value of B:
We know that there are k samples in the first stage and (Nk − k) samples in the second stage. Thus
Since
and
so we have
We can write
Since algebraic sum of deviation of observations about its mean is zero, we have
where
and
Now,
and
We know (Nk − k +1) (Nk − k + 2) > (Nk − k)2 and 1/(Nk − k)2 < 1/(Nk − k). Further
Let Y = max(Y1, Y2, .......Yk ) = max(X1, X2, ...., Xk)/θ = X/θ, where Yi → U(a,b). Therefore, so that Equations (3.3) and (3.4) become
Taking addition of Equations (3.5) and (3.6), we get the lower bound for risk as below
But , it is sufficient that
Distribution of Nk
The random variable Nk is defined by (3.1), it can take the values {k, k + 1, …} and hence it is discrete random variable.
Thus, the stopping rule is closed.
Now, we develop the formulas of the first and second moments of Nk.
and
where Y = X/θ and
where is cumulative distribution function (cdf) of Y and
Hence, the variance of Nk is
Simulation Results
In this section, we compute optimal fixed sample size (n*) ASN and SD by simulation based on 10,000 repetitions. We take A = 2, θ = 15 and θ = 10 and w = 1, 0.5, 0.25, 0.1, 0.05, 0.025, 0.01. Pseudorandom samples from uniform population are drawn by using R programme. We compute simulated risk () = (see Tables 1–6).
Remark 5.1: From Tables 1–6, we observe that as value of w decreases, n*, E(Nk) and SD increases.
Remark 5.2: From Tables 1–6, we observe that as the value of k increases, SD decreases and E(Nk), first, increases, then slightly decreases.
Remark 5.3: From Tables 1–6, we observe that as the value of θ increases, E(Nk) and SD increases.
Remark 5.4: From Tables 1–6, we observe that as value of parameter b increases, E(Nk) and SD increases.
Remark 5.5: From Tables 1–6, we observe that the simulated risk is much less than the pre-assigned number w. Hence, one can adjust the coefficient B such that the risk remains less than w.
Numerical values of ASN and SD of rule (3.1)
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
9
13.1421
19
30.6228
43.7214
62.2456
99
E(Nk)
10.6213
14.6263
20.4504
32.1421
45.2439
63.8077
100.5816
SD
0.4851
0.6238
0.8408
1.4043
1.9472
2.7767
4.3552
0.3187
0.1818
0.1078
0.0451
0.0225
0.0119
0.0048
k = 30, A = 2, q = 10, b = 2, a = 1
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
9
13.1421
19
30.6228
43.7214
62.2456
99
E(Nk)
30
30
30
32.1157
45.2811
63.7773
100.5279
SD
0
0
0
0.6120
0.8129
1.0791
1.6248
0.0488
0.0507
0.0475
0.0443
0.0234
0.0120
0.0048
Source: All ta Table is obtained by using rule (3.1). Created by author.
Numerical values of ASN and SD of rule (3.1)
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
14
20.2132
29
46.4342
66.0820
93.8683
149
E(Nk)
15.5218
21.7447
30.5520
47.9793
67.64801
95.4580
150.6308
SD
0.6956
0.9946
1.3651
2.0857
2.9431
4.1467
6.5352
0.3696
0.2027
0.1057
0.0462
0.0243
0.0121
0.0050
k= 30, A = 2, θ = 15, b = 2, a = 1
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
14
20.2132
29
46.4342
66.08204
93.8683
149
E(Nk)
30
30
30.6288
47.9541
67.5907
95.4052
150.5347
SD
0
0
0.4831
0.8431
1.1168
1.5600
2.4154
0.1169
0.1141
0.1076
0.0458
0.0229
0.0124
0.0049
Source: Table is obtained by using rule (3.1). Created by author.
Numerical values of ASN and SD of rule (3.1)
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
14
20.2132
29
46.4342
66.0820
93.8683
149
E(Nk)
15.4444
21.6913
30.4545
47.8678
67.4877
95.2481
150.3052
SD
1.0112
1.4442
2.0182
3.1712
4.4760
6.3384
10.0108
0.4056
0.2030
0.1128
0.0456
0.0242
0.0119
0.0048
k = 30, A = 2, q = 10, b = 4, a = 1
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
14
20.2132
29
46.4342
66.0820
93.8683
149
E(Nk)
30
30
30.6236
47.9533
67.5769
95.3534
150.481
SD
0
0
0.4845
1.1938
1.6362
2.2987
3.6165
0.1155
0.1140
0.1047
0.0465
0.0224
0.0118
0.0049
Source: Table is obtained by using rule (3.1). Created by author.
Numerical values of ASN and SD of rule (3.1)
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
21.5
30.8198
44
70.1512
99.6231
141.3025
224
E(Nk)
22.9546
32.2648
45.4434
71.5610
100.9897
142.6208
225.206
SD
1.5162
2.1298
3.0193
4.7583
6.7158
9.5011
15.0110
0.4320
0.2215
0.1167
0.0485
0.0245
0.0124
0.0051
k = 30, A = 2, q = 15, b = 4, a = 1.
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
21.5
30.8198
44
70.1512
99.6231
141.3025
224
E(Nk)
30
32.3626
45.4673
71.6163
101.0929
142.7772
225.4703
SD
0
0.7932
1.1037
1.7047
2.4203
3.4244
5.4153
0.2609
0.2204
0.1175
0.0477
0.0242
0.0120
0.0050
Source: Table is obtained by using rule (3.1). Created by author.
Numerical values of ASN and SD of rule (3.1)
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
9
13.1421
19
30.6228
43.7214
62.24555
99
E(Nk)
10.6213
14.6262
20.4504
32.1421
45.2439
63.8076
100.5816
SD
0.4850
0.6238
0.8408
1.4043
1.9472
2.7767
4.3552
0.3114
0.1795
0.1085
0.0446
0.0228
0.0115
0.0048
k = 30, A = 2, q = 10, b = 4, a = 2.
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
9
13.1421
19
30.6228
43.7214
62.2455
99
E(Nk)
30
30
30
32.1157
45.2810
63.7773
100.5279
SD
0
0
0
0.6120
0.8128
1.0791
1.6248
0.0506
0.0509
0.0511
0.0447
0.0229
0.0116
0.0049
Source: Table is obtained by using rule (3.1). Created by author.
Numerical values of ASN and SD of rule (3.1)
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
14
20.2132
29
46.4342
66.0820
93.8683
149
E(Nk)
15.5218
21.7447
30.5520
47.9793
67.6480
95.4580
150.6308
SD
0.6956
0.9946
1.3651
2.0857
2.9431
4.1467
6.5352
0.3626
0.2005
0.1063
0.0459
0.0242
0.0120
0.0048
k = 30, A = 2, q = 15, b = 4, a = 2.
w
1
0.5
0.25
0.1
0.05
0.025
0.01
n*
14
20.2132
29
46.4341
66.0820
93.86833
149
E(Nk)
30
30
30.6288
47.9541
67.5907
95.4052
150.5347
SD
0
0
0.48311
0.8431
1.1168
1.5600
2.4154
0.1129
0.1113
0.1077
0.0452
0.0235
0.0123
0.0049
Source: Table is obtained by using rule (3.1). Created by author.
Footnotes
Declaration of Conflicting Interests
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
ORCID iD
V. N. Kadam
References
1.
GraybillAConnellTL.Sample size required to estimate the parameter in uniform density within ‘d’ units of true value. J Am Stat Assoc. 1964; 59: 550–556.
2.
CookePJ.Sequential estimation in the uniform density. J Am Stat Assoc. 1971; 66(335): 614–617.
3.
CookePJ.Two-stage sequential estimation in uniform density. J Am Stat Assoc. 1973; 68(341): 107–108.
4.
GovindarajuluZ.A note no two-stage and sequential fixed-width intervals for the parameter in the uniform density. Stat Probab Lett. 1997; 36: 179–188.
5.
GovindarajuluZ.Erratum: a note on two-stage and sequential fixed-width intervals for the parameter in the uniform density. Stat Probab Lett. 1999; 42: 213–215.
6.
AkahiraMKoikeK.Sequential interval estimation of a location parameter with the fixed width in the uniform distribution with an unknown scale parameter. Seq Anal. 2005; 24: 63–75.
7.
KoikeK.Sequential interval estimation of a location parameter with fixed width in the non-regular case. Seq Anal. 2007; 26: 63–70.
8.
PatilHSRattihalliRN.Fixed width confidence interval for U(θ, mθ) distribution. Commun Stat (Theory Methods). 2011; 40: 907–915.
9.
BhattacharjeeDMukhopadhyayN.On recovering lost information for sequential estimation in a uniform distribution. Proceedings of The 58th World Statistical Congress (Session STS005). International Statistical Institute, Dublin, Ireland; 2011.
10.
PatilHS.Two-stage estimation procedure for the parameter of U(θ, mθ) distribution. Calcutta Stat Assoc Bull. 2012; 64: 255–256, 257–263.
11.
BhattacharjeeDMukhopadhyayN.On sequential point estimation in a uniform distribution with adjusted non-sufficient estimators: a comparative study and data illustration. Calcutta Stat Assoc Bull. 2013; 65: 103–122.
12.
PatilHS.A purely sequential estimation procedure for the parameter of U(θ, mθ) distribution. Calcutta Stat Assoc Bull. 2014; 66: , 235–240, 263–264.
13.
ZacksSMukhopadhyayN.Bounded risk estimation of the exponential parameter in a two-stage sampling. Seq Anal. 2006a; 25: 437–452.
14.
ZacksSMukhopadhyayN.Exact risk of sequential point estimators of the exponential parameter. Seq Anal. 2006b; 25: 203–220.
15.
MukhopadhyayNPepeW.Exact bounded risk estimation when the terminal sample size and estimator are dependent: The exponential cases. Seq Anal. 2006; 29: 85–101.
16.
ZacksS.The exact distributions of the stopping times and their functionals in two-stage and sequential fixed-width confidence intervals of the exponential parameter. Seq Anal. 2009; 28: 69–81
17.
ZacksSKhanA.Two-stage and sequential estimation of the scale parameter of a gamma distribution with fixed-width intervals. Seq Anal. 2011; 30: 297–307.
18.
MahmoudiERoughaniG.Bounded risk estimation of the scale parameter of a gamma distribution in a two-stage sampling procedure. Seq Anal. 2015; 34: 25–38.
19.
GhoshMMukhopadhyayNSenPK.Sequential Estimation. New York, NY: Wiley.