Laplace probability density function with additional shape parameter that regulates the degree of skewness is a skew Laplace distribution. The various forms of skew Laplace distribution are found in the literature, the distributions defined by Mc Gill (1962), Holla and Bhattacharya (1968), Lingappaiah (1988), Fernandez and Steel (1998). The skew log Laplace distribution is the probability distribution of a random variable whose logarithm follows a skew Laplace distribution. In this paper, the classical optimum tests for skewness parameter of skew log Laplace distribution (SLLD) derived from Lingappaiah (1988) distribution are discussed. Uniformly most powerful test, uniformly most powerful unbiased test and Wald’s sequential probability ratio test for skewness parameter are compared. The exact likelihood ratio test and Neyman structure test for testing skewness parameter when scale parameter is known are derived. Finally, the underreported income of Road Transport Company is analysed on the basis of the tests derived in this paper.
Log Laplace distribution has been used by researchers to study growth rates, stock prices, exchange rates, income, and so on. Hartley and Revankar (1974) used skew log Laplace distribution to model underreported income and is described below.
Let be the true income which assumes two parameter Pareto distribution given below
Assuming that income is underreported, true income is unobserved. The underreported income and observational error are denoted as and respectively. Thus and assumes only non negative values. . The proportion of income which is not reported is denoted by and is distributed independently of (Krishnaji, 1970) with probability density function (p.d.f.) as shown below
Using Eqs (1) and (2), it is proved that the p.d.f. of the underreported income is
where , is scale parameter and , are shape parameters.
The p.d.f. in Eq. (3) is Skew log Laplace distribution (SLLD) with parameters . It can be easily proved that, assumes skew Laplace distribution given by Lingappaiah (1988). The distribution is scale symmetric, in the sense that, and have same p.d.f. Thus SLLD is symmetric, when . Let , where , then the p.d.f. is
The p.d.f. given in Eq. (4) is SLLD , where is skewness parameter. Testing for scale symmetry is to test whether . The p.d.f. given in Eq. (4) can also be written as shown below
where if and if .
Kozubowski and Podgorski (2003) have studied the properties and characterization of SLLD. In this paper tests for skewness parameter of SLLD are derived when scale parameter is known. In Sections 2 and 3 uniformly most powerful (UMP) test and sequential probability ratio test (SPRT) for skewness parameter are derived, when and are known respectively. In Section 4, uniformly most powerful unbiased (UMPU) test is discussed, for skewness parameter, for two sided alternative, when and are known. The more general, exact likelihood ratio test (LRT) and Neyman structure test for testing skewness parameter are derived, when is known and in presence of nuisance parameter , in Sections 5 and 6 respectively. The test for skewness parameter based on number of observations less than or equal to the scale parameter only (Binomial LRT), is also discussed. The critical values and values of power functions of UMP, UMPU, LR and Neyman structure tests are tabulated, at level of significance 0.05, for different sample sizes, in Appendix. Power and ASN values for SPRT are also presented in Appendix. Section 7 demonstrates the application of these tests to the underreported income of bus drivers of Road Transport Company.
Uniformly most powerful (UMP) test
Suppose is a random sample from SLLD given in Eq. (5). Assuming and are known, the joint probability density function is
where
Let no. of observations less than or equal to in a sample of size .
It can be proved that
where represents binomial distribution with parameters and indicating number of trials and probability of success respectively, represents Gamma distribution with shape parameter and scale parameter .
The following theorem gives UMP test for skewness parameter .
.
The UMP test for testing against when of size is given by
where is as defined in Eq. (7), and are such that .
The proof of the theorem is deferred to the Appendix. The UMP test for testing against of size is described in Appendix (Remark 2.1). The UMP test for symmetry for one sided alternative hypothesis can be obtained by putting , in the tests described above. Critical values of for level of significance and power of the test for different values of are calculated for sample sizes 10(10)50, under and , for (chosen arbitrarily). These values are presented in Tables 2 and 3 respectively, in Appendix. Note that cutoff points and power of the test does not depend on the value of .
The sequential tests are preferred over fixed sample size test, when sampling is costly. In the following section, Sequential Probability Ratio Test for skewness parameter is discussed.
Sequential probability ratio test (SPRT)
The sequential probability ratio test based on independent and identical random variables from SLLD , for testing against , when and are known, for the strength can be easily derived and is as shown below.
SPRT at stage is as follows:
Reject if ,
Accept if ,
Continue sampling by taking one more observation if , where , , and .
On simplification SPRT is
At the stage
Reject if ,
Accept if ,
Continue sampling by taking one more observation if , where , and as defined in Eq. (7).
It can be easily seen that the SPRT for testing against , for the strength , when and are known is
At the stage,
Reject if ,
Accept if ,
Continue sampling by taking one more observation if .
The properties of SPRT are derived in Appendix. SPRT for symmetry can be obtained by substituting . The power and ASN values for strength are computed for (chosen arbitrarily) and are presented in Tables 4 and 5 respectively for different values of .
The UMP test discussed in Section 2, for one sided alternative hypothesis, clearly shows that the UMP test for testing against does not exist. Therefore in the next section, Uniformly most powerful unbiased test, for skewness parameter , for two sided alternative hypothesis is derived, as described in Lehmann (1986).
Uniformly most powerful unbiased (UMPU) test
Suppose is a random sample from SLLD given in Eq. (5). is as defined in Eq. (7). In the following theorem, UMPU test for skewness parameter is derived.
.
The uniformly most powerful unbiased test for testing against of size , when and are known, is given by
where and are such that
The proof of the theorem is deferred to the Appendix. Thus the UMPU test for skewness parameter, for testing against is obtained by substituting . The critical values of and , for level of significance 0.05 and power of the test for different values of are calculated for sample sizes 10(10)50, for (chosen arbitrarily). These values are presented in Table 6 in Appendix.
The tests for skewness parameter described in Sections 2, 3 and 4 assumes to be known. In the following Section, likelihood ratio test for skewness parameter is discussed, when is unknown.
Exact likelihood ratio test (LRT)
Let denote independent and identical SLLD random variables, where is known and is unknown. The exact likelihood ratio test for skewness parameter is derived in the following theorem.
.
The LRT based on observations of size , for testing against , where is known and is unknown is given by
where and are as defined in Eq. (7). and are obtained such that .
The proof of the theorem and relevant remarks are deferred to the Appendix. The test for symmetry is obtained by putting . The critical values of and power of the LRT for symmetry are obtained for different sample sizes 10(10)50 and at , for different values of and are shown in Table 7 in Appendix. The two sided Binomial LRT test is described in Appendix (Remark 5.4). The critical values and power are obtained for different sample sizes 10(10)50 and at and are shown in Table 8 in Appendix. From Tables 7 and 8, it is clear that power of an exact LRT is significantly greater than the power of Binomial LRT, as expected, but Binomial LRT is computationally simple. Also one sided LRT test for symmetry is derived in Appendix (Remark 5.5).
Neyman structure test is an alternate test to LRT when nuisance parameter is present. In the next section it is derived for skewness parameter in the presence of nuisance parameter .
Neyman structure test
Let denote independent and identical SLLD random variables, where is known and is unknown. Neyman structure test for skewness parameter is derived in the following theorem.
.
The Neyman structure test for testing against based on the random sample of size drawn from SLLD , where is known and is unknown is given by
where , , and , are as defined in Eq. (7). and are obtained such that .
The proof of the theorem is deferred to the Appendix. Neyman structure test and LRT for symmetry parameter are identical for one sided alternative hypothesis.
In the following Section application of LRT and Binomial LRT for symmetry to the real life data is presented.
Application
The Road Transport Company is running buses on different routes. The bus halts at several stops in between the starting point and the destination. The fare paid by each passenger will be different, depending on distance traveled as they can get in and get out of the bus at any stop. The number of passengers traveling per trip is not fixed. The fare collected from the passengers by the bus drivers (income) per trip on any route depends on the number of passengers traveling and distance traveled by the passengers. Table 1 shows income (in Rs.) reported per trip, by bus drivers, on one of the routes, in the month of February, for 29 trips, between the two fixed locations.
Income in Rs. reported per trip by bus drivers during 29 trips in February
3,160
2,152
1,798
1,871
2,535
1,753
4,190
2,181
2,423
3,094
3,228
2,886
1,237
1,524
1,889
1,629
3,954
2,967
745
1,896
1,595
2,025
1,373
2,205
2,184
2,208
1,379
1,625
1,446
The officer of Road Transport Company knows that bus drivers are underreporting incomes. He knows that the bus drivers should generate an income of at least Rs. 1,500 per trip. Therefore lower limit on true income is Rs. 1500. The sample size is and number of income values less than Rs. 1500 in the sample is . Using Eq. (7), the values of statistics are , . MLEs of the parameters are obtained using Eqs (35) and (36) as , .
The LRT described in Section 5 is applied to test versus According to Theorem 5.1, and it lies between 0 and the cutoff point (given in Table 7). Hence is rejected and it can be concluded that data supports the claim that the distribution of underreported income is not scale symmetric.
Using Binomial LRT, number of income values less than Rs. 1500 in the sample is and is less than the cutoff point (given in Table 8), leading to the same conclusion.
The comparison of fitting symmetric log Laplace distribution and skew log Laplace distribution (SLLD) to income data is carried out using Akaike’s information criterion (AIC). For any model , with the likelihood function its AIC is defined as
where denotes the number of parameters of the model estimated from the given data. The AIC values for symmetric log Laplace distribution and skew log Laplace distribution are 456.9047 and 436.9354 respectively. Skew log Laplace distribution (SLLD) has lesser AIC than symmetric log Laplace distribution and SLLD is appropriate to describe underreported incomes.
Assuming that underreported incomes of bus drivers follows SLLD , it can be concluded that percentage of trips with underreported income below Rs. 1,500 is 43.22 ( and ). Hartley and Revankar (1974) defined index of dishonesty as . Using the distributions of and given in Eqs (3) and (1) respectively, index of dishonesty is given by . The estimate of index of dishonesty is indicating that of 80.32 percent of income is reported.
Concluding remarks
All the tests derived in this paper are classical optimum exact tests, whose cutoff points and power can be obtained using simple computer program. The UMP test, UMPU test and LRT/Neyman structure test are randomized tests. All tests are applicable for any sample size. The analysis of underreported income based on LRT/Neyman structure test indicates that Skew log Laplace distribution is better than symmetric log Laplace distribution to describe observed underreported income. Using Skew log Laplace distribution it is concluded that 80.32 percent of income is reported.
Footnotes
Acknowledgments
Authors sincerely thank the members of Editorial Board and the reviewer for their helpful comments and suggestions.
Appendix
References
1.
FernandezC., & SteelF.J. (1998). On Bayesian modeling of fat tails and skewness. J Am Stat Assoc, 93, 359-371.
2.
HartleyM.J., & RevankarN.S. (1974). On the estimation of the Pareto Law from under-reported data. Journal of Econometrics, 2, 327-341.
3.
HollaM.S., & BhattacharyaS.K. (1968). On a compound Gaussian distribution. Ann Inst Stat Math, 20, 331-336.