Abstract
For efficiently estimating the normal mean (
Introduction
We consider normal mean (
Among them, there are
The following empirical censoring probability is calculated from the observations:
This is essentially a hierarchical maximum likelihood estimation as follows:
The “censoring” event ( The total number of events follows a binomial distribution with likelihood function
Since
For obtaining the MLE for certain function (
Approach II: Direct MLE
The direct MLE maximizes the full likelihood of all individual observations and we employ the EM algorithm to achieve this based on the following protocol:
Denote the complete observations as: Initialization (step For Update
Let
We denote such an estimate at convergence by
We set
Approach I
A first-order Taylor expansion around
and
Thus, the variance of
Here,

Here, the number of events (
We start with the following likelihood function for all individual observations:
The log-likelihood is as follows:
The first-order derivative of the log-likelihood is as follows:
The iterative “update” step (Equation (1.2)) reaches for a root of
Equation (1.2) amounts to updating “
The second-order derivative of the log-likelihood is
For a random variable with a continuous probability density function, general regularity conditions are available in the literature for MLE to be a consistent and asymptotic efficient estimator (the limiting variance equals the inverse of Fisher information). For the special case (Equation (2.3)), our simulation study shows that, the inverse of Fisher information accurately approximates the variance of
For Equation (2.6), we present an explanatory justification (not a rigorous proof) through using discrete approximation. The random variable If If If
In order for the inverse of Fisher information (of For any For any For any
It is easy to verify that the probability mass function (
The asymptotic variance ratio (Approach II vs. I) is as follows:
In Figure 2,

Proof. We have the following equivalent statements by simple algebraic operations. To prove the conclusion, we only need to show the following:
We divide both sides by
We further divide both sides by (
That is
We multiply both sides by
That is
The left side of (2.9) decreases as
Now, we prove the following:
This statement amounts to the following:
Define
It is easy to see that,
Its derivative
We only check that
When
this value is greater than the square of the right side of (2.10) which is
The proof ends. Some probability inequalities regarding the standard normal random variable are available in the literature. One of them is Mill's ratio inequality which is as following:
Theorem 1 gives another inequality relevant to standard normal distribution.
Proof. For any
Thus, the first statement in Lemma 1 holds. Since
Thus, the second statement in Lemma 1 holds. The proof ends.
For example, under configuration

Proof. We define
Its first-order derivative
Eq.(2.8) gives
The proof ends.
Proof.
When
Proof.
For i.i.d. normal variables under right-censoring, the present work investigates MLE from some new perspectives. Since the hierarchical MLE (Approach I) does not take into account the detailed individual observations and loses information compared to the direct MLE (Approach II), we are motivated to quantify the relative information loss by using the asymptotic variance ratio (Equation (2.7)) which is a monotonically decreasing function of
Footnotes
Acknowledgement
The authors are very grateful to Editor-in-Chief (Uttam Bandyopadhyay) and two anonymous referees whose insightful comments have greatly improved the presentation of the materials and motivated us to investigate more aspects of this subject.
Declaration of Conflicting Interests
Funding
The author(s) received no financial support for the research, authorship and/or publication of this article.
