Abstract
This study addresses the challenge of estimating parameters for two logistic populations that share a common scale parameter but have different location parameters in the presence of fuzzy data. To handle these complexities, both Maximum Likelihood Estimation (MLE) and Bayesian methods are employed. Asymptotic confidence intervals are constructed using ML estimates. For Bayesian estimation, a conjugate prior is utilized, and Bayes estimators are approximated using Lindley’s method due to the lack of closed-form solutions. Furthermore, Approximate Bayesian Computation (ABC) and Markov Chain Monte Carlo (MCMC) techniques, including Hamiltonian Monte Carlo (HMC) and the Metropolis–Hastings (MH) algorithm, are utilized to sample from the posterior distributions and construct Highest Posterior Density (HPD) intervals. A detailed comparative analysis of MLE, Lindley’s approximation, ABC, HMC, and MH is conducted to assess their performance. The effectiveness of the proposed methodology is demonstrated using a real-world dataset under fuzzy conditions.
Keywords
Introduction
The logistic distribution is widely used in various practical applications due to its flexibility in modeling real-world datasets. For instance, it has been effectively applied in demographic studies to describe growth models (Balakrishnan and Basu, 1995). Additionally, it is frequently employed in agricultural production modeling, categorical data analysis, biochemical data analysis, bioassays, and life testing studies (Kotz et al., 2019; Rashad et al., 2016). While its shape resembles that of the normal distribution, the logistic distribution differs in key aspects, it has heavier tails and a more pronounced peak at the center, making it more robust for analyzing datasets with extreme values.
Beyond its applications, a key statistical challenge is parameter estimation under equality restrictions, which plays a crucial role in comparative analysis and risk assessment across different populations. This problem has been extensively studied in normal distributions and is commonly referred to as the “common mean problem.” One of the earliest works in this area was by Graybill (1959), who proposed a combined estimator that dominates individual sample means in terms of variance under specific sample size conditions. Subsequent advancements extended this concept to two or more normal populations with unknown variances, with notable contributions by Moore (1997), Tripathy and Nagamani (2015), and others.
While researchers have extensively studied parameter estimation in normal populations, significant attention has also been given to non-normal distributions, such as the exponential, gamma, and inverse Gaussian families. Ghosh and Meeden (1984) estimated the common location parameter for two exponential distributions, while Jin (1992) introduced improved estimators that outperformed the Maximum Likelihood Estimator (MLE). Tripathy and Nagamani (2014) investigated the estimation of a common location parameter for two exponential populations under order-restricted failure rates. More recently, Azhad and Tripathy (2021) explored parameter estimation for heterogeneous exponential distributions using MLE, modified MLE, and the uniformly minimum variance unbiased estimator (UMVUE).
Yang (2007) explored the estimation of a common shape parameter for two Weibull populations. Later, Nagamani and Tripathy (2017) studied the estimation of a common scale parameter for two gamma populations, while in their other work, Tripathy and Nagamani (2017) focused on estimating the common shape parameter for the same gamma populations. Nagamani and Tripathy (2018) extended this work by examining the common dispersion parameter across multiple Gaussian populations. These studies underscore the growing significance of parameter estimation in non-normal statistical models, particularly in practical applications such as survival analysis, reliability studies, and risk assessment.
Despite these advancements, research on parameter estimation for two logistic populations remains limited. Recently, Nagamani and Tripathy (2020) explored Bayesian estimation of the common scale parameter for two logistic populations with unknown and possibly unequal location parameters. In another study, Lingutla and Nagamani (2024) examined two logistic populations with a common location parameter and possibly unequal scale parameters. However, these studies do not fully address the complexities introduced by fuzzy data in logistic distribution modeling.
While significant research has focused on classical estimation methods, handling fuzzy data in multi-population contexts remains an open problem. Many real-world datasets contain uncertainty, making traditional estimation techniques inadequate for precise inference. Several researchers have investigated parameter estimation under fuzzy conditions for single-population models. Pak and Parham (2013) analyzed the Weibull distribution using Maximum Likelihood Estimation (MLE), the method of moments, and Bayesian estimation under fuzzy conditions. Khoolenjani and Jafari (2016) estimated parameters of the weighted exponential distribution using Bayesian techniques with both symmetric and asymmetric loss functions, employing Lindley’s approximation. Pak (2020) analyzed the generalized exponential model with Bayesian methods under fuzzy data, illustrating its application in reliability and risk assessment scenarios. Roohanizadeh and Pak (2022) extended Bayesian inference techniques to fuzzy datasets in the context of generalized exponential and Weibull distributions. More recently, Ashok and Nagamani (2025) explored parameter estimation for the gamma distribution under fuzzy conditions. These studies highlight the increasing importance of incorporating fuzzy data into statistical models to enhance the robustness and applicability of estimation methods in uncertain environments.
Despite these contributions, a critical gap remains in developing methodologies that effectively handle fuzzy data in multi-population statistical models. Addressing this challenge is essential for improving decision-making processes in real-world scenarios characterized by imprecise or uncertain information.
This study focuses on parameter estimation for two logistic populations that share a common scale parameter but have different location parameters, integrating fuzzy data into the estimation process.
Key Contributions
The key contributions of this study are summarized as follows: We develop a unified estimation framework for two logistic populations sharing a common scale parameter in the presence of fuzzy data, extending classical logistic modeling to accommodate data imprecision. We provide a comprehensive comparative analysis of classical and Bayesian estimation methods, including EM-based MLE, Lindley’s approximation, Approximate Bayesian Computation (ABC), Hamiltonian Monte Carlo (HMC), and the Metropolis–Hastings (MH) algorithm, within a fuzzy data setting. The performance of the proposed estimators is systematically evaluated through extensive Monte Carlo simulation studies across different sample sizes, using bias and mean square error as performance criteria. The practical applicability of the methodology is demonstrated using a real engineering dataset, illustrating the impact of fuzzy data modeling on parameter estimation and uncertainty quantification.
The remainder of the paper is structured as follows: Section 2 introduces foundational concepts related to fuzzy data and triangular fuzzy numbers. Section 3 presents the MLE approach for logistic parameters using the Expectation-Maximization (EM) algorithm. Section 4 discusses Bayesian estimation techniques including Lindley’s approximation, ABC, and MCMC methods (HMC and MH). Section 5 contains simulation studies evaluating estimator performance. Section 6 applies the methods to real-world data. Finally, Section 7 concludes with a summary of findings and suggestions for future research.
Theoretical Background
Logistic Distribution
The logistic distribution is widely used in various fields, including survival analysis, economics, and machine learning, due to its similarity to the normal distribution but with heavier tails. The probability density function (PDF) of the logistic distribution for a random variable
Fuzzy set theory, introduced by Zadeh (1965), provides a mathematical framework to handle uncertainty and imprecision by allowing membership in sets to be gradual rather than binary. In classical set theory, an element either belongs to a set or does not; in fuzzy sets, an element can have a degree of membership, defined by a membership function
For a fuzzy set
The membership function reflects the degree of truth of an element’s belonging to a set. For example, in medical data, this could represent the degree of certainty about a patient’s survival time or diagnosis based on imprecise or incomplete information.
In the context of survival analysis, fuzzy methods are particularly useful in accommodating incomplete or uncertain data, such as imprecise survival times, due to factors like missing information, recall bias, or measurement errors. Fuzzy methods allow us to represent data that is not entirely precise, which is often the case in medical and clinical studies.
A Triangular membership function is commonly used to model fuzzy numbers, particularly when expert knowledge or subjective judgment is used to quantify uncertain data. A triangular fuzzy number is characterized by three parameters: the lower bound
The membership function
In medical data, triangular fuzzy numbers are frequently used to represent uncertain survival times or other clinical parameters. For example, if an expert estimates that a patient’s survival time is likely to be around 12 months, but could range between 10 and 15 months, the fuzzy number
In the subsequent sections we estimate the parameters, including Maximum Likelihood Estimation (MLE) and Bayesian estimation techniques. Specifically, we employ Lindley’s approximation, Approximate Bayesian Computation (ABC), Hamiltonian Monte Carlo (HMC), and the Metropolis-Hastings (MH) algorithm.
Maximum Likelihood Estimation
Consider two independent random samples,
The fuzzy likelihood function is given by:
Taking the natural logarithm, we obtain:
To estimate
The Expectation-Maximization (EM) (Dempster et al., 1977) algorithm is a widely used iterative method for estimating parameters in models with latent variables, particularly when dealing with incomplete, missing, or fuzzy data. For two logistic populations with a common scale parameter
The iterative EM procedure for estimating the parameters is as follows: Assume initial values If
Once the MLEs of the parameters are obtained, the asymptotic confidence intervals for
Bayesian estimation provides a flexible framework for parameter inference by incorporating prior information together with the observed data. Let
The posterior expectation of any function
Here,
Lindley’s Approximation: A second-order Taylor expansion of the log-posterior about the posterior mode (typically the maximum likelihood estimate) provides an analytical approximation to posterior expectations. This method is computationally efficient and particularly useful for low-dimensional parameter spaces. Approximate Bayesian Computation (ABC): A likelihood-free technique that approximates the posterior distribution by matching summary statistics from simulated data with those of the observed fuzzy data. ABC is applicable when the likelihood is either unavailable or too complex. Hamiltonian Monte Carlo (HMC): An advanced Markov chain Monte Carlo (MCMC) method that uses Hamiltonian dynamics to explore the posterior distribution efficiently. HMC greatly improves mixing in high-dimensional or highly correlated parameter spaces. Metropolis–Hastings Algorithm: A widely used MCMC procedure based on the acceptance rejection principle, which constructs a Markov chain whose stationary distribution is the desired posterior. This method is useful when the posterior distribution has a complicated form but is computationally manageable.
These methods allow us to obtain approximate Bayesian estimators and posterior summaries for the parameters
The approximation proposed by Lindley (1980) is a numerical technique for computing Bayes estimators when the posterior distribution does not possess a closed form. Under the squared error loss function (SELF), the Bayes estimator of a function
Lindley’s approximation to equation (7) is
Ignoring terms of order
We assume independent conjugate priors,
To address potential concerns regarding numerical stability, central difference schemes with carefully chosen step sizes were employed for numerical differentiation. These step sizes were selected to ensure stable numerical evaluation of the derivatives, and the resulting Bayes estimates were found to be numerically well behaved across all simulation settings.
Approximate Bayesian Computation (ABC) is particularly useful when the exact likelihood function is analytically intractable, as is the case for fuzzy and imprecise observations arising from the logistic distribution. ABC avoids explicit likelihood evaluations by comparing observed data with synthetic data generated from the model, thereby approximating the posterior distribution through simulation. This makes ABC especially suitable for Bayesian inference in reliability and survival models involving uncertainty, fuzziness, or non-standard likelihoods.
ABC Procedure
Two populations are modeled using logistic distributions with a common scale parameter:
Independent priors are assigned as
For each iteration, parameter values Suitable summary statistics are computed for the observed fuzzy data ( A discrepancy measure is calculated, commonly the Euclidean distance:
A tolerance level Repeating the simulation until a sufficient number of accepted samples is obtained yields an approximate posterior sample, from which Bayesian point estimates (posterior mean, median) and credible intervals are computed.
In summary, ABC provides a flexible and likelihood-free Bayesian inference framework capable of handling fuzzy observations and complex data structures. By relying on simulations rather than analytic likelihoods, ABC effectively accommodates uncertainty and supports robust parameter estimation for logistic reliability models under fuzzy data.
In the present study, the summary statistics were selected as the fuzzy sample mean, fuzzy median, and interquartile range, as these statistics jointly capture both location and dispersion while remaining robust under data imprecision. The tolerance level
Hamiltonian Monte Carlo (HMC)
Hamiltonian Monte Carlo (HMC) is an efficient Markov chain Monte Carlo (MCMC) method that uses Hamiltonian dynamics to generate proposals for sampling from complex posterior distributions. Unlike traditional random-walk MCMC methods, HMC uses gradient information from the log-posterior to guide the sampling trajectory, enabling rapid exploration of the parameter space and reducing autocorrelation among successive samples. This makes HMC particularly suitable for Bayesian inference involving fuzzy observations and nonstandard likelihood functions such as those arising from the logistic model considered here.
Working Rule for HMC
The two populations are modeled as
Independent prior distributions are assigned as
The posterior density is proportional to the product of the fuzzy-data likelihood and the priors:
The gradient of the log-posterior is required for Hamiltonian dynamics:
HMC augments the parameter vector Each HMC iteration consists of: Initialization: Start with current parameter vector Momentum Sampling: Draw Leapfrog Integration: Evolve the system using Metropolis–Hastings Correction: Accept the proposal After discarding burn-in iterations, the retained samples form an approximate posterior distribution. From these samples we compute: posterior means of Highest Posterior Density (HPD) credible intervals.
HMC thus provides an efficient and scalable sampling framework for Bayesian inference in the presence of fuzzy data. Its ability to explore the posterior distribution effectively makes it particularly well suited for the logistic model with fuzzy observations considered in this study.
The Metropolis–Hastings (MH) algorithm is a widely used Markov chain Monte Carlo (MCMC) technique for generating samples from a posterior distribution that cannot be sampled from directly. MH constructs a Markov chain whose stationary distribution is the target posterior, using a proposal mechanism followed by an acceptance–rejection step.
Working Rule for MH
The two populations under study follow logistic distributions with a common scale parameter:
The posterior density is proportional to the fuzzy likelihood multiplied by the prior:
A proposal distribution Given the current state Compute the acceptance probability
Accept or reject the proposal according to:
Repeat this process for Posterior summaries are obtained using the retained samples, including:
The MH algorithm thus provides a flexible and likelihood-free tool for Bayesian inference under fuzzy observations, enabling sampling from complex posterior structures when analytical forms are not available.
Data Analysis
In this section, we present a comparative evaluation of different estimation methods under varying sample sizes. Specifically, we estimate the common scale parameter
We consider the following parameter settings:
For Various Sample Sizes, We Compare estimated Value and Mean Square Errors of Multiple Estimators under Squared Error Loss for
.
For Various Sample Sizes, We Compare estimated Value and Mean Square Errors of Multiple Estimators under Squared Error Loss for
For Various Sample Sizes, We Compare estimated Value and Mean Square Errors of Multiple Estimators under Squared Error Loss for
Fuzzy numbers for Peak values are sampled from the respective logistic distributions. Lower and upper bounds of the triangular fuzzy numbers are constructed by subtracting and adding a small random value (sampled from a uniform distribution) to the peak. A triangular membership function is applied to determine the membership degree across the fuzzy intervals.
We assess the performance of each estimator using the following metrics: Average Value (Estimate)—the mean of the estimated values across replications. Mean Square Error (MSE)—to quantify estimation accuracy and variability.
To ensure robust performance comparison, we simulate 10,000 datasets from each parameter setting across different sample sizes. The prior hyperparameters used in Bayesian estimation are fixed as follows:
Tables 1 and 2 summarize the estimated values and MSEs for each method. The first column denotes the sample size, followed by the true parameter values. The subsequent columns report the estimation results for MLE, Lindley’s method, ABC, HMC, and MH.
Tables 3 and 4 provide interval estimates: The third column shows the 95% asymptotic confidence intervals derived using the observed Fisher information matrix. The fourth column lists the 95% Highest Posterior Density (HPD) intervals obtained via HMC and MH sampling.
95
95
The simulation study leads to the following observations:
MSE consistently decreases with increasing sample size, confirming the consistency of all estimators. Bayesian methods (Lindley, ABC, HMC, MH) provide stable and accurate parameter estimates, especially in small-sample scenarios. HMC performs particularly well for small sample sizes, yielding lower bias and MSE than MLE, especially for the scale parameters Lindley’s approximation offers computationally efficient and competitive estimates, showing performance comparable to MLE and other Bayesian methods. ABC is effective in cases with intractable likelihoods and performs well with fuzzy data, though it may require careful tuning of the tolerance parameter. HPD intervals from HMC and MH are generally narrower and more informative than traditional asymptotic intervals, particularly for the fuzzy data setting. Overall, the integration of fuzzy data into the Bayesian framework enhances estimation robustness and provides a more realistic reflection of uncertainty in survival data analysis.
Although explicit theoretical proofs are difficult to obtain for likelihood-based inference under fuzzy observations, the proposed estimators inherit desirable asymptotic properties from their classical counterparts. As the sample size increases and the degree of fuzziness decreases, the fuzzy likelihood converges to the classical likelihood, implying consistency of the EM-based maximum likelihood estimators as well as the approximate Bayesian estimators. This behavior is empirically supported by the simulation results, where the mean square errors consistently decrease with increasing sample size. Moreover, Bayesian estimators obtained via HMC and MH demonstrate robustness to data imprecision by providing stable posterior summaries and wider, yet informative, HPD intervals that appropriately reflect uncertainty under fuzzy data.
Real Aata Analysis
To assess the performance of our model, we utilized data from an engineering dataset provided by Pak and Parham (2013), which includes a case study on the light-emitting diodes (LED) manufacturing process, specifically focusing on the luminous intensities of LED sources. The data is shown in Table 5. We validated that the distribution of this data adheres to a logistic distribution using the Kolmogorov-Smirnov (KS) and Anderson-Darling (AD) tests, with the results displayed in Table 6.
The Breakdown Lifetimes at 32 and 34 KV of an Insulating Fluid.
The Breakdown Lifetimes at 32 and 34 KV of an Insulating Fluid.
KS and AD Test Results.
To evaluate whether the breakdown lifetimes at 32 and 34 KV follow a Logistic distribution, we applied the Kolmogorov-Smirnov (KS) test and the Anderson-Darling (AD) test. These tests assess whether an empirical distribution significantly deviates from a theoretical distribution.
The KS test is a non-parametric test comparing the empirical cumulative distribution function (ECDF) with the theoretical logistic distribution, while the AD test places more weight on the tails of the distribution, making it more sensitive to deviations in extreme values. These tests were selected to ensure that the fitted distribution is robust across different parts of the dataset.
For Set 1 (32 KV), the KS (
Wald Test Results
The Wald test was performed to evaluate the null hypothesis that the scale parameters of the two populations are equal (
Wald Test for Equality of Scale Parameters.
Wald Test for Equality of Scale Parameters.
This suggests that there is no statistically significant evidence to conclude that the scale parameters of the two populations differ. Practically, this implies that the two datasets share similar variability, supporting the assumption of a common scale parameter.
In our study, each data point is represented using triangular fuzzy numbers
The outcomes, summarized in Tables 8 and 9, provide comparative insights into the effectiveness of the different estimation strategies under uncertainty. Figure 1 displays the density plots for the parameters

Illustration of Density plots of parameters.
Maximum Likelihood and Bayesian Estimators of the Combined Model.
95% Asymptotic and HPD Intervals using ML Estimates, HMC and MH Estimates for the Parameters
Table 8 shows that Bayesian estimates closely align with those obtained via MLE. Lindley’s method gives results almost identical to the EM estimates, indicating its strong performance with imprecise inputs. ABC slightly underestimates the parameters, whereas HMC and MH produce consistent yet slightly conservative estimates, suggesting their better accommodation of data uncertainty.
As observed in Table 9, the 95% HPD intervals from HMC and MH are generally wider than the MLE-based asymptotic intervals, particularly for the scale parameter
From a computational perspective, the EM-based maximum likelihood estimation and Lindley’s approximation were found to be the most efficient methods, as they rely on deterministic optimization and low-dimensional numerical calculations. Approximate Bayesian Computation required moderate computational effort due to repeated simulation of synthetic datasets and acceptance–rejection steps. Among the Bayesian sampling methods, Hamiltonian Monte Carlo and the Metropolis–Hastings algorithm were the most computationally intensive, owing to their iterative nature and the need for a large number of iterations to ensure adequate convergence. However, these methods provide richer posterior inference and more comprehensive uncertainty quantification, which may justify the additional computational cost in applied fuzzy data settings.
This study investigated parameter estimation for two logistic populations sharing a common scale parameter under fuzzy data conditions. Both classical and Bayesian estimation approaches were examined, including EM-based maximum likelihood estimation, Lindley’s approximation, Approximate Bayesian Computation, Hamiltonian Monte Carlo, and the Metropolis–Hastings algorithm. Simulation results demonstrated the consistency of all estimators, with Bayesian methods exhibiting superior performance in small-sample scenarios and providing improved uncertainty quantification under fuzziness. The real-data application further illustrated the practical relevance of the proposed framework and confirmed the suitability of the logistic model for fuzzy observations. Overall, this study highlights the importance of incorporating fuzzy data into multi-population inference and provides practical guidance for selecting appropriate estimation methods in the presence of data imprecision.
Footnotes
Funding Statement
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
