Abstract
In this paper, a trivariate generalized non linear mixed model (TGLMM) using probit and complementary log-log transformations, is considered. These models are helpful in studying the complex relationship among the sensitivity (SN), specificity (SP) and disease prevalence (DP). For estimation of SN, SP, DP, positive (negative) predictive values (PPV and NPV) and positive (negative) likelihood ratios, Non-linear Mixed (NLMIXED) approach has been used. Model selection techniques are used to identify the best-fitting model for making statistical inference. The proposed trivariate non linear random effects models prove to be very useful in practice for meta-analysis of diagnostic accuracy studies.
Keywords
Introduction
Diagnosing a disease condition accurately helps in its control and prevention. For deciding upon the performance of a diagnostic test, sensitivity (SN), specificity (SP), positive (negative) predictive values (PPV and NPV) and positive (negative) diagnostic likelihood ratios (Zhou et al., 2009; Pepe, 2003) plays a very important role. As SN and SP vary among studies and depend on different cut-off points, the clinical usefulness of a diagnostic test may not be truly brought out which in turn, depends upon disease prevalance (Li et al., 2007). PPV and NPV are important indicators for the prediction accuracy of a test. Sometimes non linear relationship may exist between PPV, NPV and sensitivity, specificity and disease prevalence.
There is an emphasis on evidence-based diagnosis by meta-analysis of diagnostic test accuracy studies (Egger et al., 2008) due to fast development of evidence-based medicine. Meta-analysis helps in summarizing the results quantitatively from similar diagnostic test accuracy studies. When a diagnostic test is compared with its gold standard, many methods are available to take into account the heterogeneity between studies (Rutter & Gatsonis, 2001; Song et al., 2002; Van Houwelingen et al., 2002; Macaskill, 2004; Reitsma et al., 2005; Mallett et al., 2006) which arises due to the differences in disease prevalence, study design as well as laboratory and other errors. Because of this heterogeneity, random effects models viz hierarchical summary receiver operating characteristic model l (Rutter & Gatsonis, 2001) and bivariate random effects model on sensitivities and specificities (Van Houwelingen et al., 2002; Reitsma et al., 2005) have been recommended (Chu, 2009). In meta-analysis, Riley and others (Riley et al., 2007, 2008) suggested that bivariate random-effects offer numerous advantages over univariate meta-analysis.
Heterogeneity plays a distorted role in meta-analysis and may lead to differences in indicators like disease prevalence, population stratification etc. In clinical reserach, reporting of positive and negative diagnostic odds ratio, PPV and NPV of diagnostic test accuracy is of utmost importance and superseeds sensitivity and specificity on many instances, particularly in meta-analysis studies. Classical research is sparse on positive and negative odds ratio and also on PPV and NPV whether it is concerned with univariate meta-analysis or bivariate random effects meta-analysis. In literature, only Zwinderman and Bossuyt (2008) considered bivariate random effects meta-analysis on positive and negative diagnostic likelihood ratios and suggested that these should not be pooled in systematic reviews.
In situations where studies compare a diagnostic test with its gold standard reference test, only logit transformation has been used for the trivariate random effects meta-analysis of sensitivity, specificity and disease prevalance parameters in practice (Chu et al., 2009). Some other transformations such as probit, complementary log-log etc. have not been explored in this setting. Some of these transformations may provide a better goodness of fit than logit transformation and provide better statistical inference for sensitivity, specificity and disease prevalence. It has been observed that when the probability of an event is very small or very large, complementary log-log models are more applicable for a diagnostic test with a very high sensitivity and specificity. Moreover, asymetrical complementary log-log transformation is likely to provide better goodness of fit for skewed data as compared to symmetrical logit and probit transformations. If
In this article, we focus on situations where the reference test can be considered as a gold standard and consider a trivariate generalized non linear mixed effects model (TGLMM) for meta-analysis of diagnostic accuracy studies with logit, probit and complementary log-log transformation as special cases.
Some notations and basic definitions are given in Section 2. In Section 3, we present the generalized trivariate random effects model and derive the maximum likelihood function under the parameterization. Median estimation for logit, probit and complementary log-log link functions has been discussed in Section 3. Nonlinear mixed (NLMIXED) estimation approach is presented in Sectiom 4. In Section 5, we analyze a data set. Simulation results are presented in Section 6 and a brief conclusion follows in Section 7.
Cell counts and cell probabilities
Cell counts and cell probabilities
For
Random effect model based on parameterization of
,
and
In order to take correlation among the disease prevalence, sensitivity and specificity between studies into consideration, we extend the bivariate generalized linear mixed model approach for diagnostic measures (
If (
In each cell, the first line shows the number of subjects, the second and third line present the corresponding probabilities based on the parameterization of
To take heterogeneity and potential between-study correlations of
where
where the diagonal entries in
The estimation for medians of disease prevalence, sensitivities and specificities and an approximation to the medians of predictive values and positive and negative likelihood ratios for all studies in a meta-analysis are discussed below for different link functions.
Logit Link
The median estimates for the logit model are
Probit Link
The Probit model is written as
The median estimates for this model are
Complementary log-log link
The Complementary log-log model is written as
The median estimates for the Complementary log-log model are
Assuming a multinomial distribution, the likelihood for
The log-likelihood of
Let
To model study-specific covariate effects on
Akaike’s Information Criterion (AIC) (Burnham & Anderson, 1998) is used for checking goodness of fit. Smaller value of AIC results into better goodness-of-fit. Trivariate Generalized Non Linear Mixed Model (TGLMM) can be fitted by using statistical softwares such as SAS, SPLUS, R or STATA. This has been implemented through NLMIXED approach in SAS by using adaptive Gaussian quadrature to approximate the likelihood integrated over the random effects and maximizing the approximate likelihood by dual quasi-Newton optimization technique (Pinheiro & Bates, 1995). Moreover, this approach computes the population estimates of the back-transformed parameters of interest including the median sensitivity and specificity by using delta method. The predicted values of disease prevalences, sensitivities and specificities, positive & negative predictive values and positive & negative likelihood ratios are computed using empirical-Bayes estimates of the random effects. Since the distribution of these parameters is generally skewed, therefore medians are used instead of means.
The NLMIXED procedure provides estimates of median prevalence,
Data example
To illustrate the trivariate generalized linear mixed effects model discussed in this article, we apply them to meta-analysis data sets discussed in Gould et al. (2001).
Diagnostic accuracy of FDG-PET for malignant focal pulmonary lesions
Gould et al. (2001) presented 40 studies estimating the diagnostic accuracy of positron emission tomography (PET) with the glucose analog 18-fluorodexoxyglucose (FDG) of pulmonary lesions to identify malignant focal pulmonary nodules and mass lesions. Among the 40 studies, six studies did not report specificity and three studies examined FDG imaging with a modified gamma camera. To illustrate and compare different models on sensitivity, specificity and disease prevalence for FDG-PET, we have excluded these nine studies.
We fitted the TGLMM on the data of 31 studies on the diagnostic accuracy of FDG-PET of pulmonary nodules and mass lesions. We assumed trivariate normal distribution of (
| Logit | Probit | Complementary log-log | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (Se, Sp, |
(Se, Sp, |
(Se, Sp, |
(Se, 1-Sp, |
(1-Se, 1-Sp, |
(1-Se, Sp, |
(Se, Sp, |
(1-Se, Sp, |
(1-Se, 1-Sp, |
(Se, 1-Sp, |
|
|
|
1.0324 | 0.6242 | 0.2627 | 0.2628 | 0.2632 | 0.2633 | 1.2046 | 1.2054 | 1.2055 | 1.2047 |
| (0.1199) | (0.0686) | (0.0614) | (0.0612) | (0.0613) | (0.0614) | (0.1048) | (0.1047) | (0.1044) | (0.1043) | |
|
|
3.8615 | 2.0589 | 1.3743 | 1.3730 | 3.8101 | 3.8019 | 1.3729 | 3.7955 | 3.8043 | 1.3719 |
| (0.4256) | (0.2088) | (0.1575) | (0.1574) | (0.3785) | (0.3740) | (0.1569) | (0.3715) | (0.3763) | (0.1570) | |
|
|
1.1407 | 0.6836 | 0.3146 | 1.3256 | 1.3256 | 0.3149 | 0.3008 | 0.3012 | 1.3094 | 1.3098 |
| (0.2106) | (0.1234) | (0.1125) | (0.1787) | (0.1780) | (0.1118) | (0.1173) | (0.1166) | (0.1796) | (0.1803) | |
|
|
0.07176 | 0.0377 | 0.0099 | 0.0687 | 0.0633 | 0.0143 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| (0.2449) | (0.2404) | (0.2339) | (0.2456) | (0.2481) | (0.2430) | (0.3372) | (0.3380) | (0.3303) | (0.3275) | |
|
|
0.1554 | 0.1459 | 0.1178 | 0.1172 | 0.1723 | 0.1782 | 0.0123 | 0.4505 | 0.4673 | 0.0802 |
| (0.2217) | (0.2255) | (0.2298) | (0.2299) | (0.2351) | (0.2380) | (0.2294) | (0.2157) | (0.2148) | (0.2438) | |
|
|
4819.2 | 4819.4 | 4823.8 | 4818.2 | 4812.1 | 4818.7 | 4828.5 | 4823.4 | 4834.8 | 4823.0 |
| AIC | 4837.2 | 4837.4 | 4841.8 | 4836.2 | 4830.1 | 4836.7 | 4846.5 | 4841.4 |
|
4841.0 |
Summary of disease prevalance, sensitivities, specificities and predictive values with standard errors for saturated model
Summary of parameter estimates with standard errors for the partially reduced model
| Logit | Probit | Complementary log-log | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (Se, Sp, |
(Se, Sp, |
(Se, Sp, |
(Se, 1-Sp, |
(1-Se, 1-Sp, |
(1-Se, Sp, |
(Se, Sp, |
(1-Se, Sp, |
(1-Se, 1-Sp, |
(Se, 1-Sp, |
|
| DP | 0.7391 | 0.7359 | 0.7299 | 0.72999 | 0.7300 | 0.7300 | 0.2574 | 0.2589 | 0.2588 | 0.2574 |
| (0.0226) | (0.0221) | (0.0119) | (0.0217) | (0.0217) | (0.0217) | (0.0224) | (0.0232) | (0.0231) | (0.0224) | |
| Se | 0.9794 | 0.9801 | 0.9806 | 0.9806 | 0.0219 | 0.0222 | 0.9806 | 0.0222 | 0.0220 | 0.9806 |
| (0.0085) | (0.0099) | (0.0119) | (0.0119) | (0.0081) | (0.0081) | (0.0119) | (0.0081) | (0.0081) | (0.0119) | |
| Sp | 0.7819 | 0.7770 | 0.7710 | 0.2141 | 0.2141 | 0.7710 | 0.7710 | 0.7411 | 0.2366 | 0.2141 |
| (0.0334) | (0.0341) | (0.0335) | (0.0337) | (0.0337) | (0.0334) | (0.0335) | (0.0408) | (0.0370) | (0.0337) | |
| PPV | 0.9245 | 0.9793 | 0.9204 | 0.7713 | 0.0701 | 0.2082 | 0.5974 | 0.3019 | 0.0095 | 0.0326 |
| (0.0133) | (0.0058) | (0.0134) | (0.0208) | (0.0259) | (0.0686) | (0.0454) | (0.0266) | (0.0036) | (0.0126) | |
| NPV | 1.0000 | 0.9975 | 0.9364 | 0.8037 | 0.0749 | 0.2258 | 0.9914 | 0.9696 | 0.3873 | 0.6947 |
| (0.0073) | (0.0041) | (0.0377) | (0.1031) | (0.0133) | (0.0206) | (0.0053) | (0.0189) | (0.0467) | (0.0267) | |
| LR+ | 4.4900 | 4.3959 | 0.9154 | 0.9754 | 2.2446 | 5.2694 | 0.9153 | 4.7773 | 2.2811 | 0.9754 |
| (0.6890) | (0.6741) | (0.0536) | (0.0152) | (0.0544) | (0.6248) | (0.0536) | (0.5956) | (0.0636) | (0.0152) | |
| LR- | 0.0263 |
|
0.0251 | 0.0904 |
|
|
0.0251 |
|
|
0.0904 |
| (0.0109) | (0.0568) | (0.0155) | (0.0577) | (0.7203) | (0.0560) | (0.0155) | (0.0736) | (0.6455) | (0.0576) | |
Summary of parameter estimates with standard errors for the reduced model with zero correlation
Summary of disease prevalance, sensitivities, specificities and predictive values with standard errors for the reduced model with zero correlation
Table 3 compares the regression parameter estimates, standard errors and the goodness of fit measure using Akaike’s Information Criterion (AIC) resulting from the trivariate random effects meta-analysis obtained from the logit, probit and complementary log log link models (saturated models). Table 4 compares the regression parameter estimates, standard errors and the goodness of fit measurement using Akaike’s Information Criterion (AIC) obtained from the partially reduced models with
On the basis of results in Tables 3–7, it is concluded that
For saturated model, complementary log-log model (1-Se, 1-Sp, For the partially reduced model and the reduced model with zero correlation, complementary log-log model (1-Se, 1-Sp, Saturated model provides the best fit as its AIC is the lowest as compared to others and suggest that
There is perfect negative correlation between random effects of disease prevalence and specificity. There is moderate positive correlation between random effects of sensitivity and specificity.
We conduct simulation studies under two sets to evaluate the performance of trivariate generalized linear mixed model with complementary log log transformation. In first set of simulations, (
Bias of median estimates and the standard errors for
Bias of median estimates and the standard errors for
Table 8 shows that under considered scenarios, the estimated median disease prevalence, sensitivity, specificity, PPV and NPV are unbiased under the complementary log-log link function.
In this paper, we use the TGLMM approach to jointly model the sensitivities, specificities and disease prevalence and discuss the estimation of sensitivity, specificity, disease prevalence, positive(negative) predictive values and positive(negative) likelihood ratios in meta-analysis of diagnostic tests. Earlier these studies have been carried out by using logit link function. We explore the use of other link functions like probit and complementary log-log and compare our results with existing results on the basis of Akaike Information Criterion (AIC). We conclude through a practical illustration that complementary log-log link function results into better goodness of fit for saturated models, partially reduced models and the reduced models with zero correlations. Simulation studies are carried out under two sets to evaluate the performance of trivariate generalized non linear mixed model with complementary log log transformation. Under considered scenarios, the estimated median disease prevalence, sensitivity, specificity, PPV and NPV are unbiased under the complementary log-log link function. The results depict a better fit than the cases where logit and probit link functions are used.
Footnotes
Acknowledgments
The first author is grateful to Department of Science and Technology (DST), Goverment of India for providing financial assistance under INSPIRE for carrying out this work. All the authors also acknowledge the support provided by DST under PURSE grants.
