Reliability and Validity of the Triple D Score to Predict Stone-Free Status After Extracorporeal Shockwave Lithotripsy: Methodological Mistake

Dear Editor:

I read with interest the paper by Ozgor and colleagues published in the January 2017 issue of the Journal of Endourology.¹ The authors aimed to evaluate the accuracy of the Triple D scoring system in predicting shockwave lithotripsy (SWL) success rates.¹ They reported that stone-free status was achieved in 115 of 200 patients (57.5%), whereas 85 patients (42.5%) had one or more residual fragments. Stone location, density, and volume differed significantly between patients who did and did not achieve stone-free status after SWL (p < 0.001 for each).¹

These results have nothing to do with external validation, reliability, or validity.²⁻⁶ Reproducibility (precision) and validity (accuracy) are two completely different methodological issues, and each should be assessed with appropriate tests. To assess reliability, the intraclass correlation coefficient should be used for quantitative variables and weighted kappa for qualitative variables. To assess validity, the most appropriate tests are the interclass correlation coefficient (Pearson r) for quantitative variables and, for qualitative variables, sensitivity, specificity, positive predictive value, negative predictive value, positive and negative likelihood ratios, diagnostic accuracy, and the odds ratio. Moreover, prediction requires at least two different cohort datasets, or at least one cohort dataset split into development and validation subsets.⁵,⁶
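To make the distinction concrete, the following is a minimal Python sketch of two of the measures named above: the validity measures derived from a 2×2 table of a binary test against a reference standard, and a linearly weighted kappa for agreement between two raters. The function names `diagnostic_metrics` and `weighted_kappa`, the choice of linear weights, and all counts are illustrative assumptions; none of the numbers come from the study under discussion.

```python
from typing import Dict, List


def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> Dict[str, float]:
    """Validity measures for a binary test against a reference standard.

    Assumes every cell of the 2x2 table is non-zero, so no ratio divides by zero.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "lr_positive": sensitivity / (1 - specificity),
        "lr_negative": (1 - sensitivity) / specificity,
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "diagnostic_odds_ratio": (tp * tn) / (fp * fn),
    }


def weighted_kappa(table: List[List[int]]) -> float:
    """Linearly weighted kappa for a square agreement table (rater A x rater B).

    Assumes at least two ordered categories and less-than-perfect chance agreement.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    row_prop = [sum(row) / n for row in table]
    col_prop = [sum(table[i][j] for i in range(k)) / n for j in range(k)]
    observed = 0.0
    expected = 0.0
    for i in range(k):
        for j in range(k):
            # Linear agreement weight: 1 on the diagonal, 0 at maximal disagreement.
            w = 1 - abs(i - j) / (k - 1)
            observed += w * table[i][j] / n
            expected += w * row_prop[i] * col_prop[j]
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    for name, value in diagnostic_metrics(tp=40, fp=10, fn=20, tn=30).items():
        print(f"{name}: {value:.3f}")
    # Hypothetical 3-category ratings from two raters.
    print(f"weighted kappa: {weighted_kappa([[20, 5, 0], [4, 15, 6], [1, 4, 25]]):.3f}")
```

The first function addresses validity (agreement with a reference standard) and the second addresses reliability (agreement between repeated ratings), which is why a significant group difference or p value cannot substitute for either.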

Their multivariate analyses identified Triple D score and stone location as independent factors affecting SWL success (p < 0.001 and p = 0.008, respectively), and the mean number of SWL sessions was significantly higher in patients with SWL failure (p = 0.003). On this basis, the authors concluded that the Triple D scoring system is externally valid and associated with SWL success in the treatment of renal and ureteral stones. Such a conclusion is misleading, because none of the methodological issues outlined above regarding reliability, validity, and prediction has been taken into account.

References

1. Ozgor, Tosun, Kayali, Savun, Binbay, Tepeler. External validation and evaluation of reliability and validity of the Triple D score to predict stone-free status after extracorporeal shockwave lithotripsy. J Endourol, 2017. doi: 10.1089/end.2016.0721.

2. Sabour. Reliability of a new modified tear breakup time method: Methodological and statistical issues. Graefes Arch Clin Exp Ophthalmol, 2016; 254:595–596.

3. Sabour, Ghassemi. Reliability of the International Spinal Cord Injury Musculoskeletal Basic Data Set; methodological and statistical issue to avoid misinterpretation. Spinal Cord Ser Cases, 2016; 2:16023.

4. Sabour, Ghassemi. Accuracy and reproducibility of the ETDRS visual acuity chart: Methodological issues. Graefes Arch Clin Exp Ophthalmol, 2016; 254:2073–2074.

5. Sabour. Prediction of 3-dimensional pharyngeal airway changes after orthognathic surgery: A methodological issue. Am J Orthod Dentofacial Orthop, 2015; 147:8.

6. Sabour. Prediction of preterm delivery using levels of VEGF and leptin in amniotic fluid from the second trimester: Prediction rules. Arch Gynecol Obstet, 2015; 291:719.