Abstract

I
These results have nothing to do with external validation, reliability, and validity. 2 –6 Reproducibility (precision) and validity (accuracy) as two completely different methodological issues should be assessed by using appropriate tests. Regarding reliability, for quantitative variables, intraclass correlation coefficient should be used; for qualitative variables, weighted kappa should be used. However, to assess validity, for quantitative variables, interclass correlation coefficient (Pearson r) and for qualitative variables, sensitivity, specificity, positive predictive value, negative predictive value, likelihood ratio positive and likelihood ratio negative, as well as diagnostic accuracy and odds ratio are among the most appropriate tests. Moreover, for prediction, we need at least two different cohort datasets or at least one cohort dataset, splitting them for development and validation of our model. 5,6
Their multivariate analyses revealed that Triple D score and stone location were identified as independent factors affecting SWL success (p < 0.001 and p = 0.008, respectively). The mean number of SWL sessions was significantly higher in patients with SWL failure (p = 0.003), concluding the external validity of the Triple D scoring system and its association with SWL success in the treatment of renal and ureteral stones. Such a conclusion is just a misleading message, because none of the earlier methodological issues regarding reliability, validity, and prediction have been taken into account.
