Comparing Expert Reported Outcomes to National Surgical Quality Improvement Program Risk Calculator-Predicted Outcomes: Do Reporting Standards Differ?

Abstract

Introduction:

Expert-reported outcomes and complications may not reflect the standardized coding that can be provided by independent, third-party evaluations. The goal of this article is to compare expert-reported complications with standardized coding by the National Surgical Quality Improvement Program (NSQIP). The procedures evaluated were laparoscopic radical nephrectomy (LRN), robot-assisted radical prostatectomy (RARP), and radical cystectomy (RC).

Methods:

The 10 largest LRN, RARP, and RC series were reviewed for reported complications. An index patient was derived from each series using patient demographic data. Index patients were entered into the NSQIP surgical risk calculator (SRC), which provides 11 predicted outcomes based on inputted data. SRC-predicted outcomes were compared with available complication rates in each series.

Results:

Across the 30 studies, 172 of 330 (52%) NSQIP-provided outcome types were presented within expert manuscripts. Death and venous thromboembolism (VTE) were the most commonly reported (27 and 23 studies, respectively), whereas urinary tract infection (UTI) (9) and pneumonia (10) were the least commonly presented. Comorbidities and follow-up duration were reported in 8 out of 30 and 17 out of 30 studies, respectively. For LRN, the median number of reported outcomes was 3 (range 1–5). LRN experts demonstrated a shorter mean length of stay (LOS) than the SRC predicted (2.5 days, SD=1.7, vs 3 days; p<0.001). In RARP studies, a median of 7.5 (3–11) outcomes was reported. Experts outperformed NSQIP RARP predictions in serious complications (p<0.001), any complication (p<0.001), surgical site infection (p=0.025), UTI (p<0.001), and VTE (p=0.002). RC manuscripts reported a median of 7 (2–11) outcomes. RC experts had higher rates of serious complications (p<0.001), reoperation (p<0.001), and death (p<0.001) than predicted by SRC.

Conclusion:

The level of standardization in reporting of outcomes differs between expert series and NSQIP, thus making comparisons difficult.

Introduction

High-quality and patient-centered care is paramount in urologic surgery. The review of outcomes and complications after surgery facilitates improvements. The Joint Commission¹ and Centers for Medicare and Medicaid Services' Physician Quality Reporting System² have put forth standards for reporting metrics of quality care. In addition, voluntary programs such as the National Surgical Quality Improvement Program (NSQIP) are currently utilized by nearly 570 hospitals. Despite this, no standardized system of defining and reporting surgical complications is employed in the urologic literature. Therefore, making comparisons between surgical approaches, institutions, and individual surgeons is difficult.

The American College of Surgeons' (ACS) NSQIP is a national clinical registry specifically developed to help surgeons and hospitals improve surgical quality. Standardized coding is performed by trained individuals, is audited for accuracy, and is consistent across hospitals. Perioperative data on preoperative risk factors, intraoperative variables, and 30-day postoperative mortality and morbidity are prospectively collected.^3,4 Studies have shown that NSQIP more reliably captures surgical complications and mortality when compared with systems that use administrative and claims data.^5–8 Recently, the ACS has used the NSQIP database to develop the NSQIP surgical risk calculator (SRC), a web-based tool that allows users to select a current procedural terminology (CPT) code for a particular surgery and to enter 21 preoperative patient factors (e.g., demographics, comorbidities). Using regression models, the SRC analyzes the patient factors and predicts 30-day postoperative outcomes.⁴

Interest in the NSQIP database by urologic oncology has grown rapidly, with studies querying the database for surgeries such as nephrectomy,^9,10 radical prostatectomy,^11,12 and radical cystectomy (RC).^13–15 In this study, we sought to compare the postoperative complications reported in the urologic oncology expert literature with data from the NSQIP database by using the SRC to generate predicted 30-day complications.

Methods

Three major urologic procedures were selected for comparison with the NSQIP database: laparoscopic radical nephrectomy (LRN), robotic-assisted radical prostatectomy (RARP), and RC. We searched PubMed for all English-language literature published from 1990 to 2014. Search terms for each procedure were combined with the keyword “complications.” A total of 214, 134, and 982 papers were initially returned for LRN, RARP, and RC, respectively. Articles were compiled into a database, and abstracts were then screened for relevance. Subsequently, the bibliographies of relevant studies, reviews, and international guidelines were hand-searched. Bibliographies of the retrieved literature were cross-searched manually for additional publications. All available publications describing peri-operative complications for LRN, RARP, or RC patients were evaluated. Randomized, prospective observational, and retrospective observational studies were included. Studies were excluded if they reported only subsets of larger cohorts (e.g., studies only reporting LRN outcomes for T2 or larger renal tumors). For RC, studies were selected regardless of type of urinary diversion. Initially, 29 LRN, 17 RARP, and 20 RC studies met the inclusion criteria. From these, the 10 largest papers for each of the three surgeries were selected. If an institution's patient cohort was described in multiple historical series, the largest (and usually most contemporary) publication was selected.

To compare expert-reported outcomes with NSQIP SRC-predicted outcomes, an index patient was created from each expert study. Index patients were defined by the average demographics reported in each study (e.g., age, body mass index [BMI], American Society of Anesthesiologists [ASA] score). For studies not reporting demographics, an index patient was created based on the averaged cumulative patient factors of the remaining studies within that surgery group. Each study's index patient preoperative factors were entered into the online SRC along with the appropriate CPT code, and predicted 30-day postoperative complication rates were generated for each study. For RC studies that included different types of urinary diversion, both orthotopic neobladder and ileal conduit index patients were generated separately. For each of the three surgeries, overall predicted outcomes were also obtained using a cumulative index patient calculated from averaged patient factors from all 10 studies; studies not specifically reporting complications up to 30 days were not used to calculate these cumulative index patients.
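The mapping from averaged demographics to a categorical index patient can be sketched as follows. This is only an illustration under stated assumptions, not the procedure the authors actually followed: the function and class names are hypothetical, the BMI bins are the standard WHO cut-offs, the two oldest age bins are assumed (only "<65" and "65–74" appear in the table footnotes), and the half-up ASA rounding rule is assumed. The inputs are the cumulative averages given in the Results.

```python
import math
from dataclasses import dataclass
from typing import Optional


def age_category(mean_age: float) -> str:
    """Bin a mean age into an age group.

    Only "<65" and "65-74" appear in the table footnotes; the two older
    bins are an assumption about how the calculator groups age.
    """
    if mean_age < 65:
        return "<65"
    if mean_age < 75:
        return "65-74"
    if mean_age < 85:
        return "75-84"
    return ">=85"


def bmi_category(mean_bmi: float) -> str:
    """Bin a mean BMI (kg/m^2) using the standard WHO cut-offs."""
    if mean_bmi < 18.5:
        return "underweight"
    if mean_bmi < 25:
        return "normal"
    if mean_bmi < 30:
        return "overweight"
    if mean_bmi < 35:
        return "class 1 obese"
    if mean_bmi < 40:
        return "class 2 obese"
    return "class 3 obese"


def asa_class(mean_asa: float) -> int:
    """Round a mean ASA score half-up to a whole class (assumed rule)."""
    return math.floor(mean_asa + 0.5)


@dataclass
class IndexPatient:
    age: str
    bmi: str
    asa: int


def make_index_patient(mean_age: float, mean_bmi: float, mean_asa: float,
                       asa_override: Optional[int] = None) -> IndexPatient:
    """Build a categorical index patient from averaged demographics."""
    asa = asa_override if asa_override is not None else asa_class(mean_asa)
    return IndexPatient(age_category(mean_age), bmi_category(mean_bmi), asa)


# Cumulative averages from the Results (age, BMI, ASA).
lrn = make_index_patient(59.8, 26.9, 2.5)
rarp = make_index_patient(60.2, 27.3, 2.2)
# The RC mean ASA of 2.4 was rounded up to 3 by the authors, so it is overridden here.
rc = make_index_patient(66.1, 26.9, 2.4, asa_override=3)
print(lrn, rarp, rc, sep="\n")
```

The override parameter exists because a mean ASA score between two classes is a judgment call; the RC index patient is the case in point.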

NSQIP SRC-predicted outcomes are reported in 11 categories as follows:¹⁶ pneumonia; cardiac (cardiac arrest or myocardial infarction); surgical site infection (SSI) (superficial incisional, deep incisional, or organ space); renal failure; urinary tract infection (UTI); venous thromboembolism (VTE); and return to operating room. In addition, “any complications” include all of what has been mentioned earlier, plus wound disruption, unplanned intubation, ventilator use >48 hours, stroke, or systemic sepsis. “Serious complications” are similar to “any complications,” except that superficial incisional SSI, ventilator >48 hours, and stroke are not considered serious. Death is the 10th outcome, and according to SRC categorization,¹⁶ is included under “serious complications” but not “any complications.” The 11th outcome is the predicted hospital length of stay (LOS).

Reported complications from each of the 30 studies were extracted and tabulated according to the SRC complication categories listed earlier. Only complications that were reported according to the NSQIP criteria were included for analysis. Each of the 30 studies was also assessed for risk of bias in postoperative outcomes reporting using the Cochrane Collaboration's tool for assessing risk of bias.¹⁷ Cumulative expert-reported and SRC-predicted outcomes were compared using one-sample test of proportions for rates and one-sample t-test for LOS. Statistical significance was assumed when p<0.05. Statistical analysis was performed using R version 3.1.3.
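The two comparisons named above can be sketched in a few lines. This is a minimal illustration, not the authors' R code: it uses a plain z-test for the proportion and a normal approximation for the p-value of the mean test, whereas the R routines the authors used may differ (for example, by applying a continuity correction or the exact t distribution). The counts and sample sizes in the example are hypothetical, chosen only to show the mechanics.

```python
import math


def normal_two_sided_p(z: float) -> float:
    """Two-sided p-value for a standard normal statistic."""
    return math.erfc(abs(z) / math.sqrt(2))


def one_sample_proportion_test(events: int, n: int, p0: float):
    """z-test of an observed proportion against a fixed reference p0."""
    p_hat = events / n
    se = math.sqrt(p0 * (1 - p0) / n)
    z = (p_hat - p0) / se
    return z, normal_two_sided_p(z)


def one_sample_mean_test(mean: float, sd: float, n: int, mu0: float):
    """t statistic for a sample mean against a fixed reference mu0.

    The p-value uses the normal approximation, which is only reasonable
    for large n; an exact test would use the t distribution with n-1 df.
    """
    t = (mean - mu0) / (sd / math.sqrt(n))
    return t, normal_two_sided_p(t)


# Hypothetical inputs: 19 events in 1000 patients against a predicted 3.5% rate.
z, p_rate = one_sample_proportion_test(events=19, n=1000, p0=0.035)

# Hypothetical n of 500 with a mean LOS of 2.5 days (SD 1.7) against a predicted 3 days.
t, p_los = one_sample_mean_test(mean=2.5, sd=1.7, n=500, mu0=3.0)

print(f"proportion: z={z:.2f}, p={p_rate:.4f}")
print(f"LOS: t={t:.2f}, p={p_los:.2g}")
```

In both tests the SRC prediction is treated as a fixed reference value with no sampling error of its own, which is what makes them one-sample tests.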

Results

The surgical outcomes in 30 studies, which included 2503 LRN,^18–27 8924 RARP,^28–37 and 4687 RC^38–47 patients, were analyzed. One study³¹ was a prospective cohort trial comparing RARP with radical retropubic prostatectomy. The remaining 29 studies were retrospective in design, although 16 accrued their data prospectively. Risk of bias in outcome reporting is shown in Figure 1. Average age was 59.8, 60.2, and 66.1 years; average ASA score was 2.5, 2.2, and 2.4; and average BMI was 26.9, 27.3, and 26.9 kg/m² for LRN, RARP, and RC, respectively. Fifteen of 30 studies did not report ASA scores, while 13 of 30 studies did not include BMI. The average ASA score was 2.4 in the RC studies; therefore, SRC-predicted outcomes were calculated for both ASA 2 and 3. Comorbidity data (e.g., CCI, diabetes, cardiovascular disease) and duration of follow-up were reported in 8 out of 30 and 17 out of 30 studies, respectively.

FIG. 1.

Risk of bias in urological studies reporting complications as evaluated by Cochrane Collaboration's tool. *Attrition is assumed to be zero, as authors assess only complications occurring before discharge. ^†Primarily assesses those complications leading to death. ^‡Only complications leading to hospital readmissions are captured.

Tables 1–3 outline the complication rates from each study in comparison with SRC-predicted outcomes for index patients. Fifty-two percent (172 of 330) of the 11 NSQIP SRC outcome categories were captured by the 30 expert studies. UTI (9 of 30 studies) and pneumonia (10 of 30 studies) were the least frequently reported complications, while death (27 of 30 studies) and VTE (23 of 30 studies) were the most regularly reported.

Table 1.

Laparoscopic Radical Nephrectomy—Comparison of Expert-Reported and NSQIP-Predicted Outcomes

Study	N	Serious complication (%)	Any complication (%)	Pneumonia (%)	Cardiac (%)	SSI (%)	UTI (%)	VTE (%)	ARF (%)	ROR (%)	Death (%)	LOS (days), mean (SD)
Jeong et al.^18,a	631	NR	NR	NR	NR	NR	NR	NR	0.8	NR	NR	NR
NSQIP^b		4.8	7.7	0.8	0.4	1.7	0.9	0.4	0.6	0.4	0.4	3
Permpongkosol et al.^19,c	549	NR	NR	NR	NR	NR	NR	NR	NR	NR	0.2	NR^d
NSQIP^e		4.8	5.8	0.3	0.2	1.5	0.7	0.3	0.3	1.9	0.1	2
Gabr et al.²⁰	255	NR	NR	NR	NR	NR	NR	NR	NR	NR	0.4	2.6 (1.7)
NSQIP^f		5	6.3	0.3	0.1	1.9	0.7	0.3	0.4	1.9	0.1	2
Steinberg et al.^21,a	231	NR	NR	NR	0	NR	NR	0.4	0.9	NR	0.9	1.5^d
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
Wu et al.^22,a	188	NR	NR	NR	NR^h	0.5	NR	2.7	NR	NR	0.5	2.5 (1.6)
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
Cadeddu et al.^23,a	157	NR	NR	NR	NR	0.6	1.3	1.3ⁱ	NR	1.9	1.3	NR
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
Gong et al.^24,a	141	NR	NR	NR	NR	NR	NR	0.7ⁱ	2.1	0.7	0.7	2.1 (1.7)
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
Wille et al.^25,a	125	NR	NR	NR	NR	NR	NR	NR	NR	1.6	NR	6^j
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
Matin et al.²⁶	113	NR	NR	NR	0	NR	NR	NR	0	NR	0	2
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
Simon et al.^27,a	113	NR	NR	NR	NR	NR	NR	NR	NR	0.9	0	NR
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
Cumulative experts^k	2503	NR	NR	NR	0	NR	NR	NR	0.0	NR	0.2	2.5 (1.7)
NSQIP^g		4.8	7.9	0.7	0.4	1.9	0.9	0.4	0.7	2.4	0.3	3
p-Value		—	—	—	0.50	—	—	—	0.37	—	0.65	<0.001

Highlighted cells contain outcomes rates for which a comparison was possible.

Bold text indicates a p-value<0.05.

^a Duration of follow-up not indicated.

^b Index patient: male, age <65, BMI normal, ASA 3, clean-contaminated.

^c Reports on all laparoscopic procedures performed from 1993 to 2005. Only reports gross number of complications for LRN by Clavien classification.

^d Median LOS.

^e Index patient: male, age <65, BMI overweight, ASA 2, clean-contaminated.

^f Index patient: male, age <65, BMI class 1 obese, ASA 2, clean-contaminated.

^g Index patient: male, age <65, BMI overweight, ASA 3, clean-contaminated.

^h Does not distinguish myocardial infarctions from nonspecific arrhythmias.

^i Reports pulmonary embolism but omits deep vein thrombosis.

^j Does not state if 6 days is mean or median LOS.

^k Only uses data from studies reporting 30-day postoperative complications.

SSI=surgical site infection; UTI=urinary tract infection; VTE=venous thromboembolism; ARF=acute renal failure; ROR=return to operating room; LOS=length of stay; SD=standard deviation; NR=not reported/not assessed; NSQIP=National Surgical Quality Improvement Program; BMI=body mass index; ASA=American Society of Anesthesiologists; LRN=laparoscopic radical nephrectomy.

Table 2.

Robotic-Assisted Radical Prostatectomy—Comparison of Expert-Reported and NSQIP-Predicted Outcomes

Study	n	Serious complication (%)	Any complication (%)	Pneumonia (%)	Cardiac (%)	SSI (%)	UTI (%)	VTE (%)	ARF (%)	ROR (%)	Death (%)	LOS (days), mean (SD)
Agarwal et al.²⁸	3317	2.2	2.3	0.06	0.03	0.09	0.1	0.2	0.03	1.4	0.03	1.2 (1.5)
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Coelho et al.²⁹	2500	1.4	1.9	NR	0.2	0.6	0.2	0.3	NR	0.5	0	1.25
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Pierorazio et al.^30,b	1422	1.1	1.0	0.1	0.07	NR	NR	0	0.1	0.6	0.07	1.96
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Nelson et al.^31,c,d	629	NR	NR	NR	NR	0.3	1.3	0.6	NR	NR	NR	1.17
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Novara et al.^32,b,e	415	3.1	3.4	NR	0.2	0.2	NR	0.2	0.2	2.7	0	6^f
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Murphy et al.^33,d	400	NR	NR	NR	NR	NR	NR	NR	NR	4.5	0	3.1 (1.4)
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Joseph et al.^34,d	325	NR	NR	NR	0.9	NR	NR	1.5	NR	0.3	0	NR
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Hu et al.^35,d	322	1.6	3.4	0	0	1.9	NR	0.6	NR	0.9	0	NR
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Zorn et al.^36,d	300	NR	NR	NR	0.7	2.0	NR	0.7	NR	0.3	0	1.4
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Krambeck et al.³⁷	294	3.4	3.4	NR	0	NR	1.0	0.7	0	NR	0	NR
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
Cumulative experts^g	8924	1.9	2.2	0.1	0.1	0.3	0.2	0.3	0.03	1.0	0.02	1.4 (1.6)
NSQIP^a		3.5	3.9	0.1	0.1	0.5	0.8	0.6	0.1	0.8	0	1
p-Value		<0.001	<0.001	1	1	0.025	<0.001	0.002	0.17	0.09	<0.001 ^h	<0.001

Highlighted cells contain outcomes rates for which a comparison was possible.

Bold text indicates a p-value<0.05.

^a Index patient: male, age <65, BMI overweight, ASA 2, clean-contaminated.

^b Reports only complications occurring before initial discharge.

^c Complications listed based on unscheduled visits and hospital readmissions.

^d Duration of follow-up not indicated.

^e Reports complications till postoperative 90 days.

^f Median LOS.

^g Only includes data from studies reporting 30-day postoperative complications.

^h NSQIP predicted death rate set at 0.0001% for the null hypothesis value in test-for-one-proportion.

Table 3.

Radical Cystectomy—Comparison of Expert-Reported and NSQIP-Predicted Outcomes

Study	n	Neobladder (%)	Serious complication (%)	Any complication (%)	Pneumonia (%)	Cardiac (%)	SSI (%)	UTI (%)	VTE (%)	ARF (%)	ROR (%)	Death (%)	LOS (days), mean (SD)
Quek et al.^38,a	1359	NR	NR	NR	NR	NR	NR	NR	1.8^b	NR	NR	2.0	NR
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
Shabsigh et al.^39,d	1142	36.6	47.1	55.4	3.9	1.3	13.6	9.9	8.4	2.8	3.3	2.7	9^e
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4	1.2	10.0
Hautmann et al.^40,d	1013	100	21.5	20.4	2.1	0.3	3.6	NR^f	2.8	1.8	NR	2.3	NR
NSQIP^g Neobladder			18.3	29.1	2	1.1	14.3	6.5	3.5	1.8	4.1	0.7	9.5
Stimson et al.^41,h	753	33.2	NR	NR	NR	NR	2.0	1.7	1.5	0.9	NR	2.1	6.0^e
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4.0	1.2	10.0
Frazier et al.⁴²	675	7.4	22.1	25.0	1.8	1.9	10.2	NR	2.2	0.3	NR	2.5	19.0^e
NSQIP^g Ileal conduit			15.7	25.1	2.7	0.9	12.3	4	4.5	1.8	5.4	0.8	8.0
Roghmann et al.^43,d	535	34.8	39.3	36.3	5.0	2.8	7.7	13.5	5.0	0.2	NR	3.9	19^e
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4	1.2	10.0
Novotny et al.⁴⁴	516	36.8	NR	NR	NR	1.4	NR	NR	6.4	NR	6.2	0.8	21.2 (6.8)
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4	1.2	10.0
Lee et al.⁴⁵	498	40.2	NR	NR	1.2	NR	8.2	10.4	1.2^d	NR	NR	1.6	8.0^e
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4	1.2	10.0
Studer et al.⁴⁶	482	100	NR	NR	1.7ⁱ	1.2	NR	NR	5.2	NR	NR	1.7	NR
NSQIP^c Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4.0	1.2	10.0
Schiavina et al.⁴⁷	404	36.6	26.5	23.8	1.2	1.2	0.5^j	NR	2.2	3.7	7.4	3.2	15.0 (8.6)
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4	1.2	10.0
Cumulative experts^k	4687	39.7	23.7	24.6	1.5	1.5	5.5	5.2	3.3	1.3	6.7	2.0	18.5 (8.3)
NSQIP^c Ileal conduit			17.3	26.9	3.5	1.4	11.7	5.2	5.6	2.3	5.3	1.3	9.0
p-Value			<0.001	0.083	<0.001	0.72	<0.001	0.99	<0.001	0.005	0.051	<0.001	<0.001
Neobladder			20.1	31.1	2.5	1.8	13.6	8.4	4.4	2.2	4	1.2	10.0
p-Value			0.003	<0.001	0.004	0.29	<0.001	<0.001	0.004	0.009	<0.001	<0.001	<0.001

Highlighted cells contain outcomes rates for which a comparison was possible.

Bold text indicates a p-value<0.05.

^a Primarily analyzes perioperative mortality.

^b Reports pulmonary embolism but omits deep vein thrombosis.

^c Index patient: male, age 65–74, BMI overweight, ASA 3, contaminated.

^d Reports complications till 90 days postoperative.

^e Median LOS.

^f Reports 176 “UTI/pyelonephritis” but does not specify UTIs alone. These data were not added to “Any” or “Serious” complications category.

^g Index patient: male, age <65, BMI overweight, ASA 3, contaminated.

^h Complications were captured based on hospital readmissions.

^i Acute respiratory distress syndrome was combined with pneumonia.

^j Reports “abscesses” under infectious complications but does not list any other type of SSI.

^k Only includes data from studies reporting 30-day postoperative complications.

Statistical analyses comparing cumulative expert rates with predicted NSQIP outcomes are shown in the last rows of Tables 1–3. For LRN, only mean LOS was found to be significantly different between that reported by experts and that predicted by SRC (2.5 days vs 3 days, p<0.001) (Table 1). Compared with the RARP SRC index patient, centers of excellence had significantly better rates for both overall and serious complications, including lower SSI, UTI, and VTE rates (Table 2). For RC, experts had statistically lower pneumonia, SSI, VTE, and acute renal failure (ARF) rates, as well as lower UTI and overall complication rates after adjusting for complexity of urinary diversion type (Table 3). However, rates of serious complications and death regardless of RC diversion type were cumulatively higher at centers of excellence when compared with NSQIP index patients. Experts had greater reoperation rates than the NSQIP neobladder index patient.

Discussion

A comparison of outcomes and complications allows for education and improvement in surgery. As a result, there is marked interest in quantifying associated risks. In addition, projecting risk to the prospective surgical patient facilitates informed consent. While reports from expert urologic oncologists have been enlightening, the rigorous assessment of outcomes data is hindered by lack of standardized reporting. There was a lack of standardization among the 30 expert studies included in this analysis, making a comparison with NSQIP-predicted outcomes challenging.

When compared with NSQIP outcomes categories, some complication data were not directly reported in the expert literature. In total, 52% (172/330) of the 11 SRC outcomes were found in the manuscripts assessed. This difference was the most pronounced for LRN, wherein 28% (31/110) of outcomes were reported. Even for VTE (one of the most consistently reported complications [23/30]), four studies^23,24,38,45 listed pulmonary embolisms but did not record the number of deep vein thromboses. Thus, the expert VTE rates used in this analysis may be underreported.
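The LRN coverage figure can be recomputed directly from the expert rows of Table 1. The sketch below is illustrative only: the variable and function names are hypothetical, and the rows are transcribed from Table 1 with footnote markers and standard deviations stripped. Any cell other than "NR" is counted as a reported outcome.

```python
from statistics import median

# Outcome columns of Table 1, in order.
OUTCOMES = ["serious", "any", "pneumonia", "cardiac", "SSI", "UTI",
            "VTE", "ARF", "ROR", "death", "LOS"]

# Expert rows of Table 1 (footnote markers and SDs removed).
LRN_ROWS = {
    "Jeong": "NR NR NR NR NR NR NR 0.8 NR NR NR",
    "Permpongkosol": "NR NR NR NR NR NR NR NR NR 0.2 NR",
    "Gabr": "NR NR NR NR NR NR NR NR NR 0.4 2.6",
    "Steinberg": "NR NR NR 0 NR NR 0.4 0.9 NR 0.9 1.5",
    "Wu": "NR NR NR NR 0.5 NR 2.7 NR NR 0.5 2.5",
    "Cadeddu": "NR NR NR NR 0.6 1.3 1.3 NR 1.9 1.3 NR",
    "Gong": "NR NR NR NR NR NR 0.7 2.1 0.7 0.7 2.1",
    "Wille": "NR NR NR NR NR NR NR NR 1.6 NR 6",
    "Matin": "NR NR NR 0 NR NR NR 0 NR 0 2",
    "Simon": "NR NR NR NR NR NR NR NR 0.9 0 NR",
}


def reported_count(row: str) -> int:
    """Count the outcome cells in one table row that are not 'NR'."""
    cells = row.split()
    if len(cells) != len(OUTCOMES):
        raise ValueError("row does not have one cell per outcome")
    return sum(cell != "NR" for cell in cells)


counts = {study: reported_count(row) for study, row in LRN_ROWS.items()}
total = sum(counts.values())
possible = len(OUTCOMES) * len(LRN_ROWS)

print(f"reported: {total}/{possible} ({total / possible:.0%})")
print("median per study:", median(counts.values()))
print("range:", min(counts.values()), "to", max(counts.values()))
```

Note that a reported rate of 0 counts as reported, while "NR" does not; this mirrors the authors' choice not to treat an unmentioned complication as a rate of zero.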

There were also differences in terminology employed in the expert literature, which limited robust comparisons. For example, some studies' complications were listed as “cardiac,”^20,41,45 “pulmonary,”^18,20,21,41 or “wound”^20,21 without further explanation. In the present analysis, these complications were not counted as myocardial infarction/cardiac arrest, pneumonia, and SSI or wound disruption, respectively. The inexplicit label “wound infection” was counted as an SSI but was not included as a deep incisional or organ space SSI in the “serious complication” category. “Respiratory distress” was not specific enough to qualify for “unplanned intubation,” whereas “acute respiratory distress syndrome” qualified as such. Finally, for studies that listed complications according to Clavien classification, solely reporting a complication as Grade 3 without further detail^40,43 was not considered specific enough to qualify as a return to the operating room.

On comparing LRN series with SRC-predicted outcomes, LOS was the only significant difference, favoring the expert literature (2.5 days vs 3 days, p<0.001). In the largest of the LRN series, the authors acknowledge lack of sufficient perioperative complication data from their multicenter cohort, stating that “complications [data] were not available from some institutions owing to a lack of a common protocol for collecting data at each institution.”¹⁸ A movement toward standardized reporting may clarify any potential differences in complications after LRN.

For RARP, expert series had significantly better cumulative complication rates for a majority of outcomes categories, including SSI, VTE, and serious complications. While these lower rates likely reflect the inverse relationship between hospital volume and perioperative morbidity and mortality,^48–50 only half of the studies mention that effort was taken to obtain outpatient follow-up information, which may introduce attrition bias (Fig. 1). Therefore, complications that do not occur immediately postoperatively (e.g., SSI, VTE) may be under-captured. Pierorazio et al. reported on their series of 1422 RARP patients and presented immediate perioperative morbidity and LOS.³⁰ The lack of 30-day complication data limits the assessment of perioperative events to subsequent morbidity. Two patients from the entire expert RARP cohort died, one from a suspected infection 23 days postoperatively²⁸ and one from an aspiration event on postoperative day 5 (Table 2).³⁰ The cumulative expert mortality rate of 0.02% was statistically greater than the SRC-predicted rate of 0%. However, it is likely that these deaths were anomalies and the statistical difference is narrow. In addition, data from the entire NSQIP database indicate that national mortality rates for RARP are around 0.05%,¹² which is not significantly different from expert rates (p=0.3).

In RC, the experts demonstrated higher rates of serious complications and death than were predicted by the SRC (Table 3). This was an unanticipated finding given that population-based studies of RC have consistently shown that high-volume institutions have lower morbidity and mortality than lower-volume institutions.^51–53 The most likely explanation for our findings is that the RC index patient used for SRC-predicted outcomes did not accurately represent the true comorbidity and disease-severity status of patients undergoing RC at centers of excellence. We attempted to adjust for this by rounding the cumulative expert-reported ASA score of 2.4 up to 3. However, lack of standardized reporting of comorbidities (e.g., cardiac disease or diabetes) by most of the RC studies prevented us from further risk-adjusting the index patients. A recent analysis of the NSQIP database for perioperative outcomes after RC showed that baseline comorbidity status was indeed associated with increased odds of complications, including a 2.4 times increased risk of death among patients with cardiovascular disease.¹⁴ We were not able to fully incorporate these risk factors into the index patients used in this study. Further, the SRC does not currently allow for the input of important preoperative risk factors, such as weight loss or serum albumin levels, which could clearly impact the surgical outcomes of the more comorbid RC patients seeking care at tertiary centers. The “surgeon adjustment of risks” option also was not employed in the SRC model for purposes of consistency.

NSQIP represents one possible vehicle for prospective collection of data of the perioperative outcomes and complications of a wide variety of surgeries. The NSQIP database is reproducible, standardized, and validated. The database is populated by persons specially trained and audited, who meticulously gather preoperative through 30-day postoperative data on surgical patients using strict comorbidity and adverse event definitions. Irrespective of discharge status, patients are followed for 30 days after surgery either by manual review of medical records or through personal communication with patients and outside physicians.⁵⁴ Furthermore, NSQIP may actually be improving care in the hospitals that currently employ it. Hall et al. examined trends over time in surgical mortality and morbidity from 118 hospitals participating in NSQIP and found 66% improved risk-adjusted mortality and 82% improved risk-adjusted complication rates. The authors estimate that, on average, participation in NSQIP may have resulted in each institution avoiding more than 200 complications and 12 to 36 deaths.⁵⁵ The NSQIP database is not without limitations. Currently, it lacks consideration of oncologic severity (i.e., grade and stage), as well as risk stratification for preoperative factors that are common in cancer patients (e.g., weight loss, serum albumin levels). NSQIP also does not currently subcategorize complications by severity grade, such as by the Clavien-Dindo scoring system. The database is also deficient in surgery-specific outcomes (e.g., impotence and incontinence for RARP), and therefore it is subject to much of the same reporting biases noted in several expert series (Fig. 1). In some surgeries, complications after 30 days are relatively common; these complications would not be illustrated by NSQIP. Further, the SRC is based on NSQIP data from 2009 to 2012; it does not yet incorporate newly accrued surgical data into its models. Finally, the SRC does not allow for surgeon- and institution-specific factors, such as surgical volume and provider experience. A recent study using the SRC in laparoscopic colectomy patients suggests that the calculator accurately predicts outcomes for average surgical risk patients but may not accurately predict outcomes for serious complications.⁵⁶ This highlights the need for further external validation of the SRC and improved risk-stratification models.

Our study is not without limitations. The creation of index patients was limited by a finite quantity of available risk factors reported in the expert series. Therefore, the index patients may not accurately represent the population of patients seen at tertiary care centers. In addition, we did not assume that studies that made no mention of certain adverse events had a complication rate of zero for that particular outcome. For example, if a study did not specify that data on renal failure were collected and postoperative kidney function was not listed, that study was recorded as having not assessed ARF. Thus, series relying on an assumption that reviewers would interpret no mention of a complication as absence of that complication were under-captured by this study. The Cochrane Collaboration's tool for assessing risk of bias was originally developed to evaluate the methodological quality of randomized controlled trials. Its use herein to evaluate institutions' surgical case series should not be viewed as a critique of these series' lack of randomization or blinding. Rather, its use is intended to draw attention to biases commonly found in observational studies. Finally, several of the expert series used in this study predate SRC data by several years. Improvements in surgical technique and surgeon experience make a comparison with more modern SRC outcomes difficult.

It is not necessarily the contention of the findings here that there are significant differences in outcomes between the experiences of experts in the literature and those surgeons participating in NSQIP. Rather, it is noted that clear differences exist in the way that outcomes and complications are reported and that standardizing this process may afford transparent comparisons.

Conclusion

Differences in the style and components of complication reporting within the expert urologic oncology literature exist, making a comparison with predicted outcomes from the highly standardized NSQIP database difficult. The need for standardization in the accrual and reporting of surgical complications may become more critical as healthcare moves toward outcome metrics as measures for quality of care.

Footnotes

Author Disclosure Statement

Sam B. Bhayani is a consultant for Intuitive Surgical, Inc. For the remaining authors, no competing financial interests exist.


References

1. David, Lavengood Jr. Bilateral Wilms' tumor. Treatment, management, and review of the literature. Urology, 1974; 3:71–78.

2. Roupret, Wallerand, Traxer, et al. Checkup and management of upper urinary tract tumours in 2010: An update from the committee of cancer from the French National Association of Urology. Prog Urol, 2010; 20:260–271.

3. Yoshida, Nishimura, Harada, et al. Primary adenocarcinoma of renal pelvis and ureter suspected as metastatic tumor: A case report. Hinyokika Kiyo, 2007; 53:247–250.

4. Bilimoria, Liu, Paruch, et al. Development and evaluation of the universal ACS NSQIP surgical risk calculator: A decision aid and informed consent tool for patients and surgeons. J Am Coll Surg, 2013; 217:833–842.e1–e3.

5. Davenport, Holsapple, Conigliaro. Assessing surgical quality using administrative and clinical data sets: A direct comparison of the University HealthSystem Consortium Clinical Database and the National Surgical Quality Improvement Program data set. Am J Med Qual, 2009; 24:395–402.

6. Cima, Lackore, Nehring, et al. How best to measure surgical quality? Comparison of the Agency for Healthcare Research and Quality Patient Safety Indicators (AHRQ-PSI) and the American College of Surgeons National Surgical Quality Improvement Program (ACS-NSQIP) postoperative adverse events at a single institution. Surgery, 2011; 150:943–949.

7. Koch, Li, Hixson, Tang, Phillips, Henderson. What are the real rates of postoperative complications: Elucidating inconsistencies between administrative and clinical data sources. J Am Coll Surg, 2012; 214:798–805.

8. Steinberg, Popa, Michalek, Bethel, Ellison. Comparison of risk adjustment methodologies in surgical quality improvement. Surgery, 2008; 144:662–667; discussion 662–667.

9. Liu, Leppert, Maxwell, Panousis, Chung. Trends and perioperative outcomes for laparoscopic and robotic nephrectomy using the National Surgical Quality Improvement Program (NSQIP) database. Urol Oncol, 2014; 32:473–479.

10. Corman, Penson, Hur, et al. Comparison of complications after radical and partial nephrectomy: Results from the National Veterans Administration Surgical Quality Improvement Program. BJU Int, 2000; 86:782–789.

11. Korets, Weinberg, Alberts, Woldu, Mann, Badani. Utilization and timing of blood transfusions following open and robot-assisted radical prostatectomy. J Endourol, 2014; 28:1418–1423.

12. Liu, Maxwell, Panousis, Chung. Perioperative outcomes for laparoscopic and robotic compared with open prostatectomy using the National Surgical Quality Improvement Program (NSQIP) database. Urology, 2013; 82:579–583.

13. Hollenbeck, Miller, Taub, et al. Identifying risk factors for potentially avoidable complications following radical cystectomy. J Urol, 2005; 174:1231–1237; discussion 1237.

14. Gandaglia, Varda, Sood, et al. Short-term perioperative outcomes of patients treated with radical cystectomy for bladder cancer included in the National Surgical Quality Improvement Program (NSQIP) database. Can Urol Assoc J, 2014; 8:E681–E687.

15. Monn, Kaimakliotis, Cary, et al. Short-term morbidity and mortality of Indiana pouch, ileal conduit, and neobladder urinary diversion following radical cystectomy. Urol Oncol, 2014; 32:1151–1157.

16. ACS NSQIP. About the ACS Risk Calculator. Available at http://www.riskcalculator.facs.org/Home/About/ (Accessed January 5, 2015).

17. Higgins, Altman, Gotzsche, et al. The Cochrane Collaboration's tool for assessing risk of bias in randomised trials. BMJ, 2011; 343:d5928.

18. Jeong, Rha, Kim, et al. Comparison of laparoscopic radical nephrectomy and open radical nephrectomy for pathologic stage T1 and T2 renal cell carcinoma with clear cell histologic features: A multi-institutional study. Urology, 2011; 77:819–824.

19. Permpongkosol, Link, Su, et al. Complications of 2,775 urological laparoscopic procedures: 1993 to 2005. J Urol, 2007; 177:580–585.

20. Gabr, Gdor, Strope, Roberts, Wolf Jr. Patient and pathologic correlates with perioperative and long-term outcomes of laparoscopic radical nephrectomy. Urology, 2009; 74:635–640.

21. Steinberg, Finelli, Desai

, et al. Laparoscopic radical nephrectomy for large (greater than 7 cm, T2) renal tumors. J Urol, 2004; 172:2172–2176.

22.

, Lesani

, Zhao

, et al. A multi-institutional study on the safety and efficacy of specimen morcellation after laparoscopic radical nephrectomy for clinical stage T1 or T2 renal cell carcinoma. J Endourol, 2009; 23:1513–1518.

23.

Cadeddu

, Ono

, Clayman

, et al. Laparoscopic nephrectomy for renal cell cancer: Evaluation of efficacy and safety: A multicenter experience. Urology, 1998; 52:773–777.

24.

Gong

, Lyon

, Orvieto

, Lucioni

, Gerber

, Shalhav

. Laparoscopic radical nephrectomy: Comparison of clinical Stage T1 and T2 renal tumors. Urology, 2006; 68:1183–1187.

25.

Wille

, Roigas

, Deger

, Tullmann

, Turk

, Loening

. Laparoscopic radical nephrectomy: Techniques, results and oncological outcome in 125 consecutive cases. Eur Urol, 2004; 45:483–488; discussion 488–489.

26.

Matin

, Dhanani

, Acosta

, Wood

. Conventional and hand-assisted laparoscopic radical nephrectomy: Comparative analysis of 271 cases. J Endourol, 2006; 20:891–894.

27.

Simon

, Castle

, Ferrigni

, et al. Complications of laparoscopic nephrectomy: The Mayo clinic experience. J Urol, 2004; 171:1447–1450.

28.

Agarwal

, Sammon

, Bhandari

, et al. Safety profile of robot-assisted radical prostatectomy: A standardized report of complications in 3317 patients. Eur Urol, 2011; 59:684–698.

29.

Coelho

, Palmer

, Rocco

, et al. Early complication rates in a single-surgeon series of 2500 robotic-assisted radical prostatectomies: Report applying a standardized grading system. Eur Urol, 2010; 57:945–952.

30.

Pierorazio

, Mullins

, Ross

, et al. Trends in immediate perioperative morbidity and delay in discharge after open and minimally invasive radical prostatectomy (RP): A 20-year institutional experience. BJU Int, 2013; 112:45–53.

31.

Nelson

, Kaufman

, Broughton

, et al. Comparison of length of hospital stay between radical retropubic prostatectomy and robotic assisted laparoscopic prostatectomy. J Urol, 2007; 177:929–931.

32.

Novara

, Ficarra

, D'Elia

, Secco

, Cavalleri

, Artibani

. Prospective evaluation with standardised criteria for postoperative complications after robotic-assisted laparoscopic radical prostatectomy. Eur Urol, 2010; 57:363–370.

33.

Murphy

, Kerger

, Crowe

, Peters

, Costello

. Operative details and oncological and functional outcome of robotic-assisted laparoscopic radical prostatectomy: 400 cases with a minimum of 12 months follow-up. Eur Urol, 2009; 55:1358–1366.

34.

Joseph

, Rosenbaum

, Madeb

, Erturk

, Patel

. Robotic extraperitoneal radical prostatectomy: An alternative approach. J Urol, 2006; 175:945–951.

35.

, Nelson

, Wilson

, et al. Perioperative complications of laparoscopic and robotic assisted laparoscopic radical prostatectomy. J Urol, 2006; 175:541–546.

36.

Zorn

, Gofrit

, Orvieto

, Mikhail

, Zagaja

, Shalhav

. Robotic-assisted laparoscopic prostatectomy: Functional and pathologic outcomes with interfascial nerve preservation. Eur Urol, 2007; 51:755–762; discussion 763.

37.

Krambeck

, DiMarco

, Rangel

, et al. Radical prostatectomy for prostatic adenocarcinoma: A matched comparison of open retropubic and robot-assisted techniques. BJU Int, 2009; 103:448–453.

38.

Quek

, Stein

, Daneshmand

, et al. A critical analysis of perioperative mortality from radical cystectomy. J Urol, 2006; 175:886–890.

39.

Shabsigh

, Korets

, Vora

, et al. Defining early morbidity of radical cystectomy for patients with bladder cancer using a standardized reporting methodology. Eur Urol, 2009; 55:164–174.

40.

Hautmann

, de Petriconi

, Volkmer

. Lessons learned from 1,000 neobladders: The 90-day complication rate. J Urol, 2010; 184:990–994; quiz 1235.

41.

Stimson

, Chang

, Barocas

, et al. Early and late perioperative outcomes following radical cystectomy: 90-day readmissions, morbidity and mortality in a contemporary series. J Urol, 2010; 184:1296–1300.

42.

Frazier

, Robertson

, Paulson

. Complications of radical cystectomy and urinary diversion: A retrospective review of 675 cases in 2 decades. J Urol, 1992; 148:1401–1405.

43.

Roghmann

, Trinh

, Braun

, et al. Standardized assessment of complications in a contemporary series of European patients undergoing radical cystectomy. Int J Urol, 2014; 21:143–149.

44.

Novotny

, Hakenberg

, Wiessner

, et al. Perioperative complications of radical cystectomy in a contemporary series. Eur Urol, 2007; 51:397–401; discussion 401–402.

45.

Lee

, Dunn

, Chen

, Joshi

, Sheffield

, Montie

. Impact of body mass index on radical cystectomy. J Urol, 2004; 172:1281–1285.

46.

Studer

, Burkhard

, Schumacher

, et al. Twenty years experience with an ileal orthotopic low pressure bladder substitute—Lessons to be learned. J Urol, 2006; 176:161–166.

47.

Schiavina

, Borghesi

, Guidi

, et al. Perioperative complications and mortality after radical cystectomy when using a standardized reporting methodology. Clin Genitourin Cancer, 2013; 11:189–197.

48.

Birkmeyer

, Siewers

, Finlayson

, et al. Hospital volume and surgical mortality in the United States. N Engl J Med, 2002; 346:1128–1137.

49.

Begg

, Cramer

, Hoskins

, Brennan

. Impact of hospital volume on operative mortality for major cancer surgery. JAMA, 1998; 280:1747–1751.

50.

Barocas

, Mitchell

, Chang

, Cookson

. Impact of surgeon and hospital volume on outcomes of radical prostatectomy. Urol Oncol, 2010; 28:243–250.

51.

Konety

, Dhawan

, Allareddy

, Joslyn

. Impact of hospital and surgeon volume on in-hospital mortality from radical cystectomy: Data from the health care utilization project. J Urol, 2005; 173:1695–1700.

52.

Leow

, Reese

, Trinh

, et al. The impact of surgeon volume on the morbidity and costs of radical cystectomy in the United States: A contemporary population-based analysis. BJU Int, 2015; 115:713–721.

53.

Zakaria

, Santos

, Dragomir

, Tanguay

, Kassouf

, Aprikian

. Postoperative mortality and complications after radical cystectomy for bladder cancer in Quebec: A population-based analysis during the years 2000–2009. Can Urol Assoc J, 2014; 8:259–267.

54.

Shiloach

, Frencher

Jr. , Steeger

, et al. Toward robust information: Data quality and inter-rater reliability in the American College of Surgeons National Surgical Quality Improvement Program. J Am Coll Surg, 2010; 210:6–16.

55.

Hall

, Hamilton

, Richards

, Bilimoria

, Cohen

, Ko

. Does surgical quality improve in the American College of Surgeons National Surgical Quality Improvement Program: An evaluation of all participating hospitals. Ann Surg, 2009; 250:363–376.

56.

Cologne

, Keller

, Liwanag

, Devaraj

, Senagore

. Use of the American College of Surgeons NSQIP Surgical Risk Calculator for Laparoscopic Colectomy: How good is it and how can we improve it?. J Am Coll Surg, 2015; 220:281–286.