Abstract
Objective:
Endourology continues to grow with the introduction of new technologies into clinical practice. Simulators and training models have been developed to improve comfort and proficiency in endoscopic procedures. The purpose of this systematic review was to examine the current educational interventions utilized to improve the performance of endourology trainees and to critically appraise the strengths and limitations of each.
Methods:
A search of the Ovid MEDLINE, EMBASE, PsycINFO, and the Cochrane Library databases was performed to identify literature focused on current educational interventions for improving technical skills of trainees in endourologic procedures. The Medical Education Research Study Quality Instrument (MERSQI) was used to evaluate the methodological quality of the abstracted articles.
Results:
Of the 2236 articles identified, 22 met the inclusion criteria. The types of educational interventions included: bench/wet lab models, virtual reality simulators, and instructional courses. Metrics used to quantify the impact of these interventions include global rating scales, Objective Structured Assessment of Technical Skills (OSATS) scores, and task-specific checklists. The setting of these evaluations comprises both virtual reality simulators and live surgery.
Conclusions:
In the surgical education literature, simulation-based training and assessment continues to play a prominent role in urologic training. The educational interventions highlighted in this review address various aspects of endourology, from stone management to transurethral resection. Additional work is needed to correlate technical performance in clinical and nonclinical settings with patient outcomes and develop a focused approach to nontechnical skill training.
Introduction
Urologic education is evolving from the Halstedian apprenticeship model of graduated autonomy in surgery toward a more proactive approach, including simulation and competency-based training. In the current climate of reduced work hours for residents and financial constraints, educators and training programs are facing increasing pressures to improve efficiency in training without compromising patient safety. These limitations have driven investment in ex vivo education in surgery, with formal curricula incorporating this modality to better prepare trainees for the operating room experience. 1
Endourology, in particular, has faced unique training challenges due to the rapid implementation of newer more sophisticated technologies. 2 For example, the number of approaches for benign prostatic hyperplasia (BPH) surgery has exploded in the last decade with the adoption of procedures such as Holmium Laser Enucleation of the Prostate (HoLEP), GreenLight™ Laser Photoselective Vaporization of the Prostate (PVP), and Aquablation® into clinical guidelines. 2
Stone management and transurethral resection remain key components of a urologist's surgical repertoire. As such, endoscopic skills are relevant in both community/private and academic practices and must be mastered by graduating residents. In recent years, many groups have developed simulators and training models to improve comfort and proficiency in endoscopic procedures. The purpose of this systematic review is to examine current educational interventions to improve endourologic performance of trainees and to critically appraise the strengths and limitations of each intervention. In addition, a contemporary framework was used to evaluate the quality of the evidence presented in the included studies.
Methods
This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol. 3
Eligibility criteria
Articles exploring educational interventions to improve the endoscopic procedural skills of urology trainees at all levels (medical students, residents, fellows, and staff) were included. Endoscopic procedures were broadly categorized into cystoscopy, upper tract stone management, transurethral resection, and others. Studies that did not include urology participants were excluded. Studies which did not assess participants pre- and poststudy were excluded. Articles were eligible for inclusion regardless of publication status and study type. Reviews, editorials, opinion letters, and case reports were excluded.
Information sources
A search was conducted in Ovid MEDLINE, Embase Classic, PsycINFO, and the Cochrane Library on January 18, 2019.
Search
The search strategy was designed to capture educational interventions impacting the complete surgical repertoire of a urologist, including open surgical, laparoscopic, endoscopic, and robot-assisted surgical skills. Medical subject headings (MeSH) terms included “clinical competence,” “cognitive,” “computer simulation,” “curriculum,” “education, medical, graduate,” “endoscopes,” “laparoscopy,” “non-technical,” “performance,” “robotic surgical procedure,” “simulation training,” “skill,” “surgery,” “surgical training,” “technical,” “urologic surgical procedures,” “urologists,” and “urology.”
Study selection
Titles and abstracts were independently screened by the authors for full-text review. Articles referenced in the included studies and previous review articles were eligible. Disagreements were resolved by consensus.
Data collection
The following data were extracted from the included articles: type of endoscopic procedure, setting and type of intervention, sample size, participant type, measures of improvement, limitations of the intervention, and information relevant to the assessment of study quality.
Evaluation of study quality
The Medical Education Research Study Quality Instrument (MERSQI) is a standardized tool used to assess the methodological quality of medical education research. 4 The MERSQI tool was used to evaluate the following domains in the included articles: study design, sampling, type of data, validity of the evaluation instrument, appropriateness and sophistication of data analysis, and outcomes. A comprehensive explanation of the components of MERSQI is provided in Appendix A1. 4
Results
The search terms yielded 2236 articles, of which 22 were selected for full text review (Fig. 1). These articles are summarized in Table 1 with the corresponding MERSQI scores listed. As mentioned, studies were organized into the following categories: cystoscopy, upper tract stone management, transurethral resection, and others. Of the studies included, six were rated as high quality, defined as a MERSQI score ≥14.

Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow chart.
Studies included, MERSQI Scores, and Limitations
EUR = Euros; F = fellow; fURS = flexible ureteroscopy; GRS = Global rating scale; MERSQI, Medical Education Research Study Quality Instrument; MS = medical student; OSATS = Objective Structured Assessment of Technical Skills; R = resident; RIRS = retrograde intrarenal surgery; S = staff; SBT = simulation-based training.
Cystoscopy
Flexible cystoscopy is a fundamental diagnostic skill in urology and is often taught to junior trainees. To date, three studies have explored educational interventions to improve cystoscopic proficiency, of which two were simulation based and one was a bench model. Hu et al. found that transparent anatomical models were more effective than nontransparent models in cystoscopy-naive trainees using the Uro-Scopic Trainer, with a mean Objective Structured Assessment of Technical Skills (OSATS) score of 21.83 ± 3.64 and 18.50 ± 4.03 (p = 0.022), respectively. 5 The benefits of the transparent model include allowing trainees to appreciate the internal anatomy and receive visual feedback during the procedure, while enabling educators to provide immediate and relevant feedback. The UroMentor by Simbionix is a virtual reality simulator widely used to practice endoscopic procedures. 6 Following eight sessions on the UroMentor, trainees were observed to have faster procedure times (511 ± 67 seconds vs 111 ± 10 seconds, p < 0.001), fewer iatrogenic injuries (12 ± 2–5 ± 1 times, p < 0.001), improved identification of digital markers (9 ± 0–10 ± 1, p = 0.005), and better overall performance on global rating scales (1.3 ± 0.2–3.9 ± 0.2 points, p < 0.001). 7 Shah and Darzi reported similar improvements in trainees following 10 trials on the UroMentor, with shorter task completion times (4.89 minutes vs 2.33 minutes, p = 0.005) and improved lesion identification (8.0 vs 9.2, p = 0.05). 8 Bach and colleagues developed a “homemade” transurethral resection of the prostate (TURP) simulator composed of 7 cm of a 30F garden hose, a suprapubic tube, a 1500 cc Tupperware container, three Foley catheter plugs, and a small tube of silicone gel, with an equipment cost totalling $40 U.S. 9 Investigators observed that the participants performed the task of flexible cystoscopy in approximately half (50.96%) of the original time taken by the end of the study. 9
Upper tract stone management
Semirigid ureteroscopy
Brehmer and Swartz evaluated the effectiveness of bench model teaching in hands-on courses on cystoscopy and semirigid ureteroscopy. 10 Following course completion, procedural confidence (0.46/2–1.88/2), global scores (2.6/9–7.8/9), total scores (7.7/19–17.2/19), and task-specific checklist for ureteroscopy (5.1/10–9.2/10) all improved. Matsumoto et al. demonstrated that training on low-fidelity models is a cost-effective alternative to learning basic ureteroscopy skills, with comparable global rating scores (p = 0.08), checklist scores (p = 0.17), and task completion times (p = 0.80) as high-fidelity models. 11 Both training models outperformed didactic teaching.
Flexible ureteroscopy
Several modalities have been explored to teach flexible ureteroscopy in trainees composed of staff, residents, and medical students. For bench models, Hu and colleagues developed a wet lab model using isolated kidneys and ureters in vitro. 5 Participants composed of 20 staff had to perform ureteroscopy and look for a foreign body randomly placed in the kidneys and identify the exact renal calix. Researchers observed that operative time (18 ± 3.4–11 ± 1.2 minutes; p < 0.05), global rating scores (9.3 ± 0.8–28.7 ± 1.1; p < 0.05), and pass rates (30%–95%; p < 0.05) improved by the study's completion. 5 In addition, Ganesamoni et al. demonstrated that training on the Endo-Urologie-Modell (Karl Storz) resulted in significant improvements in global rating scores (19.1 ± 3.8–25.2 ± 6.1, p = 0.002) of both medical students and residents. 12 They show the value of receiving active mentoring during training, with better global rating scores (29.5 ± 3.5 vs 25.2 ± 6.1, p < 0.02) and task completion times (213 ± 55 seconds vs 342 ± 135 seconds, p < 0.001) than those without mentoring.
For virtual reality simulators, Knoll et al. conducted a study on 20 staff urologists and 5 junior residents to monitor their improvement in flexible ureteroscopy training on the UroMentor (Simbionix). Participants were evaluated based on total operation time, guidewire insertion time, time of progression from orifice to the stone, stone contact time, number of perforations, bleeding events, laser misfiring, scope damage, and treatment success. The staff with greater experience in flexible ureteroscopy (>40 ureteroscopies) were faster than those with less experience (<40 ureteroscopies). The junior residents who received training on the URO Mentor were faster than the untrained residents. This study demonstrated that ureteroscopy-naive trainees had shorter operating times (p < 0.05) in their first four live ureteroscopy cases in the operating room following 10 training sessions on the URO Mentor (Simbionix). 13
Regarding curricular design, Huri et al. developed a program incorporating didactic lectures, video presentations, anatomy dissection, and hands-on training using cadavers. 14 After course completion, trainees with no prior flexible ureteroscopy experience demonstrated shorter procedure times on cadavers (3.56 ± 2.0–1.76 ± 1.54 minutes, p = 0.008). Soria et al. developed a retrograde intrarenal surgery (RIRS) training course to train a group of urologists with no prior experience in RIRS. 15 The course was composed of three modules, including theoretical knowledge, bench model practice, and training on live porcine models. At the end of Module 1, all trainees passed the theoretical test. Participants observed an increase in skill by 43.89% for global rating scores and 29.66% for RIRS checklist. 15 Furthermore, it was measured that 73.4% of trainees were able to increase their endourologic skills by over 40%. A 2016 study by Lee et al. developed a simulation-based training curriculum for medical students composed of didactic lectures, hands-on session of ureteroscopic instrumentation and skills, and practice sessions using dry-lab models. 16 Time to completion (23.9 ± 3.7–20.3 ± 3.4 minutes, p < 0.001), as well as performance score, improved (7.97 ± 1.44–11.81 ± 2.68, p = 0.001). Furthermore, they found that earlier expert feedback before individual practice sessions resulted in shorter task completion times (19.2 minutes vs 21.5 minutes, p < 0.01) and better performance scores (13.1 vs 10.5, p < 0.01) compared to delayed feedback. 16 They found similar benefits for task completion times (15.2 minutes vs 9.1 minutes, p < 0.0001) of early expert feedback for residents, with significantly higher mean postcourse performance scores (25.7 vs 22.5, p = 0.05). 17
Percutaneous nephrolithotomy
A study by Karagozlu et al. in 2016 used a simulator to teach a group of urologists and pediatric surgeons without percutaneous nephrolithotomy (PCNL) experience. 18 They used a box trainer with two open sides and radiopaque pelvicaliceal system tools to train participants in percutaneous renal puncture. They noted that procedure time (183–45 seconds; p < 0.001), fluoroscopy time (77–15 seconds; p < 0.001), and puncture attempts (4–1; p < 0.001) improved. Knudsen et al. 19 found that training on a PCNL simulator, consisting of percutaneous renal puncture followed by introduction of a guidewire into the collecting system, resulted in better global rating scores, total procedure time, fluoroscopic time, and attempted fewer needle punctures and caused fewer blood vessel injuries than the untrained group (p < 0.001). For lower pole access and ureteral guidewire insertion, Papatsoris et al. 20 found that a 2-hour simulator training session resulted in improvements in procedural time (15.8 ± 7.8–6.49 ± 3.17 minutes, p < 0.01), duration of radiation exposure (3.97 ± 1.86–2.25 ± 1.62 minutes, p < 0.01), radiation dose (4.13 ± 2.05–2.33 ± 1.64 mSV, p < 0.01), correct caliceal puncture (77.7%–91.6%, p < 0.01), and time to correct caliceal access (10.82 ± 9.16–4.48 ± 5.55 minutes, p < 0.05).
Transurethral resection
Photoselective vaporization of the prostate (PVP)
Two studies used simulation to train groups of residents and medical students in PVP. Aydin et al. used a GreenLight simulator composed of part-task training modules (short two five-minute videos on anatomy identification, sweep speed, tissue-fiber distance, power settings, and coagulation) and case modules consisting of six clinical cases of BPH. 21 They found significant improvements in performance in all modules: anatomy identification (p < 0.001), sweep speed (p < 0.001), bleeding coagulation (p < 0.001), and power settings (p = 0.007). A reduction in operative time (p < 0.001) and operative error (p = 0.017) was observed. 17 A 2014 study by Angulo used a virtual reality simulator for training PVP. 21 In this study, participants were able to practice prostate vaporization with diode laser. They were trained in all aspects of the procedure, including anatomical margin identification, equipment operation, coagulation, and carrying out safe examination at the conclusion of the procedure. 22 The investigators observed that for a 55 g prostate, learning curve was stable after 4 procedures; for a 70 g prostate, learning curve was stable after 10 procedures. Overall, the simulators were able to demonstrate good functional task alignment, with realistic vaporization and instrumentation. 22 The graphics and haptics of the virtual reality simulator were considered to be realistic and helpful for participants gaining comfort with the procedure.
Transurethral resection of the prostate
A study by He et al. used the TURPSim simulator for training a group of urologists. 23 Participants were evaluated objectively by the simulator based on their performance during the TURP. They observed an increase in global rating scores (18.0 ± 4.0 vs 12.4 ± 4.2, p < 0.001), decrease in the rate of capsule resection (26.3% ± 0.6% vs 21.2% ± 0.4%, p < 0.001), decrease in blood loss (125.8 ± 86.3 vs 83.7 ± 41.6 mL, p < 0.001), and decrease in external sphincter injury (3.6 ± 2.9 vs 2.0 ± 2.0, p < 0.001). 23 A 2010 study by Kallstrom and coworkers evaluated the effectiveness of the PelvicVision TURP simulation model. 24 They noted that there was an improvement for students overall, and improvement was observed for the staff with respect to total operative time, resection speed, completeness, irrigation fluid used, and checklist sources. Students noted that the procedure became easier over time. 24
Transurethral resection of the bladder tumor (TURBT)
Kruck et al. evaluated the effectiveness of a virtual reality simulator to teach transurethral resection of the bladder tumor (TURBT), as well as the effectiveness of photodynamic diagnostic (PDD), in a group of residents and staff. 25 Participants were introduced to the theory underlying the simulator and the principles of TURBT using instruction. Afterward, they were exposed to hands-on training on the simulator involving detection, resection of bladder tumors, and management of bleeding. The novice group completed five standardized 5-minute resection scenarios, while the expert group of TURBT-trained urologists was evaluated using the virtual reality PDD-TURBT program using white light visualization with and without PDD assistance. In the novice group, after five practice scenarios: bladder inspection rates (36.8% ± 12.9%–54.3% ± 7.3%; p = 0.0008) and tumor resection rate improved (26.5% ± 9.6%–52.0% ± 6.0%; p < 0.0001). 25 In the expert group, after five practice scenarios with PDD view: bladder inspection rates (52.2% ± 3.4%–62.7% ± 4.3%; p = 0.003) and tumor resection rate (43.8% ± 3.3%–57.1% ± 5.3%; p = 0.002) improved. Overall, participants found that this simulator improved their understanding of the anatomy involved. 25
Other
Endoscopic correction of vesicoureteral reflux
Soltani and colleagues studied the utility of a porcine bladder model in training a group of residents and fellows to perform endoscopic correction of vesicoureteral reflux. 26 The simulator consisted of a dissected ex vivo porcine bladder with distal ureters and urethra secured. First, participants were exposed to a short video outlining the steps of the procedure. They were then presented with a demonstration, followed by a 2-hour hands-on training of the simulator. This study evaluated the technique of administering two injections on the simulation bladder without assistance. 26 Trainee scores improved from 56% to 92% (p = 0.008). Specific areas showing significant improvement included: identification of ureteral orifices (Δ = 33%, p = 0.025), ureteral orifice hydrodistention (Δ = 33%, p = 0.010), first and second injection administration technique (Δ = 46%, p = 0.006; Δ = 46%, p = 0.006), as well as location (Δ = 50%, p = 0.041), size (Δ = 50%, p = 0.004), and depth of the mound (Δ = 58%, p = 0.013) after injection. 26
Discussion
In the modern era of surgical education, there continues to be push for skill acquisition outside the immediate clinical space. This has distinct advantages by increasing the volume of exposure obtained by surgical trainee while limiting patient exposure to novices. Training programs and trainees must adapt to increased competition for limited learning opportunities due to reduced work hours, emphasis on reducing medical errors, hospital regulations, and limited funding. This is all in an era where medical technology is advancing faster than it ever has, necessitating practicing surgeons to learn these novel procedures. The resultant trickle-down effect on trainees even further accentuates their reduced exposure to learning opportunities in the clinical space. It is also important to note that of 2236 articles identified, only 22 qualified for this systematic review. The low inclusion rate is a reflection of the reality that we do not have sufficient evidence across all of these procedure types. This article highlights the fact that we need to study not just the development, face validation, and practicality of various study methods. It is imperative to study the impact that these interventions have in a real-world clinical setting.
Educational interventions highlighted in this review aim to address this gap in endourology and are categorized into virtual reality-based simulators, bench models, and training courses. However, each type of intervention carries limitations as outlined in Table 1. Physical models often lack intrinsic quantitative methods of evaluating performance; they are often single- or limited use and lack human physiology such as bleeding and respiration. In contrast, while virtual models address some of the shortcomings of physical models with detailed performance reports and simulated scenarios, they lack haptic feedback and latency. Course-based interventions are logistically challenging, resource intense, and thus not easily accessible.
A notable limitation in the current literature is the paucity of evidence correlating technical performance with clinical performance and outcomes. Only one of the included studies demonstrates true technical improvements in the operating room. 13 This issue is prevalent in other aspects of urologic surgical education, including robot-assisted surgery. 27 For educational interventions to be worth widespread implementation, they should be associated with clinically meaningful improvements in performance. In general surgery, Williams et al. demonstrated that general surgery trainees achieve similar adenoma detection rates as gastroenterologists following an endoscopy simulation curriculum. 28
As we continue to shift surgical education outside of the operating room, we must also set standardized benchmarks for simulation training. Von Websky et al. suggested that peer-group derived metrics should form a fundamental part of basic skill training before advancing to more sophisticated simulators. 29 Therefore, achieving these metrics will help trainees reach milestones and aid with surgical skill acquisition. Rashid et al. identified different TURP resection strategies between novices, residents, and experts based on amount of prostate resected, blood loss, irrigation volume used, foot pedal use, and time spent with orientation, cutting, or coagulation. 30 An evolution in strategy and predefined metrics, such as adhering to a systematic approach, less orientation time, adequate hemostasis, and judicious use of irrigation fluid may therefore serve as surrogate markers for attaining TURP proficiency. For accreditation, the Japanese Urologic Association and Japanese Society of Endourology and Extracorporeal Shock Wave Lithotripsy developed the Endoscopic Surgical Skill Qualification System to evaluate competence in urologic laparoscopic skills. 31 Using video assessments of laparoscopic nephrectomies and adrenalectomies, applicants were certified based on specific intraoperative errors, with a 66% pass rate.
Equally important is the necessity of implementing curricula or simulation to achieve proficiency in nontechnical skills (NTS) in endourologic procedures. Brewin and colleagues created a full immersion TURP simulation to develop technical and NTS using the Bristol TURP simulator, scrub nurse, and anesthetist. 32 This demonstrated multiple elements of assessment validity, but the impact on technical performance was not studied. In robot-assisted surgery, the NASA Task Load Index is the most widely studied NTS assessment tool, but validity evidence supporting other more clinically relevant tools is limited. 33 NTS training can also be extended into endourologic crises, such as bladder perforation, and TUR syndrome. Such simulation of high stress scenarios can improve team dynamics and educational competencies. 34 Goldenberg et al. developed a simulation-based exercise to access technical and NTS of urology residents during a vasovagal response to pneumoperitoneum and inferior vena cava injury during laparoscopic surgery. 35 The authors found that level of training correlated with technical and nontechnical performance.
With a plethora of educational interventions available, the next step is to determine how these modalities can be effectively integrated into the urology residency program. Key questions that need to be addressed include which trainees are best suited for each intervention, what educational interventions should be used, what competencies should be addressed, and when trainees should be approved to perform a clinical activity. The feasibility of a centralized curriculum for simulation has been demonstrated by Khan et al. with excellent validity and participant satisfaction. 36 It would be valuable to study how an integrated endourologic curriculum composed of simulation and models highlighted from this systematic review would impact trainee performance and clinical outcomes.
Conclusion
Simulation-based exercises continue to play a prominent role in urologic training. The educational interventions highlighted in this review address various aspects of endourology, from stone management to transurethral resection. Additional work is needed to correlate technical performance in nonclinical settings with patient outcomes and developing an effective approach to NTS training.
Footnotes
Authors' Contributions
Conception and design: I.A. and M.G.G. Collection and assembly of data: I.A. and J.C.C.K. Data Analysis and interpretation: I.A. and J.C.C.K. Article writing: I.A., J.C.C.K., T.C., J.Y.L., and M.G.G. Final Approval of article: I.A., J.C.C.K., T.C., J.Y.L., and M.G.G.
Author Disclosure Statement
No competing financial interests exist.
Funding Information
No funding was received for this article.
Abbreviations Used
Explanation of the Components of the Medical Education Research Study Quality Instrument
| MERSQI domain | Criteria and scores |
|---|---|
| Type of study | Single cohort survey or postintervention: 1 |
| Single cohort pre- and postintervention: 1.5 | |
| Nonrandomized, 2 cohorts: 2 | |
| Randomized controlled trial: 3 | |
| Sampling: number of institutions of participants | 1 institution: 0.5 |
| 2 institutions: 1 | |
| ≥3 institutions: 1.5 | |
| Sampling: response rate of participants | Not applicable |
| <50% or not reported: 0.5 | |
| 50%–74%: 1 | |
| ≥75%: 1.5 | |
| Type of data | Subjective, assessed by participant: 1 |
| Objective, assessed by observer: 3 | |
| Validity evidence of evaluation instrument scores | Not applicable |
| Content (using theory, guidelines, experts, existing tools): 1 | |
| Internal structure (reliability, factor analysis): 1 | |
| Relationship (expert vs novice, correlation with other variables): 1 | |
| Data analysis: sophistication | Descriptive (i.e., frequency, mean, median): 1 |
| Beyond descriptive (any statistical inference): 2 | |
| Data analysis: appropriate | Analysis appropriate for study and type of data: 1 |
| Outcome | Satisfaction, attitudes: 1 |
| Knowledge, skills: 1.5 | |
| Behavior in clinical setting: 2 | |
| Patient outcome: 3 |
Adapted from Cook and Reed. 4
