Educational Role for an Advanced Suturing Task in the Pediatric Laparoscopic Surgery Simulator

Abstract

Background:

Laparoscopic models are recognized as important training tools. Lower fidelity systems are used mainly for simpler tasks; an advanced suturing task may allow for additional training of experts. The purpose of this study was to explore the educational role of an advanced suturing task using motion analysis and establish the task's construct validity.

Methods:

The pediatric laparoscopic surgery (PLS) simulator was customized with motion-tracking hardware and software. Participants were stratified by expertise, then performed an advanced task involving intracorporeal suturing in a vertical plane, with the suture passing superiorly to inferiorly. Traditional PLS scores were calculated, and motion was analyzed in the four degrees of freedom available in laparoscopic surgery (Pitch, Yaw, Roll, and Surge). Data were compared to historic results for a standard suturing task.

Results:

Sixty participants were recruited (8 novices, 13 intermediates, and 39 experts). Analysis of motion in all degrees of freedom allowed discrimination between participants based on expertise level. Compared with the standard task, PLS scores for the advanced task were significantly lower for intermediates and experts, and the number of extreme motion events was significantly higher, indicating that advanced task is more challenging. In addition, only 76.3% of experts, 76.9% of intermediates, and 37.5% of novices were able to successfully complete the advanced task.

Conclusions:

Performance of an advanced intracorporeal suturing task allowed discrimination of expertise level. The task's increased complexity may help hone laparoscopic technical skills, particularly among advanced performers, and even allow discrimination of psychomotor expertise within the traditional cohort of experts.

Introduction

Laparoscopic models are increasingly recognized as important tools in surgical training. While validation of new tasks using box trainers is regularly published, most focus on training residents, not experts.^1–7 We hypothesize that a sufficiently difficult task could present a significant challenge to experts and provide an opportunity to further develop the latter's laparoscopic skills.

The primary objectives of this study were to establish the construct validity for a more challenging intracorporeal suturing task and to explore expanding a box trainer's use to help upskill experts. Traditional construct validity is established when performance of a task allows discrimination between participants of differing levels of expertise. This methodology is felt to be adequate in assessment for educational purposes, although it may need to be more stringent for high stake assessment such as certification.⁸ While traditional metrics of time for task completion, precision, and ability to complete the task were explored in this study, the focus was on motion analysis, which allows real-time assessment. Analysis of motion has previously been validated in the pediatric laparoscopic surgery (PLS) simulator to assess performance of a simpler suturing task.⁹

To assist in upskilling experts, efforts were made to define a more challenging advanced task. We wanted to gain insight into the very definition of expertise and thoughtfully address factors that may help experts further improve their technical skills. To date, expertise—for the purpose of establishing construct validity—is generally ascribed according to self-reported caseloads⁵ or certification (medical student, resident, and staff),¹⁰ which presents inherent problems: verification, comparability and equivalence, and relevance. Current laparoscopic training systems, such as the fundamentals of laparoscopic surgery (FLS)¹¹ and PLS¹² certification programs, are aimed at establishing basic laparoscopic skills. As laparoscopic training and skills advance, determination of a less biased measure of expertise may aid for purposes of self-directed learning and assessment of psychomotor expertise.

Materials and Methods

The PLS simulator, a pediatric-sized trainer measuring 18 × 14 × 11 cm with an HD webcam, was fitted with custom-built motion tracking hardware and software consisting of tracking sensors on the inner surface of trocar ports (Fig. 1). Motion was assessed in the four degrees of freedom available in traditional laparoscopic surgery: Pitch, which is moving the hand up or down around a fixed point; Yaw, moving the hand side-to-side around a fixed point; Roll, which is pronation and supination; and Surge, which is pushing the hand in and pulling the hand out. Pitch, Yaw, and Roll are rotational degrees of freedom; Surge is a translational degree of freedom.

FIG. 1.

Pediatric laparoscopic surgery box trainer.

Participants were recruited at the 2013 International Pediatric Endosurgery Group (IPEG) meeting and were stratified into novice (<10 laparoscopic cases performed per year), intermediate (10–50 laparoscopic cases per year), and expert (>50 laparoscopic cases per year) groups based on self-reported caseloads. Participants performed an advanced intracorporeal suturing task, involving suturing in a plane perpendicular to the traditional axis of sewing, that is, passing the suture from superior to inferior instead of right to left, with the penrose drain placed in a vertical station instead of a more ergonomic horizontal one (similar to suturing during a Kasai procedure or choledochal cyst excision; Fig. 2).

FIG. 2.

Traditional intracorporeal suturing task (left column pictures) and advanced intracorporeal suturing task (right column pictures).

Traditional PLS scores were calculated using time (in seconds) and precision (proximity of needle passage to target, in mm). Three motion analysis parameters (MAPs), velocity, acceleration, and 90% range of motion, were recorded (see Construct validity). Performance was analyzed for successful task completion (unsuccessful tasks involved avulsion of the Penrose drain, only throwing a single knot on the first pass, or not completing the task). These data were compared to historic results for a simpler, traditional intracorporeal suturing task also performed on the PLS simulator.⁹

Construct validity

As a discrimination measure to determine construct validity, we defined three MAPs based on velocity, acceleration, and range of motion.⁹ We used the number of events measured that fall outside two standard deviations of a participant's overall velocity and acceleration data, referred to as “the number of extreme velocity and acceleration motion events” (Fig. 3).

FIG. 3.

Model of extreme motion events based on acceleration or velocity output from a motion tracking sensor during an intracorporeal suturing task. The thin line represents the participant's hypothetical mean. The broken lines hypothetically delineate all data lying within two standard deviations. The arrows indicate each hypothetical “event” where the participant goes beyond two standard deviations of their mean: we refer to these events as “extreme motion events.”

Velocity and acceleration extreme motion events have already been validated for the standard intracorporeal suturing task in an adult, as well as pediatric laparoscopic box trainer.^9,13 They allow discrimination based on “jerkiness” of movement: every time an operator makes a rough uncontrolled movement, this is represented by a sudden increase in the amplitude measurement of the motion sensor. The number of these extreme velocity and acceleration events is higher in the less experienced groups and decreases with increasing level of expertise. For the purpose of this study, analysis of motion was carried out using the participants' dominant hand.

Range of motion (90%) represents the range of movement made by each participant in the four degrees of freedom available. We used the 90% interval as a measure of the space used by each participant while completing the intracorporeal suturing task. The rationale was that by removing the extreme 10% of the range of motion, we would have an overall understanding of each participant's use of the space within the box trainer and determine whether this overall use of space allowed discrimination between levels of expertise.

The MAPs described above were measured for all candidates and cross-referenced to their level of expertise.

Statistical analyses

After the tracking data were compiled, we calculated the number of extreme velocity events, the number of extreme acceleration events, and the 90% range of motion in each of the four degrees of freedom available. Importantly, each standard deviation analysis was based on the individual participant's own pattern of motion and mean. Statistical analyses were performed using STATA software (StataCorp. 2013. Stata Statistical Software: Release 13. College Station, TX: StataCorp LP). Statistical significance was considered as a P < .05.

Analysis of variance (ANOVA) was done comparing the novice, intermediate, and expert data for the number of extreme velocity and acceleration events, as well as the range of motion (90%), in all four degrees of freedom. Post hoc analyses were then conducted to test whether data allowed for discrimination between groups.

Results

Sixty participants were recruited (8 novices, 13 intermediates, and 39 experts). Analysis of motion in the performance of an advanced task allowed discrimination between participants of varying levels of expertise in all four degrees of freedom using the velocity and acceleration (Table 1), but not using the range.

Table 1.

Dominant Hand Data Comparing Novice, Intermediate, and Expert Number of Extreme Velocity, Acceleration Events, and Range 90% in Four Motion Analysis Parameters

Dominant	Novice (N = 8) Mean ± SD	Intermediate (N = 13) Mean ± SD	Expert (N = 39) Mean ± SD	P
Pitch
Extreme velocity events	113.1 ± 83.0	82.2 ± 40.6	55.1 ± 27.2	.0013
Extreme acceleration events	180.3 ± 161.4	115.3 ± 50.3	84.0 ± 42.2	.0029
Range 90%	15.3 ± 3.5	14.2 ± 2.0	15.1 ± 2.1	.4244
Yaw
Extreme velocity events	174.3 ± 138.0	116.2 ± 63.9	79.4 ± 37.6	.0011
Extreme acceleration events	263.0 ± 206.5	186.2 ± 100.5	125.6 ± 57.0	.0015
Range 90%	10.5 ± 2.9	9.3 ± 1.7	9.5 ± 2.1	.4041
Roll
Extreme velocity events	56.8 ± 52.4	36.5 ± 21.7	23.3 ± 10.1	.0009
Extreme acceleration events	109.5 ± 88.5	80.8 ± 53.9	49.3 ± 22.6	.0013
Range 90%	2761.1 ± 2439.2	3242.9 ± 2876.4	3151.4 ± 3138.3	.9325
Surge
Extreme velocity events	91.5 ± 80.7	60.5 ± 32.3	42.6 ± 23.0	.0039
Extreme acceleration events	143.9 ± 122.8	97.8 ± 53.2	67.6 ± 33.5	.0029
Range 90%	5313.8 ± 1875.2	4138.8 ± 1191.8	4119.1 ± 1414.3	.1019

Failure rates for the advanced task were high: approximately a quarter of experts and intermediates were unable to successfully complete the task, while almost two-thirds of novices were unable to complete the task (Table 2). Given that participants who failed the task could not be assigned traditional PLS scores (decreasing the power of the PLS score analysis), no significant difference between the average PLS scores of different levels of expertise was noted with ANOVA or post hoc analysis. However, compared to the average PLS scores from the traditional PLS intracorporeal suturing task published in 2011,¹² average PLS scores for the advanced task were lower across all levels of expertise, reaching statistical significance for the intermediate and expert groups (Table 3).

Table 2.

Successful Task Completion for Advanced Task

	Successful	Unsuccessful ^a	Percent successful, %
Experts^b	29	9	76.3
Intermediates	10	3	76.9
Novices	3	5	37.5

(Avulsion, only single knot thrown first pass, task not completed).

One expert not included due to incomplete PLS data.

PLS, pediatric laparoscopic surgery.

Table 3.

Comparison of Average PLS Scores for the Standard Intracorporeal Suturing Task (from Azzie 2011 PLS J Peds Surg) Versus the Advanced Suturing Task

	Standard PLS Task		Advanced Task
	(Score ± SD)	n	(Score ± SD)	n	P value (unpaired t-test)
Novice	15 ± 22	20	13.9 ± 22.7	3	.943
Intermediate	70 ± 21	19	17.1 ± 62.7	10	.0267
Expert	77 ± 14	12	47.4 ± 28.8	29	.0001

PLS, pediatric laparoscopic surgery.

When comparing MAPs between the historic values for the traditional task to those for the more advanced suturing task, the number of extreme velocity and extreme acceleration events were significantly higher in the performance of the more advanced suturing task versus the standard task among intermediates and experts (P < .001 for all comparisons, using an unpaired two-tailed Student's t-test). This was true for velocity and acceleration in all degrees of freedom available in laparoscopic surgery (Pitch, Yaw, Roll, and Surge). This did not hold true for range of motion. For novices, there was a trend toward a greater number of extreme velocity and extreme acceleration events in the performance of the more advanced task compared to the traditional task in all degrees of freedom. These values reached statistical significance for extreme velocity events in the Pitch and Yaw, and for extreme acceleration events in the Yaw, Roll, and Surge (Appendix Table 1).

Discussion

Assessment of specific MAPs in the performance of an advanced intracorporeal suturing task allowed discrimination among novices, intermediates, and experts, thus establishing traditional construct validity for this model. The number of extreme velocity and extreme acceleration events, for all participants, in all degrees of freedom, were higher in the performance of this more advanced suturing task, indicating that experts, intermediates, and novices all struggled significantly more. The 90% range metric was not discriminatory.

The more challenging nature of this advanced task is also reflected by the fact that the average PLS scores were inferior for all three levels of expertise compared to the historic average PLS scores¹² obtained for the standard intracorporeal suturing task (Table 3), with the difference between the scores attaining statistical significance when comparing the expert and intermediate groups. Intuitively, it makes sense that the differences for novices were less marked between performances of the traditional and the advanced task because both tasks most challenged this group of participants: novices have a limited skill set given their lack of experience. The fact that only 76.3% of experts, 76.9% of intermediates, and 37.5% of the novices successfully completed the task also speaks to the greater technical challenge.

In terms of educational value, the consensus among experts who were involved in the testing was that the model mimicked the technical challenges in the laparoscopic management of a choledochal cyst. This made the model more relevant, thus adding to its educational value.

When comparing MAPs for different degrees of freedom in the standard intracorporeal suturing task, we had noted that pronation and supination (the Roll degree of freedom) was the most discriminatory. For the advanced task, all MAPs in all degrees of freedom were found to discriminate significantly: there were more extreme velocity and extreme acceleration events in the performance of the advanced task. This likely reflects the overall more complex nature of the task in all degrees of freedom. Performance of the advanced task involves operating in a vertical field (passing the suture in a superior to inferior vertical line through the Penrose drain), with fixed instrument insertion points in an incongruous horizontal plane. As mentioned above, this mimics the challenges in the laparoscopic approach to the reconstruction after excision of a choledochal cyst, or a Kasai portoenterostomy (Fig. 2).

As only a third of novices were able to successfully complete this advanced task and that among intermediates and experts the number of extreme motion events was significantly higher for this advanced task than for previously published extreme motion events in the standard task, this advanced task may be seen as being most useful for a cohort of participants with a preestablished critical level of technical skills (i.e., experts or intermediates). The complexity of the task may offer no advantage to novices, who may benefit more from focusing on the traditional task before moving on to a more complex one. We would therefore recommend this task for intermediates and experts who have already attained a certain laparoscopic proficiency, as a means to further hone their skills.

While one of the goals in this study was to explore the educational role of an advanced suturing task within the PLS simulator, the results highlight questions and reservations many have regarding the very definition of expertise, specifically as it pertains to psychomotor skills. Traditionally, this has been based on self-reported caseloads⁵ or certification (medical student, resident, and staff)¹⁰—or a combination of both.^14,15 It is intuitive that someone performing <10 minimal access surgery (MAS) cases annually is likely a novice. However, the difference between intermediates and experts is more subtle; and the threshold for calling a surgeon an expert is less clear. Some “intermediates” who perform 10–50 MAS cases annually exhibit expert MAS technique, while informal observation, video analysis, motion analysis, and PLS scores indicate that some experts perform at an intermediate level. This overlap of certain participants classified as intermediates and experts has been noted in other studies⁵ and highlights the ongoing challenge of defining expertise.^16,17 Other groups have proposed incorporating motion analysis as a performance-based tool to help designate psychomotor expertise.¹⁵

There are several limitations to this study. The comparison to performance parameters for the simpler task is historic. The investigators certainly observed the degree to which all participants struggled more with this new, advanced suturing task, but the metrics do not compare performances in a formal “head to head” manner. There is also no proof that performance will improve with practice of this advanced task, although mastering simple tasks through practice¹⁸ and moving on to more complex ones is the traditional paradigm for improving psychomotor skills. Finally, we acknowledge that the methodology used for validation was traditional and not keeping with that currently recommended by many for higher stake assessment. Given that our premise is the development of a tool for formative assessment and education rather than summative assessment and certification, most experts in the field agree that our traditional methodology is sufficient.⁸

Analysis of motion for an advanced, more complex intracorporeal suturing task has demonstrated construct validity for this model, which mimics the challenging portion of a reconstruction after excision of a choledochal cyst or a Kasai procedure in terms of suturing orientation and instrument coordination. Further study will help establish whether the model should be incorporated into a structured training and assessment program targeting technically more advanced participants.

Footnotes

Acknowledgments

The authors thank all surgeons, residents, and students who contributed their time to this study, for their valuable input and expertise.

Disclosure Statement

No competing financial interests exist.

Appendix

Appendix Table 1.

Comparison of Number of Extreme Motion Events Between the Standard and Advanced Intracorporeal Suturing Task

Dominant hand	Novice	Intermediate	Expert
Standard task, n	26	12	37
Advanced task, n	8	13	39

	P value comparing number of extreme events for the standard versus advanced task
Pitch
Extreme velocity events	.0798	.0006	<.0001
Extreme acceleration events	.1454	.0004	<.0001
Yaw
Extreme velocity events	.0344	.0004	<.0001
Extreme acceleration events	.041	.0001	<.0001
Roll
Extreme velocity events	.0646	.0003	<.0001
Extreme acceleration events	.0389	.0009	<.0001
Surge
Extreme velocity events	.0576	.0003	<.0001
Extreme acceleration events	.0451	.0002	<.0001

References

Oropesa

, Sanchez-Gonzalez

, Lamata

, et al. Methods and tools for objective assessment of psychomotor skills in laparoscopic surgery. J Surg Res, 2011; 171:e81–e95.

Oropesa

, Chmarra

, Sanchez-Gonzalez

, et al. Relevance of motion-related assessment metrics in laparoscopic surgery. Surg Innov, 2013; 20:299–312.

Hiemstra

, Chmarra

, Dankelman

, Jansen

. Intracorporeal suturing: Economy of instrument movements using a box trainer model. J Minim Invasive Gynecol, 2011; 18:494–499.

Rodriguez

, Sanchez-Ismayel

, Sanchez

, Pena

, Salamo

. Construct validity of an inanimate training model for laparoscopic appendectomy. JSLS, 2013; 17:445–449.

Retrosi

, Cundy

, Haddad

, Clarke

. Motion analysis-based skills training and assessment in pediatric laparoscopy: Construct, concurrent, and content validity for the eoSim simulator. J Laparoendosc Adv Surg Tech A, 2015; 25:944–950.

van Empel

, van Rijssen

, Commandeur

, et al. Validation of a new box trainer-related tracking device: The TrEndo. Surg Endosc, 2012; 26:2346–2352.

Bellorin

, Kundel

, Sharma

, Ramirez-Valderrama

, Lee

. Training model for laparoscopic Heller and Dor fundoplication: A tool for laparoscopic skills training and assessment-construct validity using the GOALS score. Surg Endosc, 2016; 30:3654–3660.

Ghaderi

, Manji

, Park

, et al. Technical skills assessment toolbox: A review using the unitary framework of validity. Ann Surg, 2015; 261:251–262.

Nasr

, Carrillo

, Gerstle

, Azzie

. Motion analysis in the pediatric laparoscopic surgery (PLS) simulator: Validation and potential use in teaching and assessing surgical skills. J Pediatr Surg, 2014; 49:791–794.

10.

Partridge

, Hughes

, Brennan

, Hennessey

. Accessible laparoscopic instrument tracking (“InsTrac”): Construct validity in a take-home box simulator. J Laparoendosc Adv Surg Tech A, 2014; 24:578–583.

11.

Peters

, Fried

, Swanstrom

, et al. Development and validation of a comprehensive program of education and assessment of the basic fundamentals of laparoscopic surgery. Surgery, 2004; 135:21–27.

12.

Azzie

, Gerstle

, Nasr

, et al. Development and validation of a pediatric laparoscopic surgery simulator. J Pediatr Surg, 2011; 46:897–903.

13.

Trudeau

, Nasr

, Carrillo

, Gerstle

, Azzie

. Construct validity and educational role for motion analysis in a laparoscopic trainer. Surg Endosc, 2015; 29:2491–2495.

14.

Schreuder

, van Hove

, Janse

, Verheijen

, Stassen

, Dankelman

. An “intermediate curriculum” for advanced laparoscopic skills training with virtual reality simulation. J Minim Invasive Gynecol, 2011; 18:597–606.

15.

Oropesa

, Sanchez-Gonzaez

, Chmarra

, et al. Supervised classification of psychomotor competence in minimally invasive surgery based on instruments motion analysis. Surg Endosc, 2014; 28:657–670.

16.

Sadideen

, Alvand

, Saadeddin

, Kneebone

. Surgical experts: Born or made?. Int J Surg, 2013; 11:773–778.

17.

Ericsson

. Deliberate practice and acquisition of expert performance: A general overview. Acad Emerg Med, 2008; 15:988–994.

18.

Fried

, Feldman

, Vassiliou

, et al. Proving the value of simulation in laparoscopic surgery. Ann Surg, 2004; 240:518–528.