Skip to main content

Surgical resident experience with common bile duct exploration and assessment of performance and autonomy with formative feedback

Abstract

Background

Common bile duct exploration (CBDE) is safe and effective for managing choledocholithiasis, but most US general surgeons have limited experience with CBDE and are uncomfortable performing this procedure in practice. Surgical trainee exposure to CBDE is limited, and their learning curve for achieving autonomous, practice-ready performance has not been previously described. This study tests the hypothesis that receipt of one or more prior CBDE operative performance assessments, combined with formative feedback, is associated with greater resident operative performance and autonomy.

Methods

Resident and attending assessments of resident operative performance and autonomy were obtained for 189 laparoscopic or open CBDEs performed at 28 institutions. Performance and autonomy were graded along validated ordinal scales. Cases in which the resident had one or more prior CBDE case evaluations (n = 48) were compared with cases in which the resident had no prior evaluations (n = 141).

Results

Compared with cases in which the resident had no prior CBDE case evaluations, cases with a prior evaluation had greater proportions of practice-ready or exceptional performance ratings according to both residents (27% vs. 11%, p = .009) and attendings (58% vs. 19%, p < .001) and had greater proportions of passive help or supervision only autonomy ratings according to both residents (17% vs. 4%, p = .009) and attendings (69% vs. 32%, p < .01).

Conclusions

Residents with at least one prior CBDE evaluation and formative feedback demonstrated better operative performance and received greater autonomy than residents without prior evaluations, underscoring the propensity of feedback to help residents achieve autonomous, practice-ready performance for rare operations.

Background

Choledocholithiasis occurs in approximately 8–20% of patients with cholelithiasis and confers increased risk of potentially life-threatening cholangitis and biliary pancreatitis. Patients with choledocholithiasis typically undergo a two-stage approach of cholecystectomy and pre- or postoperative endoscopic retrograde cholangiography (ERC) [1]. Alternatively, a single- stage approach of a laparoscopic common bile duct exploration (CBDE) performed in conjunction with laparoscopic cholecystectomy is associated with greater technical success rates [2, 3], decreased hospital length of stay [2, 4], and fewer retained stones at follow-up evaluation [5], and similar rates of morbidity and mortality compared with the two-stage approach [6, 7].

Despite its value, many surgeons are uncomfortable performing CBDE, and its use is declining in the USA [8]. As a consequence, surgical resident exposure to CBDE is declining [9]. This trend is consistent with evidence that graduating residents are performing fewer cases than in previous eras and have lower confidence in their operative abilities [10,11,12]. Feedback, of which there are many forms, has been shown to improve procedure performance in multiple specialties and levels of training [13]. Given the reduction in CBDE exposure, it is increasingly important to benchmark the operative abilities of general surgery residents in performing CBDE and to maximize the odds that trainees will achieve autonomous, practice-ready performance.

Using validated operative performance and autonomy metrics reported by both residents and attending surgeons for 189 laparoscopic or open CBDEs performed at 28 institutions, this study tests the hypothesis that receipt of one or more prior CBDE operative performance assessments, accompanied by formative feedback, is associated with greater resident performance and autonomy.

Methods

Study design and data source

This observational study used an existing multicenter dataset maintained by the Society for Improving Medical Professional Learning (SIMPL) collaborative, which maintains a database of resident and attending surgeon workplace-based assessments of resident operative performance and autonomy, in association with case complexity and verbal feedback from attendings, for a wide range of surgical procedures (described in detail at: https://www.simpl.org/simpl, accessed 6 September 2022). Residents and attendings use a smartphone app to generate these assessments, and attending surgeons who complete the assessments have the option of dictating feedback. With increasing interest in competency-based performance assessment in surgical education, the advent of a smartphone workplace-based assessment application from SIMPL has offered opportunities to better understand the progression of resident operative performance and autonomy [14].

The SIMPL dataset for this study included trainee postgraduate year, procedure type, resident and attending ratings of operative performance and autonomy, resident and attending ratings of case complexity, and attendings’ dictated feedback and debriefing narratives. We included evaluations for CBDEs (n = 189) that did not involve hepaticojejunostomy or choledochoenterostomy and then divided assessments into two groups: cases in which the resident had no prior CBDE evaluations (N = 141) and cases in which the resident had one or more CBDE evaluations (N = 48). The University of Florida Institutional Review Board approved this study (IRB# 202,200,845). This study was performed in accordance with the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines (see Additional File 1: Table S1).

Operative performance, autonomy, and complexity assessments

SIMPL allows trainees and attending surgeons to assess trainee performance on five ordinal variables: critical deficiency, inexperienced with procedure, intermediate performance, practice-ready performance, and exceptional performance [14, 15]. Resident autonomy is quantified by the Zwisch Scale, consisting of four additional ordinal levels: show and tell, active help, passive help or supervision only [14, 16, 17]. The Zwisch Scale has been shown to be a valid and reliable way to differentiate faculty guidance levels provided from which to infer resident autonomy [14, 16, 17]. Operative cases were assessed by trainee and attending surgeon perception of case complexity relative to similar procedures: easiest third, average complexity, or hardest third [14]. Global scores for case complexity, performance, and autonomy were calculated as the mean between resident and attending assessments of each outcome with equal weight given to both evaluators.

Sentiment analysis

This study builds on previous work with Natural Language Processing (NLP) and applies advanced sentiment analysis techniques using deep learning methods to assess the degree of positivity or negativity in dictated feedback. Operative performance assessments by attendings were analyzed by NLP techniques, which allowed for systematic evaluation of dictated feedback to understand overall sentiment. Sentiment analysis is a subtask of NLP encompassing the extraction of opinions, evaluations, attitudes, and emotions from written language [18]. This methodology offers the potential for greater contextual understanding of language compared with classical approaches to sentiment analysis which utilize linguistic analysis or rule-based phrasing match against predefined positive and negative work lists [19].

Machine learning models are pre-trained on large bodies of generalized human language (e.g., Wikipedia) and then fine-tuned on a task-specific dataset, which generate performance advantages for multiple NLP tasks [20,21,22]. We completed sentiment analysis of SIMPL dictations using Python version 3.8.8, the PyTorch machine learning framework, the Huggingface library, and an established deep learning model that was fine-tuned on the Stanford Sentiment Treebank v2 [23] dataset of movie reviews for predicting positive and negative sentiment from language [24, 25]. To address limitations in the pre-trained model, dictations were truncated to 128 tokens [26]. For each transcription, the deep learning model predicted a sentiment label (positive or negative) and a corresponding sentiment score ranging from 0.5 to 1.0. For dictations expected to be negative, sentiment scores were subtracted from 1.0 so that all transcription scores ranged from 0 (most negative) to 1 (most positive). Examples of positive and negative sentiment language in dictations can be found in Additional File 2: Table S2.

Subgroup analysis of laparoscopic cases

The primary analysis of all CBDE cases demonstrated that there was a significantly greater proportion of laparoscopic cases performed by residents with prior CBDE evaluations. Therefore, we performed a subgroup analysis of laparoscopic CBDE cases, and repeated all elements of the primary analysis in this subgroup analysis of 46 cases of laparoscopic CBDE in which the resident had a prior evaluation versus the 100 cases of laparoscopic CBDE in which the resident had no prior evaluations. A visual flowchart of laparoscopic and open cases is illustrated in Additional File 3: Fig. S1.

Statistical analyses

Categorical variables were compared by Fisher’s exact test and presented as frequencies with percentages. Continuous variables were compared by the Kruskal–Wallis test and presented as median values with interquartile ranges (IQRs). All statistical tests were 2-sided with an alpha of 0.05. Given the lack of prior studies testing our hypothesis, all analyses were performed in an exploratory fashion. All primary analysis outcomes (i.e., those listed in Table 2 and the p values reported in Figs. 1, 2, and 3) were adjusted for multiple comparisons by performing Benjamini–Hochberg corrections.

Fig. 1
figure 1

Case complexity for common bile duct explorations was similar between cohorts in which the resident had no prior cases with formative feedback versus one or more prior cases with formative feedback. Long dashes represent the median value. Short dashes represent the 25th and 75th percentiles

Fig. 2
figure 2

Operative performance was greater among residents who had performed one or more prior cases of common bile duct exploration with formative feedback compared with residents who had no prior cases with formative feedback. Long dashes represent the median value. Short dashes represent the 25th and 75th percentiles. For “Show and tell” cases in which the attending performs the case, there was no resident performance rating

Fig. 3
figure 3

Operative autonomy was greater among residents who had performed one or more prior cases of common bile duct exploration with formative feedback compared with residents who had no prior cases with formative feedback. Long dashes represent the median value. Short dashes represent the 25th and 75th percentiles

Results

Case characteristics and complexity

We analyzed performance and autonomy ratings for 189 CBDEs performed by 92 residents who were evaluated by 64 attending surgeons at 28 institutions. Of 189 CBDEs, 141 (75%) had no prior CBDE performance assessments for that resident and 48 (25%) had at least one prior CBDE performance assessment, including 31 with one prior assessment, ten with two prior assessments, four with three prior assessments, and one with five prior assessments. In 12 of those 48 cases (25%), the resident was performing a CBDE with the same attending that had previously provided a CBDE evaluation for the resident. Of 189 CBDEs, 146 (77%) were laparoscopic and 43 (23%) involved an open approach. Residents with at least one prior CBDE evaluation had a greater proportion of laparoscopic CBDEs compared with the group without prior evaluations (96% vs. 71%, p < 0.001). Resident assessments of case complexity were similar between cases with and without prior evaluations; for attending assessments, there was a greater proportion of hardest third complexity cases in the no prior evaluations group (41.8% vs. 20.8%, p = 0.009). Averaging resident and attending assessments to calculate global assessments of case complexity demonstrated that the distribution of case complexity scores was higher in the no prior evaluations group, though the difference was not statistically significant (p = 0.13, Fig. 1, Table 1).

Table 1 Common bile duct exploration case characteristics stratified by resident experience

Operative performance and autonomy

Resident and attending assessment of trainee performance and autonomy were compared between residents with one or more prior evaluations and residents without prior evaluations (Table 2). Attending and trainee assessments were averaged to calculate global assessments of operative performance and autonomy (Figs. 2 and 3). Residents with at least one prior CBDE evaluation had a greater proportion of practice-ready or exceptional performance ratings. Only 15 of 151 (11%) residents with no prior CBDE evaluations rated themselves as practice-ready or better, while 13 of 48 (27%) residents with at least one prior CBDE evaluation rated themselves as practice-ready or better (p = 0.02). By contrast, attendings rated 27 of 141 (19%) residents with no prior CBDE evaluations as practice-ready or better, and rated 28 of 48 (58%) residents with at least one prior CBDE performance assessment as practice-ready or better (p < 0.001). Globally, one or more prior evaluations were associated with significantly greater operative performance (p < 0.001), as illustrated in Fig. 2.

Table 2 Resident operative performance and autonomy during common bile duct exploration stratified by resident experience

Residents with at least one prior CBDE evaluations were granted more operative autonomy than residents with no prior CBDE evaluations as assessed by both resident and attending (Table 2 and Fig. 3). Of 48 CBDEs performed by residents with at least one prior CBDE evaluation, 8 (17%) reported needing only supervision, rather than passive help, active help, or demonstration, compared to 6 of 141 (4.3%) CBDEs performed by residents with no prior CBDE evaluation (p = 0.02). Similarly, attendings reported only providing supervision for 11 of 48 (23%) residents with at least one prior CBDE evaluation, compared to 11 of 141 (8%) residents with no prior CBDE evaluation (p = 0.02). Globally, one or more prior evaluations were associated with significantly greater operative autonomy (p < 0.001), as illustrated in Fig. 3.

Dictated feedback

Dictated feedback was present for 87 of 189 (46%) of CBDE evaluation, with no significant difference between assessments for residents with no prior CBDE evaluation and residents with at least one prior CBDE evaluation (Table 2). Sentiment scores for dictated feedback were also comparable for residents with (median 0.8, IQR 0.0–1.0) and without (median 1.0, IQR 0.0–1.0) prior CBDE evaluations (p = 0.44).

Subgroup analysis of laparoscopic cases: case characteristics and complexity

LCBDE case characteristics, including resident postgraduate year, resident and attending gender, and case complexity, as evaluated by residents and attendings, were compared between residents with (N = 46) and without (N = 100) prior LCBDE evaluations (see Additional File 4: Table S3). Like results for all CBDEs, resident assessments of case complexity were similar between cases with and without prior evaluations; for attending assessments, there was a greater proportion of hardest third complexity cases in the no prior evaluations group (44.0% vs. 19.6%, p = 0.01). Averaging resident and attending assessments to calculate global assessments of case complexity demonstrated that the distribution of case complexity scores was higher in the no prior evaluations group (p = 0.05, see Additional File 5: Fig. S2).

Subgroup analysis of laparoscopic cases: operative performance and autonomy

In the subgroup of LCBDE cases, among residents with prior evaluations, 29% rated themselves as practice-ready, while only 11% of residents without prior evaluations evaluated themselves as practice-ready (p = 0.007, see Additional File 6: Table S4). Similarly, attendings rated 59% of residents with prior evaluations as being practice-ready or exceptional, while rating only 18% of residents without prior evaluations as practice-ready or exceptional (p < 0.001). Globally, one or more prior evaluations were associated with significantly greater operative performance (p < 0.001), consistent with the primary analysis (see Additional File 7: Fig. S3).

Residents with prior LCBDE evaluations were granted more operative autonomy, as assessed by both residents and attendings. Seventeen percent of residents with prior evaluations rated themselves as receiving supervision only during LCBDE (the greatest amount of autonomy), while only 6% of residents without prior evaluations rated themselves as receiving supervision only (p = 0.04, see Additional File 6: Table S4). Attendings rated 70% of residents with prior evaluations as receiving passive help or supervision only, while 34% of residents without prior evaluations experienced similar levels of autonomy (p = 0.02). Globally, one or more prior evaluations in LCBDE were associated with significantly greater operative autonomy (p < 0.001, see Additional File 8: Table S4).

Discussion

In this analysis of 189 CBDEs performed at 28 institutions, residents who had performed at least one prior CBDE with an evaluation demonstrated better operative performance and received greater autonomy than residents who had no prior CBDE evaluations. Nearly two-thirds of the residents in the prior evaluation group had only one prior evaluation, suggesting that the observed associations may be realized with minimal prior experience in the context of formative feedback; 75% of the residents in the prior evaluation group had never received formative feedback from the same attending, suggesting that gains in operative ability were transferrable between resident-attending teams. In this study, formative feedback was provided as numerical performance and autonomy assessments in all cases, as well as dictated feedback for almost half of all cases. Attendings (but not residents) rated CBDEs as more complex when the resident had no prior CBDE performance assessments, suggesting that attending surgeons perceived cases as more complex when the trainee is inexperienced. Sentiment analyses demonstrated that positivity in recorded, verbal feedback was similar between groups, suggesting that attending mood and positivity—which could be affected by resident experience level and performance—did not drive the observed associations between prior evaluations, performance, and autonomy. Finally, subgroup analyses demonstrated that prior CBDE experience with formative feedback was associated with greater performance and autonomy regardless of whether an open or laparoscopic approach was used.

Contributors to resident performance have long been a topic of interest in peer-reviewed literature. Prior studies have demonstrated an improvement in operative efficiency or error rate with formative feedback [13, 27,28,29]. Peer-reviewed literature on resident performance shows a positive association between postgraduate year and operative performance and autonomy [30]. In practice, surgeons with higher case volumes and experience are noted to have shorter duration of surgery and improved performance and patient outcomes [31, 32]. With respect to trainees, studies suggest that residents might be underprepared for independence with certain operations, especially when rarely performed [33, 34]. Despite the recent growth of more efficient forms of evaluation, such as via the SIMPL application, there have been no studies demonstrating improvement in operative performance and autonomy with operative evaluations and feedback.

There is limited trainee exposure to CBDE, with graduating general surgery residents performing one CBDE on average, and concern for declining operative autonomy for surgical residents [8,9,10,11,12, 35]. With these limitations, best practices for resident education and training become increasingly important. This study is unique as no study, to our knowledge, has shown that one or more prior receipt of a CBDE evaluation with formative feedback is associated with higher operative performance and autonomy. These data on CBDE learning curves provide an important quantitative benchmark against which other forms of experience, i.e., those not in the operating room, may be tested.

In recent years, the popularity of simulation courses to fill this educational gap has grown. Prior studies support the value of simulation, especially in the context of laparoscopic surgery [36,37,38,39]. There have been prior studies of the efficacy of simulation training in CBDE, yet it remains unclear whether CBDE skills gained in simulation transfer to the operating room [40,41,42]. Given the growing concern for resident case volumes and increased simulation opportunities, we suggest further evaluation of simulation in operative performance and autonomy, which may be assessed by SIMPL in a similar manner as done in this study.

Limitations

This study is agnostic to resident prior experience with CBDE and formative feedback that occurred outside the SIMPL app. We acknowledge that residents with no prior evaluations may have undocumented experience, and residents with prior evaluations may have more experience than reflected in the SIMPL app. This undocumented exposure is a significant confounder for this study, but it is not captured by the SIMPL app at this time. We have little reason to suspect that unmeasured prior experience and formative feedback would be different between cohorts, and any such unmeasured events would likely serve to dilute the strength of associations in our study and generate false negative results, rather than generate false positive associations. Similarly, there is a possibly of attendings preferentially selecting residents to perform CBDE when they have performed this procedure before, suggesting that final results are skewed towards residents with more experience than those not represented in this study. However, we have little reason to believe this would change the outcome that evaluations are associated with increased performance and autonomy, if anything, a population of more experience residents would be less likely to show improvement. Another limitation is that our statistical analysis does not differentiate between trainees with greater than one prior evaluation and trainees with just one prior evaluation; doing so would decrease the size of the prior evaluation cohort by more than one third, and in the absence of prior work with which to perform a power analysis, we favored the larger sample size. With this concession the sample size remains low, but large enough to detect statistically significant differences in the primary outcome of resident operative performance and autonomy. Attending prior experience was not measured, and it is feasible that less experienced attendings were more likely to be operating with less experienced residents, which could affect the results. Similarly, there is a possibly of attendings preferentially selecting residents to perform CBDE when they have performed this procedure before, suggesting that results are skewed towards residents with more experience than expected. Finally, the observational design of this study does not establish causality, and we hesitate to use causal inference methods in the context of several unmeasured but potentially important variables.

Conclusions

The single-stage approach to choledocholithiasis with CBDE is a clinically important procedure for which many surgical residents may not achieve practice-ready performance, due to limited case volumes. In this analysis of 189 CBDEs, residents who received one or more prior CBDE evaluations demonstrated better operative performance and experienced greater autonomy than residents with no prior CBDE evaluations. These findings underscore the value of feedback for helping surgical residents achieve autonomous, practice-ready performance for a rare operation.

Availability of data and materials

Data generated or analyzed during this study are included in this published article and its supplementary information files. Further information is available from the corresponding author on reasonable request.

Abbreviations

CBDE:

Common bile duct exploration

LCBDE:

Laparoscopic common bile duct exploration

ERC:

Endoscopic retrograde cholangiography

SIMPL:

Society for Improving Medical Professional Learning

References

  1. Wandling MW, Hungness ES, Pavey ES, Stulberg JJ, Schwab B, Yang AD, et al. Nationwide assessment of trends in choledocholithiasis management in the united states from 1998 to 2013. JAMA Surg. 2016;151(12):1125–30.

    Article  Google Scholar 

  2. Singh AN, Kilambi R. Single-stage laparoscopic common bile duct exploration and cholecystectomy versus two-stage endoscopic stone extraction followed by laparoscopic cholecystectomy for patients with gallbladder stones with common bile duct stones: systematic review and meta-analysis of randomized trials with trial sequential analysis. Surg Endosc. 2018;32(9):3763–76.

    Article  Google Scholar 

  3. Zhu HY, Xu M, Shen HJ, Yang C, Li F, Li KW, et al. A meta-analysis of single-stage versus two-stage management for concomitant gallstones and common bile duct stones. Clin Res Hepatol Gastroenterol. 2015;39(5):584–93.

    Article  Google Scholar 

  4. Bansal VK, Misra MC, Rajan K, Kilambi R, Kumar S, Krishna A, et al. Single-stage laparoscopic common bile duct exploration and cholecystectomy versus two-stage endoscopic stone extraction followed by laparoscopic cholecystectomy for patients with concomitant gallbladder stones and common bile duct stones: a randomized controlled trial. Surg Endosc. 2014;28(3):875–85.

    Article  Google Scholar 

  5. Ding G, Cai W, Qin M. Single-stage vs. two-stage management for concomitant gallstones and common bile duct stones: a prospective randomized trial with long-term follow-up. J Gastrointest Surg. 2014;18(5):947–51.

    Article  Google Scholar 

  6. Lu J, Cheng Y, Xiong XZ, Lin YX, Wu SJ, Cheng NS. Two-stage vs single-stage management for concomitant gallstones and common bile duct stones. World J Gastroenterol. 2012;18(24):3156–66.

    Article  Google Scholar 

  7. Dasari BV, Tan CJ, Gurusamy KS, Martin DJ, Kirk G, McKie L, et al. Surgical versus endoscopic treatment of bile duct stones. Cochrane Database Syst Rev. 2013;9:CD03327.

    Google Scholar 

  8. Baucom RB, Feurer ID, Shelton JS, Kummerow K, Holzman MD, Poulose BK. Surgeons, ERCP, and laparoscopic common bile duct exploration: do we need a standard approach for common bile duct stones? Surg Endosc. 2016;30(2):414–23.

    Article  Google Scholar 

  9. Warner RL, Coleman KC, Musgrove KA, Bardes JM, Borgstrom DC, Grabo DJ. A review of general surgery resident experience in common bile duct exploration in the ERCP era. Am J Surg. 2020;220(4):899–904.

    Article  Google Scholar 

  10. Drake FT, Horvath KD, Goldin AB, Gow KW. The general surgery chief resident operative experience: 23 years of national ACGME case logs. JAMA Surg. 2013;148(9):841–7.

    Article  Google Scholar 

  11. Bucholz EM, Sue GR, Yeo H, Roman SA, Bell RH Jr, Sosa JA. Our trainees’ confidence: results from a national survey of 4136 US general surgery residents. Arch Surg. 2011;146(8):907–14.

    Article  Google Scholar 

  12. Fonseca AL, Reddy V, Longo WE, Gusberg RJ. Graduating general surgery resident operative confidence: perspective from a national survey. J Surg Res. 2014;190(2):419–28.

    Article  Google Scholar 

  13. Trehan A, Barnett-Vanes A, Carty MJ, McCulloch P, Maruthappu M. The impact of feedback of intraoperative technical performance in surgery: a systematic review. BMJ Open. 2015;5(6): e006759.

    Article  Google Scholar 

  14. Bohnen JD, George BC, Williams RG, Schuller MC, DaRosa DA, Torbeck L, et al. The Feasibility of real-time intraoperative performance assessment with SIMPL (system for improving and measuring procedural learning): early experience from a multi-institutional trial. J Surg Educ. 2016;73(6):e118–30.

    Article  Google Scholar 

  15. Williams RG, Sanfey H, Chen XP, Dunnington GL. A controlled study to determine measurement conditions necessary for a reliable and valid operative performance assessment: a controlled prospective observational study. Ann Surg. 2012;256(1):177–87.

    Article  Google Scholar 

  16. DaRosa DA, Zwischenberger JB, Meyerson SL, George BC, Teitelbaum EN, Soper NJ, et al. A theory-based model for teaching and assessing residents in the operating room. J Surg Educ. 2013;70(1):24–30.

    Article  Google Scholar 

  17. George BC, Teitelbaum EN, Meyerson SL, Schuller MC, DaRosa DA, Petrusa ER, et al. Reliability, validity, and feasibility of the Zwisch scale for the assessment of intraoperative performance. J Surg Educ. 2014;71(6):e90–6.

    Article  Google Scholar 

  18. Liu B. Sentiment analysis and opinion mining: Morgan & Claypool; 2012.

  19. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I. Attention is all you need. In: Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA, 2017. p. 6000–10.

  20. Qiu XP, Sun TX, Xu YG, Shao YF, Dai N, Huang XJ. Pre-trained models for natural language processing: a survey. Sci China Technol Sci. 2020;63(10):1872–97.

    Article  Google Scholar 

  21. Brown T, Mann B, Ryder N, Subbiah M, Kaplan JD, Dhariwal P, et al. Language models are few-shot learners. Adv Neural Inf Process Syst. 2020;33:1877–901.

    Google Scholar 

  22. Devlin J, Chang M-W, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:181004805. 2018.

  23. Socher R, Perelygin A, Wu J, Chuang J, Manning CD, Ng AY, et al. editors. Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the 2013 conference on empirical methods in natural language processing; 2013.

  24. Sanh V, Debut L, Chaumond J, Wolf T. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:191001108. 2019.

  25. Hinton G, Vinyals O, Dean J. Distilling the knowledge in a neural network. arXiv preprint arXiv:150302531. 2015;2(7).

  26. Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, et al. Huggingface's transformers: State-of-the-art natural language processing. arXiv preprint arXiv:191003771. 2019.

  27. Boyle E, Al-Akash M, Gallagher AG, Traynor O, Hill AD, Neary PC. Optimising surgical training: use of feedback to reduce errors during a simulated surgical procedure. Postgrad Med J. 2011;87(1030):524–8.

    Article  Google Scholar 

  28. Boyle E, O’Keeffe DA, Naughton PA, Hill AD, McDonnell CO, Moneley D. The importance of expert feedback during endovascular simulator training. J Vasc Surg. 2011;54(1):240-8.e1.

    Article  Google Scholar 

  29. Grantcharov TP, Schulze S, Kristiansen VB. The impact of objective assessment and constructive feedback on improvement of laparoscopic performance in the operating room. Surg Endosc. 2007;21(12):2240–3.

    Article  Google Scholar 

  30. Chen JX, Miller LE, Filimonov A, Shuman EA, Marchiano E, George BC, et al. Factors affecting operative autonomy and performance during otolaryngology training: a multicenter trial. Laryngosc Investig Otolaryngol. 2022;7(2):404–8.

    Article  Google Scholar 

  31. Blohm M, Sandblom G, Enochsson L, Cengiz Y, Austrums E, Abdon E, et al. Learning by doing: an observational study of the learning curve for ultrasonic fundus-first dissection in elective cholecystectomy. Surg Endosc. 2022;36(6):4602–13.

    Article  Google Scholar 

  32. Maruthappu M, Gilbert BJ, El-Harasis MA, Nagendran M, McCulloch P, Duclos A, et al. The influence of volume and experience on individual surgical performance: a systematic review. Ann Surg. 2015;261(4):642–7.

    Article  Google Scholar 

  33. Abbott KL, Krumm AE, Kelley J, Kendrick DE, Clark M, Chen X, et al. Surgical trainee performance and alignment with surgical program director expectations. Ann Surg. 2021;276:e1095.

    Article  Google Scholar 

  34. Abbott KL, Kwakye G, Kim GJ, Luckoski JL, Krumm AE, Clark M, et al. US general surgical trainee performance for representative global surgery procedures. Am J Surg. 2022;223(2):224–8.

    Article  Google Scholar 

  35. Kunac A, Oliver JB, McFarlane JL, Anjaria DJ. General surgical resident operative autonomy vs patient outcomes: are we compromising training without net benefit to hospitals or patients? J Surg Educ. 2021;78(6):e174–82.

    Article  Google Scholar 

  36. Ahlberg G, Enochsson L, Gallagher AG, Hedman L, Hogman C, McClusky DA 3rd, et al. Proficiency-based virtual reality training significantly reduces the error rate for residents during their first 10 laparoscopic cholecystectomies. Am J Surg. 2007;193(6):797–804.

    Article  Google Scholar 

  37. Fried GM, Feldman LS, Vassiliou MC, Fraser SA, Stanbridge D, Ghitulescu G, et al. Proving the value of simulation in laparoscopic surgery. Ann Surg. 2004;240(3):518–25.

    Article  Google Scholar 

  38. Scott DJ, Bergen PC, Rege RV, Laycock R, Tesfay ST, Valentine RJ, et al. Laparoscopic training on bench models: better and more cost effective than operating room experience? J Am Coll Surg. 2000;191(3):272–83.

    Article  CAS  Google Scholar 

  39. Wehbe-Janek H, Colbert CY, Govednik-Horny C, White BA, Thomas S, Shabahang M. Residents’ perspectives of the value of a simulation curriculum in a general surgery residency program: a multimethod study of stakeholder feedback. Surgery. 2012;151(6):815–21.

    Article  Google Scholar 

  40. Kemp Bohan PM, Connelly CR, Crawford J, Bronson NW, Schreiber MA, Lucius CW, et al. Early analysis of laparoscopic common bile duct exploration simulation. Am J Surg. 2017;213(5):888–94.

    Article  Google Scholar 

  41. Teitelbaum EN, Soper NJ, Santos BF, Rooney DM, Patel P, Nagle AP, et al. A simulator-based resident curriculum for laparoscopic common bile duct exploration. Surgery. 2014;156(4):880–7.

    Article  Google Scholar 

  42. Lv Y, Sun H, Qian Z, Mao W, Yao W, Ding C, et al. The effect of a simple simulator on the application of laparoscopic common bile duct exploration in a low volume center. Minerva Chir. 2020;75(4):260–5.

    Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

Dr. Loftus was supported by a K23 GM140268 from the National Institute of General Medical Sciences (NIGMS) and by the Thomas H. Maren Junior Investigator Fund. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The National Institute of General Medical Sciences had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. Dr. Balch was supported by the postgraduate training grant (T32 GM-08721).

Author information

Authors and Affiliations

Authors

Contributions

TJL, ACF, and GRU contributed to study design. TJL performed data collection. TJL, BS, and MQN performed the data analysis and drafted the manuscript. PWU, KLA, JAB, BCG, FDM, LC, PAE, BS, and GRU interpreted the data and provided critical revisions to the manuscript. All authors read and approved the final manuscript

Corresponding author

Correspondence to Tyler J. Loftus.

Ethics declarations

Ethics approval and consent to participate

This study was approved by The University of Florida Institutional Review Board (IRB# 202200845).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

: Table S1. Table which illustrates the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines

Additional file 2

: Table S2. Table which illustrates examples of positive and negative language in dictations

Additional file 3

: Fig. S1. Visual flowchart illustrating breakdown of cases, with 189 total cases performed with residents, 146 of which were laparoscopic and 43 were open. Of the 146 laparoscopic cases, 46 residents had one or more prior evaluations and 100 had no prior evaluations. Of the 43 open cases, 2 residents had one or more prior evaluations and 41 had no prior evaluations

Additional file 4

: Table S3. Table illustrating subgroup analysis data for laparoscopic common bile duct exploration case characteristics stratified by resident experience

Additional file 5

: Fig. S2. Figure illustrating that global case complexity for laparoscopic common bile duct explorations was higher in the cohort in which residents had no prior cases with formative feedback vs. one or more prior cases with formative feedback. Long dashes represent the median value. Short dashes represent the 25th and 75th percentiles.

Additional file 6

: Table S4. Table illustrating subgroup analysis data for resident operative performance and autonomy during laparoscopic common bile duct exploration stratified by resident experience

Additional file 7

: Fig. S3. Figure illustrating that global operative performance was greater among residents who had performed one or more prior cases of laparoscopic common bile duct exploration with formative feedback compared with residents who had no prior cases with formative feedback. Long dashes represent the median value. Short dashes represent the 25th and 75th percentiles.

Additional file 8

: Fig. S4. Figure illustrating that global operative autonomy was greater among residents who had performed one or more prior cases of laparoscopic common bile duct exploration with formative feedback compared with residents who had no prior cases with formative feedback. Long dashes represent the median value. Short dashes represent the 25th and 75th percentiles.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nyren, M.Q., Filiberto, A.C., Underwood, P.W. et al. Surgical resident experience with common bile duct exploration and assessment of performance and autonomy with formative feedback. World J Emerg Surg 18, 13 (2023). https://doi.org/10.1186/s13017-023-00480-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13017-023-00480-0

Keywords