SD-Net: joint surgical gesture recognition and skill assessment

Author(s):  
Jinglu Zhang ◽  
Yinyu Nie ◽  
Yao Lyu ◽  
Xiaosong Yang ◽  
Jian Chang ◽  
...  

Abstract Purpose: Surgical gesture recognition is an essential task for providing intraoperative context-aware assistance and for scheduling clinical resources. However, previous methods have limitations in capturing long-range temporal information, and many of them require additional sensors. To address these challenges, we propose a symmetric dilated network, SD-Net, to jointly recognize surgical gestures and assess surgical skill levels using only RGB surgical video sequences. Methods: We use symmetric 1D temporal dilated convolution layers to hierarchically capture gesture cues under different receptive fields, so that features over different time spans can be aggregated. In addition, a self-attention network is bridged in the middle to compute global frame-to-frame relations. Results: We evaluate our method on the robotic suturing task from the JIGSAWS dataset. On gesture recognition, the method outperforms the state of the art by up to ∼6 points in frame-wise accuracy and ∼8 points in F1@50 score. It also maintains 100% prediction accuracy on the skill assessment task under the LOSO validation scheme. Conclusion: The results indicate that our architecture obtains representative surgical video features by extensively considering the spatial, temporal and relational context of the raw video input. Furthermore, the better performance under multi-task learning implies that surgical skill assessment is complementary to the gesture recognition task.
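
The abstract describes an encoder of stacked temporal dilated convolutions, a self-attention bridge over frames, and a mirrored decoder feeding per-frame gesture and video-level skill heads. The following is a minimal PyTorch sketch of that general structure, not the authors' released code; layer widths, the number of dilation levels, the attention configuration and the three skill classes are illustrative assumptions.

```python
# Minimal sketch of a symmetric dilated temporal network with a self-attention bridge.
# Hyperparameters (channels, num_layers, heads, class counts) are assumptions.
import torch
import torch.nn as nn


class DilatedResidualBlock(nn.Module):
    """1D temporal dilated convolution with a residual connection."""

    def __init__(self, channels: int, dilation: int):
        super().__init__()
        self.conv = nn.Conv1d(channels, channels, kernel_size=3,
                              padding=dilation, dilation=dilation)
        self.relu = nn.ReLU()
        self.out = nn.Conv1d(channels, channels, kernel_size=1)

    def forward(self, x):  # x: (batch, channels, time)
        return x + self.out(self.relu(self.conv(x)))


class SymmetricDilatedNet(nn.Module):
    """Encoder with increasing dilation, self-attention over frames,
    decoder with mirrored (decreasing) dilation, and two task heads."""

    def __init__(self, in_dim: int, channels: int = 64,
                 num_layers: int = 5, num_classes: int = 10):
        super().__init__()
        self.proj = nn.Conv1d(in_dim, channels, kernel_size=1)
        self.encoder = nn.ModuleList(
            [DilatedResidualBlock(channels, 2 ** i) for i in range(num_layers)])
        self.attn = nn.MultiheadAttention(channels, num_heads=4, batch_first=True)
        self.decoder = nn.ModuleList(
            [DilatedResidualBlock(channels, 2 ** i)
             for i in reversed(range(num_layers))])
        self.gesture_head = nn.Conv1d(channels, num_classes, kernel_size=1)
        self.skill_head = nn.Linear(channels, 3)  # e.g. novice/intermediate/expert (assumed)

    def forward(self, frame_features):  # (batch, time, in_dim) per-frame features
        x = self.proj(frame_features.transpose(1, 2))
        for block in self.encoder:
            x = block(x)
        # Self-attention bridge: global frame-to-frame relations.
        a = x.transpose(1, 2)                      # (batch, time, channels)
        a, _ = self.attn(a, a, a)
        x = a.transpose(1, 2)
        for block in self.decoder:
            x = block(x)
        gestures = self.gesture_head(x)            # per-frame gesture logits
        skill = self.skill_head(x.mean(dim=2))     # video-level skill logits
        return gestures, skill


# Usage with dummy per-frame features: 2 videos, 300 frames, 128-dim features.
model = SymmetricDilatedNet(in_dim=128)
g, s = model(torch.randn(2, 300, 128))
print(g.shape, s.shape)  # torch.Size([2, 10, 300]) torch.Size([2, 3])
```

Exponentially increasing dilation rates give the encoder a receptive field that grows with depth, which is how such architectures aggregate features over different time spans without pooling away frame-level resolution.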

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Joël L. Lavanchy ◽  
Joel Zindel ◽  
Kadir Kirtac ◽  
Isabell Twick ◽  
Enes Hosgor ◽  
...  

Abstract Surgical skills are associated with clinical outcomes. To improve surgical skills and thereby reduce adverse outcomes, continuous surgical training and feedback are required. Currently, assessment of surgical skills is a manual, time-consuming process that is prone to subjective interpretation. This study aims to automate surgical skill assessment in laparoscopic cholecystectomy videos using machine learning algorithms. To this end, a three-stage machine learning method is proposed: first, a convolutional neural network was trained to identify and localize surgical instruments; second, motion features were extracted from the detected instrument localizations over time; third, a linear regression model was trained on the extracted motion features to predict surgical skill. This three-stage modeling approach achieved an accuracy of 87 ± 0.2% in distinguishing good versus poor surgical skill. While the technique cannot yet reliably quantify the degree of surgical skill, it represents an important advance towards automating surgical skill assessment.
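
The second and third stages of this pipeline (motion features from instrument trajectories, then a linear model over those features) can be sketched as below. This assumes per-frame instrument centroids are already available from a detector; the feature definitions, the toy data, and the use of logistic regression as the linear skill classifier are illustrative assumptions, not the study's exact setup.

```python
# Sketch of motion-feature extraction and a linear skill model over trajectories.
# Feature names, fps, and the toy labels are assumed for illustration.
import numpy as np
from sklearn.linear_model import LogisticRegression


def motion_features(track: np.ndarray, fps: float = 25.0) -> np.ndarray:
    """Summarize one instrument trajectory of shape (T, 2) with (x, y) centroids."""
    steps = np.diff(track, axis=0)                      # per-frame displacement
    speed = np.linalg.norm(steps, axis=1) * fps         # pixels per second
    path_length = speed.sum() / fps                     # total distance travelled
    accel = np.diff(speed) * fps                        # smoothness proxy
    return np.array([path_length, speed.mean(), speed.std(), np.abs(accel).mean()])


# Toy data: 20 videos, each a fake 500-frame trajectory; noisier motion is labelled 1.
rng = np.random.default_rng(0)
tracks = [np.cumsum(rng.normal(scale=1 + 3 * (i % 2), size=(500, 2)), axis=0)
          for i in range(20)]
labels = np.array([i % 2 for i in range(20)])

X = np.stack([motion_features(t) for t in tracks])       # (videos, features)
clf = LogisticRegression(max_iter=1000).fit(X, labels)    # linear decision over motion features
print(clf.score(X, labels))
```

Summarizing each video as a small, fixed-length motion-feature vector is what makes a simple linear model sufficient for the final good-versus-poor decision.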


2007 ◽  
Vol 6 (2) ◽  
pp. 188-191 ◽  
Author(s):  
Sharif Al-Ruzzeh ◽  
Shishir Karthik ◽  
David O'Regan

Ophthalmology ◽  
2011 ◽  
Vol 118 (2) ◽  
pp. 427-427.e5 ◽  
Author(s):  
Karl C. Golnik ◽  
Hilary Beaver ◽  
Vinod Gauba ◽  
Andrew G. Lee ◽  
Eduardo Mayorga ◽  
...  

2015 ◽  
Vol 72 (5) ◽  
pp. 910-917 ◽  
Author(s):  
Giovanni Saggio ◽  
Alessandra Lazzaro ◽  
Laura Sbernini ◽  
Francesco Maria Carrano ◽  
Davide Passi ◽  
...  
