Ranking surgical skills using an attention-enhanced Siamese network with piecewise aggregated kinematic data

Ogul, Burcin; Gilgien, Matthias; ÖZDEMİR, SUAT

doi:10.1007/s11548-022-02581-8

Ranking surgical skills using an attention-enhanced Siamese network with piecewise aggregated kinematic data

Atıf İçin Kopyala

Ogul B. B., Gilgien M., ÖZDEMİR S.

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, cilt.17, sa.6, ss.1039-1048, 2022 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 17 Sayı: 6
Basım Tarihi: 2022
Doi Numarası: 10.1007/s11548-022-02581-8
Dergi Adı: INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, EMBASE, INSPEC, MEDLINE
Sayfa Sayıları: ss.1039-1048
Anahtar Kelimeler: Robot-assisted surgery, Skill assessment, Attention-enhanced Siamese networks, Assessment of surgical skills
Hacettepe Üniversitesi Adresli: Evet

Özet

Purpose Surgical skill assessment using computerized methods is considered to be a promising direction in objective performance evaluation and expert training. In a typical architecture for computerized skill assessment, a classification system is asked to assign a query action to a predefined category that determines the surgical skill level. Since such systems are still trained by manual, potentially inconsistent annotations, an attempt to categorize the skill level can be biased by potentially scarce or skew training data. Methods We approach the skill assessment problem as a pairwise ranking task where we compare two input actions to identify better surgical performance. We propose a model that takes two kinematic motion data acquired from robot-assisted surgery sensors and report the probability of a query sample having a better skill than a reference one. The model is an attention-enhanced Siamese Long Short-Term Memory Network fed by piecewise aggregate approximation of kinematic data. Results The proposed model can achieve higher accuracy than existing models for pairwise ranking in a common dataset. It can also outperform existing regression models when applied in their experimental setup. The model is further shown to be accurate in individual progress monitoring with a new dataset, which will serve as a strong baseline. Conclusion This relative assessment approach may overcome the limitations of having consistent annotations to define skill levels and provide a more interpretable means for objective skill assessment. Moreover, the model allows monitoring the skill development of individuals by comparing two activities at different time points.