Cross-lingual few-shot sign language recognition

Bilge, YUNUS; İKİZLER CİNBİŞ, NAZLI; CİNBİŞ, RAMAZAN

doi:10.1016/j.patcog.2024.110374

Cross-lingual few-shot sign language recognition

Bilge Y. C., İKİZLER CİNBİŞ N., CİNBİŞ R. G.

PATTERN RECOGNITION, 2024 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Basım Tarihi: 2024
Doi Numarası: 10.1016/j.patcog.2024.110374
Dergi Adı: PATTERN RECOGNITION
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Applied Science & Technology Source, BIOSIS, Compendex, Computer & Applied Sciences, INSPEC, MLA - Modern Language Association Database, zbMATH
Anahtar Kelimeler: Cross-lingual sign language recognition, Few-shot learning, Sign language recognition
Hacettepe Üniversitesi Adresli: Evet

Özet

There are over 150 sign languages worldwide, each with numerous local variants and thousands of signs. However, collecting annotated data for each sign language to train a model is a laborious and expert-dependent task. To address this issue, this paper introduces the problem of few-shot sign language recognition (FSSLR) in a cross-lingual setting. The central motivation is to be able to recognize a novel sign, even if it belongs to a sign language unseen during training, based on a small set of examples. To tackle this problem, we propose a novel embedding-based framework that first extracts a spatio-temporal visual representation based on video and hand features, as well as hand landmark estimates. To establish a comprehensive test bed, we propose three meta-learning FSSLR benchmarks that span multiple languages, and extensively evaluate the proposed framework. The experimental results demonstrate the effectiveness and superiority of the proposed approach for few-shot sign language recognition in both monolingual and cross-lingual settings.