Isolated Sign Recognition with a Siamese Neural Network of RGB and Depth Streams


Tur A. O., Keles H.

18th IEEE International Conference on Smart Technologies (IEEE EUROCON), Novi-Sad, Sırbistan, 1 - 04 Temmuz 2019 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/eurocon.2019.8861945
  • Basıldığı Şehir: Novi-Sad
  • Basıldığı Ülke: Sırbistan
  • Anahtar Kelimeler: sign language recognition, convolutional neural networks, LSTM, GRU, regularization
  • Hacettepe Üniversitesi Adresli: Hayır

Özet

Sign recognition is a challenging problem due to high variance of the signs among different signers and multiple modalities of the input information. In addition, the challenges that exist in the action classification problems in computer vision are similar in this domain too, such as variations in illumination and background. In this work, we propose a Siamese Neural Network (SNN) architecture that is used to extract features from the RGB and the depth streams of a sign frame in parallel. We use a pretrained model for the SNN without any finetuning to our training data. We then apply global feature pooling to the depth and color features that the SNN generates and feed the concatenation of the selected features to a recurrent neural network (RNN) to discriminate the signs. We trained our model parameters with the Montalbano dataset and achieved 93.19% test accuracy with ResNet-50 and 91.61% with VGG-16 Network Models.