Isolated Sign Recognition with a Siamese Neural Network of RGB and Depth Streams

Tur A. O. , Keles H.

18th IEEE International Conference on Smart Technologies (IEEE EUROCON), Novi-Sad, Serbia, 1 - 4 July 2019

  • Publication Type: Conference Paper / Full Text
  • DOI Number: 10.1109/eurocon.2019.8861945
  • City: Novi-Sad
  • Country: Serbia
  • Keywords: sign language recognition, convolutional neural networks, LSTM, GRU, regularization


Sign recognition is a challenging problem because signs vary widely among signers and the input information spans multiple modalities. The domain also inherits the challenges of action classification in computer vision, such as variations in illumination and background. In this work, we propose a Siamese Neural Network (SNN) architecture that extracts features from the RGB and depth streams of a sign frame in parallel. We use a pretrained model for the SNN without any fine-tuning on our training data. We then apply global feature pooling to the depth and color features that the SNN generates and feed the concatenation of the selected features to a recurrent neural network (RNN) to discriminate the signs. We trained our model parameters on the Montalbano dataset and achieved 93.19% test accuracy with ResNet-50 and 91.61% with VGG-16 network models.
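
The data flow described in the abstract (shared feature extractor on both streams, global pooling, concatenation, recurrent classifier) can be illustrated with a minimal sketch. This is not the authors' implementation: the frozen "backbone" here is a toy 1x1 convolution standing in for a pretrained ResNet-50 or VGG-16, the recurrent part is a plain tanh cell rather than an LSTM or GRU, average pooling is one possible choice of global pooling, and replicating the single depth channel to three channels is an assumption made so both streams fit the same network. All function names, sizes, and the class count are hypothetical.

```python
import math
import random


def make_backbone(in_channels, out_channels, seed=0):
    """Toy stand-in for a frozen pretrained CNN: fixed 1x1 convolution + ReLU."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-1, 1) for _ in range(in_channels)]
               for _ in range(out_channels)]

    def backbone(frame):
        # frame: H x W x C nested lists -> feature map: H x W x out_channels
        return [[[max(0.0, sum(w * x for w, x in zip(row_w, pixel)))
                  for row_w in weights]
                 for pixel in row]
                for row in frame]

    return backbone


def replicate_channels(depth_frame, channels=3):
    """Assumption: copy the single depth channel so it matches the RGB input shape."""
    return [[[d] * channels for d in row] for row in depth_frame]


def global_average_pool(feature_map):
    """Average every channel over all spatial positions."""
    pixels = [pixel for row in feature_map for pixel in row]
    return [sum(p[c] for p in pixels) / len(pixels)
            for c in range(len(pixels[0]))]


def frame_features(backbone, rgb_frame, depth_frame):
    """Siamese step: the same backbone (shared weights) processes both streams."""
    rgb_vec = global_average_pool(backbone(rgb_frame))
    depth_vec = global_average_pool(backbone(replicate_channels(depth_frame)))
    return rgb_vec + depth_vec  # list concatenation of the two pooled vectors


def make_rnn_classifier(input_size, hidden_size, num_classes, seed=1):
    """Simple tanh recurrent cell + softmax; untrained, random weights."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    w_in = mat(hidden_size, input_size)
    w_rec = mat(hidden_size, hidden_size)
    w_out = mat(num_classes, hidden_size)

    def matvec(m, v):
        return [sum(a * b for a, b in zip(row, v)) for row in m]

    def classify(sequence):
        h = [0.0] * hidden_size
        for x in sequence:  # one concatenated feature vector per frame
            h = [math.tanh(a + b)
                 for a, b in zip(matvec(w_in, x), matvec(w_rec, h))]
        logits = matvec(w_out, h)
        peak = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(z - peak) for z in logits]
        total = sum(exps)
        return [e / total for e in exps]

    return classify


# Usage on random data: 5 frames of 4x4 pixels (illustrative sizes only).
rng = random.Random(42)
video = [
    ([[[rng.random() for _ in range(3)] for _ in range(4)] for _ in range(4)],  # RGB
     [[rng.random() for _ in range(4)] for _ in range(4)])                      # depth
    for _ in range(5)
]
backbone = make_backbone(in_channels=3, out_channels=8)
classify = make_rnn_classifier(input_size=16, hidden_size=12, num_classes=20)
sequence = [frame_features(backbone, rgb, depth) for rgb, depth in video]
probs = classify(sequence)
```

Because the backbone weights are fixed and shared, only the recurrent classifier would need training in such a setup, which mirrors the abstract's statement that the pretrained model is used without fine-tuning.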