Combining 2D and 3D deep models for action recognition with depth information

KEÇELİ, ALİ; KAYA, AYDIN; CAN, AHMET

doi:10.1007/s11760-018-1271-3

Combining 2D and 3D deep models for action recognition with depth information

KEÇELİ A. S., KAYA A., CAN A. B.

SIGNAL IMAGE AND VIDEO PROCESSING, cilt.12, sa.6, ss.1197-1205, 2018 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 12 Sayı: 6
Basım Tarihi: 2018
Doi Numarası: 10.1007/s11760-018-1271-3
Dergi Adı: SIGNAL IMAGE AND VIDEO PROCESSING
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.1197-1205
Anahtar Kelimeler: Action recognition, Dyadic actions, Deep learning, Feature selection, RGB-D data, NETWORKS, CNN
Hacettepe Üniversitesi Adresli: Evet

Özet

In activity recognition, usage of depth data is a rapidly growing research area. This paper presents a method for recognizing single-person activities and dyadic interactions by using deep features extracted from both 3D and 2D representations, which are constructed from depth sequences. First, a 3D volume representation is generated by considering spatiotemporal information in depth frames of an action sequence. Then, a 3D-CNN is trained to learn features from these 3D volume representations. In addition to this, a 2D representation is constructed from the weighted sum of the depth sequences. This 2D representation is used with a pre-trained CNN model. Features learned from this model and the 3D-CNN model are used in training of the final approach after a feature selection step. Among the various classifiers, an SVM-based model produced the best results. The proposed method was tested on the MSR-Action3D dataset for single-person activities, the SBU dataset for dyadic interactions, and the NTU RGB+D dataset for both types of actions. Experimental results show that proposed 3D and 2D representations and deep features extracted from them are robust and efficient. The proposed method achieves comparable results with the state of the art methods in the literature.