Two-person interaction recognition via spatial multiple instance embedding

Sener, Fadime; Ikizler-Cinbis, Nazli

doi:10.1016/j.jvcir.2015.07.016

Sener F., Ikizler-Cinbis N.

Journal of Visual Communication and Image Representation, vol. 32, pp. 63-73, 2015 (SCI-Expanded, Scopus)

Publication Type: Article / Full Article
Volume: 32
Publication Date: 2015
DOI: 10.1016/j.jvcir.2015.07.016
Journal Name: Journal of Visual Communication and Image Representation
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Page Numbers: pp. 63-73
Open Archive Collection: AVESİS Open Access Collection
Hacettepe University Affiliated: Yes

Abstract

In this work, we look into the problem of recognizing two-person interactions in videos. Our method integrates multiple visual features in a weakly supervised manner by utilizing an embedding-based multiple instance learning framework. In our proposed method, first, several visual features that capture the shape and motion of the interacting people are extracted from each detected person region in a video. Then, two-person visual descriptors are formed. Since the relative spatial locations of interacting people are likely to complement the visual descriptors, we propose to use spatial multiple instance embedding, which implicitly incorporates the distances between people into the multiple instance learning process. Experimental results on two benchmark datasets validate that using two-person visual descriptors together with spatial multiple instance learning offers an effective way for inferring the type of the interaction. © 2015 Elsevier Inc. All rights reserved.
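
The abstract does not spell out the embedding itself, so the following is only an illustrative sketch of what an embedding-based multiple instance representation with a spatial term could look like. It assumes a generic max-similarity bag embedding against a set of prototype instances, and it assumes that the distance between the two people is simply appended to the concatenated two-person descriptor. The function names, the Gaussian similarity, and the parameters `sigma` and `spatial_weight` are hypothetical choices made for illustration and are not taken from the paper.

```python
import numpy as np


def two_person_instance(desc_a, desc_b, center_a, center_b, spatial_weight=1.0):
    """Build one instance from two per-person descriptors (hypothetical helper).

    The Euclidean distance between the two person centers is appended as an
    extra coordinate, so any distance computed on the instance also reflects
    how far apart the people are. This is an assumed formulation.
    """
    dist = np.linalg.norm(np.asarray(center_a, dtype=float)
                          - np.asarray(center_b, dtype=float))
    return np.concatenate([np.asarray(desc_a, dtype=float),
                           np.asarray(desc_b, dtype=float),
                           [spatial_weight * dist]])


def embed_bag(bag, prototypes, sigma=1.0):
    """Map a bag of instances to a fixed-length vector.

    bag:        array of shape (n_instances, d), e.g. one instance per frame
    prototypes: array of shape (n_prototypes, d)
    Returns one coordinate per prototype: the Gaussian similarity between the
    prototype and the closest instance in the bag.
    """
    bag = np.asarray(bag, dtype=float)
    prototypes = np.asarray(prototypes, dtype=float)
    # Squared Euclidean distance between every instance and every prototype.
    d2 = ((bag[:, None, :] - prototypes[None, :, :]) ** 2).sum(axis=2)
    # Keep, for each prototype, the best-matching instance of the bag.
    return np.exp(-d2 / sigma ** 2).max(axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy video: 3 frames, each with two people described by 2-d descriptors.
    bag = np.stack([
        two_person_instance(rng.normal(size=2), rng.normal(size=2),
                            rng.uniform(0, 100, size=2),
                            rng.uniform(0, 100, size=2),
                            spatial_weight=0.01)
        for _ in range(3)
    ])
    prototypes = bag[:2]  # stand-in for instances pooled from training videos
    print(embed_bag(bag, prototypes).shape)
```

Under these assumptions, each video becomes a single fixed-length vector that a standard classifier can consume using only video-level labels, which is the sense in which such a pipeline is weakly supervised.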