Object, Scene and Actions: Combining Multiple Features for Human Action Recognition

Ikizler-Cinbis N., Sclaroff S.

11th European Conference on Computer Vision, Heraklion, Greece, 5 - 11 September 2010, vol.6311, pp.494-507, (Full Text Paper)

Publication Type: Conference Paper / Full Text Paper
Volume: 6311
DOI Number: 10.1007/978-3-642-15549-9_36
City of Publication: Heraklion
Country of Publication: Greece
Pages: pp.494-507
Affiliated with Hacettepe University: No

Abstract

In many cases, human actions can be identified not only by observing the human body in motion, but also by the properties of the surrounding scene and the related objects. In this paper, we look into this problem and propose an approach for human action recognition that integrates multiple feature channels from several entities such as objects, scenes and people. We formulate the problem in a multiple instance learning (MIL) framework, based on multiple feature channels. By using a discriminative approach, we join multiple feature channels embedded in the MIL space. Our experiments over the large YouTube dataset show that scene and object information can be used to complement person features for human action recognition.