On Recognizing Actions in Still Images via Multiple Features

Sener F., Bas C., Ikizler-Cinbis N.

12th European Conference on Computer Vision (ECCV), Florence, İtalya, 7 - 13 Ekim 2012, cilt.7585, ss.263-272, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası: 7585
Doi Numarası: 10.1007/978-3-642-33885-4_27
Basıldığı Şehir: Florence
Basıldığı Ülke: İtalya
Sayfa Sayıları: ss.263-272
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
Hacettepe Üniversitesi Adresli: Evet

Özet

We propose a multi-cue based approach for recognizing human actions in still images, where relevant object regions are discovered and utilized in a weakly supervised manner. Our approach does not require any explicitly trained object detector or part/attribute annotation. Instead, a multiple instance learning approach is used over sets of object hypotheses in order to represent objects relevant to the actions. We test our method on the extensive Stanford 40 Actions dataset [1] and achieve significant performance gain compared to the state-of-the-art. Our results show that using multiple object hypotheses within multiple instance learning is effective for human action recognition in still images and such an object representation is suitable for using in conjunction with other visual features.