On Recognizing Actions in Still Images via Multiple Features

Sener F., Bas C., Ikizler-Cinbis N.

12th European Conference on Computer Vision (ECCV), Florence, Italy, 7-13 October 2012, vol. 7585, pp. 263-272

  • Publication Type: Conference Paper / Full Text
  • Volume: 7585
  • Doi Number: 10.1007/978-3-642-33885-4_27
  • City: Florence
  • Country: Italy
  • Page Numbers: pp.263-272
  • Hacettepe University Affiliated: Yes


We propose a multi-cue approach for recognizing human actions in still images, in which relevant object regions are discovered and used in a weakly supervised manner. Our approach does not require any explicitly trained object detector or part/attribute annotation. Instead, multiple instance learning is applied over sets of object hypotheses to represent the objects relevant to each action. We test our method on the extensive Stanford 40 Actions dataset [1] and achieve a significant performance gain over the state of the art. Our results show that using multiple object hypotheses within multiple instance learning is effective for human action recognition in still images, and that such an object representation is suitable for use in conjunction with other visual features.
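
To make the idea of multiple instance learning over object hypotheses more concrete, the following is a minimal sketch and not the authors' implementation. It assumes each image is a "bag" of feature vectors (one per object hypothesis), that only the bag carries an action label, and that a bag is scored by its best-scoring hypothesis under a linear model trained with logistic loss. The function names (bag_score, train_mil, make_toy_bags), the synthetic data, and all hyperparameters are hypothetical choices for illustration; the abstract does not specify the MIL formulation or the features used.

```python
import numpy as np

def bag_score(bag, w, b):
    """Score a bag by its best-scoring instance (max over object hypotheses)."""
    scores = bag @ w + b
    k = int(np.argmax(scores))
    return scores[k], k

def train_mil(bags, labels, epochs=300, lr=0.1):
    """Fit a linear max-pooling MIL classifier with logistic loss.

    Assumption: plain full-batch subgradient descent; only the top-scoring
    instance of each bag receives gradient, as in max-pooling MIL.
    """
    labels = np.asarray(labels, dtype=float)
    # Heuristic initialisation: difference of mean instance features
    # between positive and negative bags.
    pos = np.vstack([x for x, y in zip(bags, labels) if y == 1])
    neg = np.vstack([x for x, y in zip(bags, labels) if y == 0])
    w = pos.mean(axis=0) - neg.mean(axis=0)
    b = 0.0
    for _ in range(epochs):
        grad_w = np.zeros_like(w)
        grad_b = 0.0
        for bag, y in zip(bags, labels):
            s, k = bag_score(bag, w, b)
            p = 1.0 / (1.0 + np.exp(-np.clip(s, -30, 30)))
            grad_w += (p - y) * bag[k]
            grad_b += p - y
        w -= lr * grad_w / len(bags)
        b -= lr * grad_b / len(bags)
    return w, b

def make_toy_bags(n_bags=200, n_inst=6, dim=5, shift=6.0, seed=0):
    """Synthetic stand-in for object hypotheses: a positive bag hides one
    'relevant object' instance whose first feature is shifted."""
    rng = np.random.default_rng(seed)
    bags, labels = [], []
    for i in range(n_bags):
        bag = rng.normal(size=(n_inst, dim))
        y = i % 2
        if y == 1:
            bag[rng.integers(n_inst), 0] += shift
        bags.append(bag)
        labels.append(y)
    return bags, np.array(labels)

bags, labels = make_toy_bags()
w, b = train_mil(bags, labels)
preds = np.array([int(bag_score(bag, w, b)[0] > 0) for bag in bags])
accuracy = float((preds == labels).mean())
print(f"training accuracy on toy bags: {accuracy:.2f}")
```

In this sketch the index returned by bag_score identifies which hypothesis the model treats as the relevant object region, which is how weak, image-level supervision can still localize an object. In a multi-cue setting, the resulting bag score or selected instance feature could be combined with other image-level features, though how the paper performs that combination is not stated in the abstract.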