Learning Actions From the Web

Ikizler-Cinbis N., Cinbis R. G., Sclaroff S.

12th IEEE International Conference on Computer Vision, Kyoto, Japan, 29 September - 02 October 2009, pp.995-1002

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/iccv.2009.5459368
  • City: Kyoto
  • Country: Japan
  • Page Numbers: pp.995-1002
  • Hacettepe University Affiliated: No


This paper proposes a generic method for action recognition in uncontrolled videos. The idea is to use images collected from the Web to learn representations of actions and use this knowledge to automatically annotate actions in videos. Our approach is unsupervised in the sense that it requires no human intervention other than the text querying. Its benefits are two-fold: 1) we can improve retrieval of action images, and 2) we can collect a large generic database of action poses, which can then be used in tagging videos. We present experimental evidence that actions in videos can be annotated using action images collected from the Web.