Visual Attention-driven Spatial Pooling for Image Memorability


Celikkale B., Erdem A., Erdem M. E.

26th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Oregon, United States Of America, 23 - 28 June 2013, pp.976-983

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/cvprw.2013.142
  • City: Oregon
  • Country: United States Of America
  • Page Numbers: pp.976-983

Abstract

In daily life, humans demonstrate an astounding ability to remember images they see in magazines, in commercials, on TV, on the web, and so on, but the automatic prediction of the intrinsic memorability of images using computer vision and machine learning techniques was not investigated until a few years ago. Despite these recent advances, none of the available approaches makes use of an attentional mechanism, a fundamental aspect of human vision that selects relevant image regions for higher-level processing. Our goal in this paper is to explore the role of visual attention in understanding the memorability of images. In particular, we present an attention-driven spatial pooling strategy for image memorability and show that the regions estimated by bottom-up and object-level saliency maps are more effective in predicting memorability than the fixed spatial pyramid structure used in previous studies.
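
The contrast between a fixed spatial pyramid and attention-driven pooling can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the max-pooling operator, the single binary threshold on the saliency map, and the (H, W, K) layout of local feature codes are all assumptions made for illustration. The only idea taken from the abstract is that pooling regions come from a saliency map rather than from a fixed grid.

```python
import numpy as np


def spatial_pyramid_pool(codes, levels=(1, 2)):
    """Max-pool local feature codes over a fixed grid (hypothetical baseline).

    codes: array of shape (H, W, K), one K-dimensional code per location.
    levels: grid sizes; level n splits the image into n x n cells.
    """
    H, W, K = codes.shape
    pooled = []
    for n in levels:
        ys = np.linspace(0, H, n + 1).astype(int)
        xs = np.linspace(0, W, n + 1).astype(int)
        for i in range(n):
            for j in range(n):
                cell = codes[ys[i]:ys[i + 1], xs[j]:xs[j + 1]]
                pooled.append(cell.reshape(-1, K).max(axis=0))
    return np.concatenate(pooled)


def attention_pool(codes, saliency, threshold=0.5):
    """Max-pool codes over salient and non-salient regions (assumed scheme).

    saliency: array of shape (H, W); it is rescaled to [0, 1] and split
    into two regions at `threshold`, which is an illustrative choice.
    """
    H, W, K = codes.shape
    span = saliency.max() - saliency.min()
    s = (saliency - saliency.min()) / (span + 1e-12)
    mask = (s >= threshold).reshape(-1)
    flat = codes.reshape(-1, K)
    # Fall back to a zero vector when a region is empty.
    salient = flat[mask].max(axis=0) if mask.any() else np.zeros(K)
    background = flat[~mask].max(axis=0) if (~mask).any() else np.zeros(K)
    return np.concatenate([salient, background])
```

In this sketch the pyramid descriptor has a fixed layout regardless of image content, while the attention-driven descriptor groups features by where the saliency map says the viewer is likely to look. Either descriptor could then be passed to a regressor that predicts a memorability score.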