Visual Attention-driven Spatial Pooling for Image Memorability

Celikkale B., Erdem A., ERDEM M. E.

26th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Oregon, Amerika Birleşik Devletleri, 23 - 28 Haziran 2013, ss.976-983, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası:
Doi Numarası: 10.1109/cvprw.2013.142
Basıldığı Şehir: Oregon
Basıldığı Ülke: Amerika Birleşik Devletleri
Sayfa Sayıları: ss.976-983
Hacettepe Üniversitesi Adresli: Evet

Özet

In daily life, humans demonstrate astounding ability to remember images they see on magazines, commercials, TV, the web and so on, but automatic prediction of intrinsic memorability of images using computer vision and machine learning techniques was not investigated until a few years ago. However, despite these recent advances, none of the available approaches makes use of any attentional mechanism, a fundamental aspect of human vision, which selects relevant image regions for higher-level processing. Our goal in this paper is to explore the role of visual attention in understanding memorability of images. In particular, we present an attention-driven spatial pooling strategy for image memorability and show that the regions estimated by bottom-up and object-level saliency maps are more effective in predicting memorability than considering a fixed spatial pyramid structure as in the previous studies.