The ability to predict, given an image or a video, where a human might fixate elements of a viewed scene has long been of interest in the vision community. In this note we propose a different view of the gaze-shift mechanism as that of a motor system implementation of an active random sampling strategy that the Human Visual System has evolved in order to efficiently and effectively infer properties of the surrounding world. We show how it can be exploited to carry on an attentive analysis of dynamic scenes.
展开▼