IEEE Transactions on Circuits and Systems for Video Technology

Spatial and Motion Saliency Prediction Method Using Eye Tracker Data for Video Summarization


Abstract

Video summarization is the process of extracting the most significant content of a video and representing it in a concise form. Existing methods for video summarization cannot achieve satisfactory results for videos with camera movement and significant illumination changes. To solve these problems, in this paper, a new framework for video summarization is proposed based on eye tracker data, as human eyes can track moving objects accurately in these cases. Smooth pursuit is the eye-movement state in which a viewer follows a moving object in a video. This motivates us to implement a new method to distinguish smooth pursuit from other types of gaze points, such as fixations and saccades. Smooth pursuit provides only the location of moving objects in a video frame; it indicates neither whether the located moving objects are attractive (i.e., salient) to viewers nor the amount of motion of those objects. The extent of salient regions and the amount of object motion are two important features for measuring viewer attention when determining key frames for video summarization. To find the most attractive objects, a new spatial saliency prediction method is also proposed by constructing a saliency map around each smooth-pursuit gaze point based on the regions of the human visual field, namely the foveal, parafoveal, and perifoveal regions. To quantify the amount of object motion, the total distance between consecutive gaze points of viewers during smooth pursuit is measured as a motion saliency score. The motivation is that the movement of eye gaze is related to the motion of the objects during smooth pursuit. Finally, both spatial and motion saliency maps are combined to obtain an aggregated saliency score for each frame, and a set of key frames is selected based on a user-selected or system-default skimming ratio.
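The pipeline described above can be illustrated with a minimal Python sketch. The velocity thresholds, fovea radii, and Gaussian weights below are illustrative assumptions for exposition, not values taken from the paper:

```python
import numpy as np

def classify_gaze(points, dt=1/60, sacc_thresh=100.0, fix_thresh=5.0):
    """Label each consecutive gaze-sample pair as 'saccade', 'fixation',
    or 'smooth_pursuit' from its velocity. Thresholds are illustrative."""
    v = np.linalg.norm(np.diff(points, axis=0), axis=1) / dt
    return np.where(v > sacc_thresh, 'saccade',
           np.where(v < fix_thresh, 'fixation', 'smooth_pursuit'))

def spatial_saliency(shape, pursuit_points, radii=(30, 80, 140)):
    """Saliency map built from concentric foveal/parafoveal/perifoveal
    Gaussians centred on each smooth-pursuit gaze point (radii in
    pixels, assumed); inner regions receive higher weight."""
    h, w = shape
    ys, xs = np.mgrid[0:h, 0:w]
    sal = np.zeros(shape)
    for gx, gy in pursuit_points:
        d2 = (xs - gx) ** 2 + (ys - gy) ** 2
        for sigma, weight in zip(radii, (1.0, 0.5, 0.25)):
            sal += weight * np.exp(-d2 / (2 * sigma ** 2))
    return sal / sal.max() if sal.max() > 0 else sal

def motion_saliency(pursuit_points):
    """Total gaze displacement during smooth pursuit, used as a proxy
    for the amount of object motion in the frame."""
    pts = np.asarray(pursuit_points, dtype=float)
    if len(pts) < 2:
        return 0.0
    return float(np.linalg.norm(np.diff(pts, axis=0), axis=1).sum())
```

The design choice here is that smooth-pursuit samples alone seed the saliency map, so fixations on static background and saccadic jumps do not inflate either the spatial or the motion score.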
The proposed method is evaluated on the Office video data set, which contains videos with camera movement and illumination changes. Experimental results confirm the superior performance of the proposed spatial and motion saliency prediction method compared with state-of-the-art methods.
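The final aggregation and key-frame selection step can likewise be sketched; the equal weighting between the two saliency terms is an assumption for illustration:

```python
import numpy as np

def frame_scores(spatial_scores, motion_scores, alpha=0.5):
    """Aggregate per-frame saliency as a weighted sum of the normalised
    spatial and motion saliency scores; alpha is an assumed weight."""
    s = np.asarray(spatial_scores, dtype=float)
    m = np.asarray(motion_scores, dtype=float)
    norm = lambda x: x / x.max() if x.max() > 0 else x
    return alpha * norm(s) + (1 - alpha) * norm(m)

def select_key_frames(scores, skim_ratio=0.1):
    """Keep the top-scoring frames so the summary retains `skim_ratio`
    of the original frames; indices are returned in temporal order."""
    k = max(1, int(round(len(scores) * skim_ratio)))
    top = np.argsort(scores)[-k:]
    return sorted(top.tolist())
```

Returning the selected indices in temporal order preserves the narrative flow of the original video in the resulting summary.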
