IEEE Transactions on Circuits and Systems for Video Technology

A Super Descriptor Tensor Decomposition for Dynamic Scene Recognition



Abstract

This paper presents a new approach for dynamic scene recognition based on a super descriptor tensor decomposition. Recently, local feature extraction based on dense trajectories has been used to model motion. However, dense trajectories usually include a large number of unnecessary trajectories, which increase noise, add complexity, and limit recognition accuracy. Another problem is that traditional bag-of-words techniques encode and concatenate the local features extracted from multiple descriptors to form a single large vector for classification. This concatenation not only destroys the spatio-temporal structure among the features but also yields high dimensionality. To address these problems, we first propose to refine the dense trajectories by selecting only salient trajectories in a region of interest containing motion. Visual descriptors consisting of oriented gradient and motion boundary histograms are then computed along the refined dense trajectories. In the case of camera motion, a short-window video stabilization is integrated to compensate for global motion. Second, the features extracted from multiple descriptors are encoded using a super descriptor tensor model. To this end, the Tucker-3 tensor decomposition is employed to obtain a compact set of salient features, followed by feature selection via Fisher ranking. Experiments are conducted on two benchmark dynamic scene recognition datasets: Maryland "in-the-wild" and YUPENN dynamic scenes. Experimental results show that the proposed approach outperforms several existing methods in terms of recognition accuracy and achieves performance comparable with state-of-the-art deep learning methods. The proposed approach achieves classification rates of 89.2% on the Maryland dataset and 98.1% on the YUPENN dataset.
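Below is a minimal Python sketch (not the authors' implementation) of the two encoding steps described in the abstract: a Tucker-3 decomposition of a per-video descriptor tensor to obtain a compact core of salient features, followed by Fisher-score ranking for feature selection. The tensor layout, ranks, and helper names are illustrative assumptions; the decomposition uses the tensorly library.

import numpy as np
import tensorly as tl
from tensorly.decomposition import tucker

def encode_video(descriptor_tensor, ranks=(32, 16, 8)):
    # Tucker-3 decomposition of an assumed (trajectories x descriptor-dim x descriptor-type)
    # tensor; the flattened core serves as a compact video-level feature vector.
    core, _factors = tucker(tl.tensor(descriptor_tensor), rank=list(ranks))
    return tl.to_numpy(core).ravel()

def fisher_scores(X, y):
    # Fisher score per feature: between-class scatter over within-class scatter.
    classes = np.unique(y)
    overall_mean = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for c in classes:
        Xc = X[y == c]
        between += Xc.shape[0] * (Xc.mean(axis=0) - overall_mean) ** 2
        within += Xc.shape[0] * Xc.var(axis=0)
    return between / (within + 1e-12)

# Hypothetical usage: encode each training video, rank the core features,
# and keep the top-k for the final classifier.
# X = np.stack([encode_video(t) for t in training_tensors])
# top_k = np.argsort(fisher_scores(X, labels))[::-1][:500]
# X_selected = X[:, top_k]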
