Discovering Latent Discriminative Patterns for Multi-Mode Event Representation

Xie Wenlong; Yao Hongxun; Sun Xiaoshuai; Han Tingting; Zhao Sicheng; Chua Tat-Seng

首页> 外文期刊>IEEE transactions on multimedia >Discovering Latent Discriminative Patterns for Multi-Mode Event Representation

【24h】

Discovering Latent Discriminative Patterns for Multi-Mode Event Representation

机译：发现多模式事件表示的潜在鉴别模式

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Representation of videos is essential since it conveys an understanding of video content and enables many higher level tasks to be tackled efficiently. However, it is challenging to propose a rational representation for complex event videos, as most video information is either noisy or redundant. In this paper, we propose a compact event representation method that can concisely describe the inner modes of events. We deem that an optimal event representation scheme should reflect the long-term and high-level visual semantics (visual topics) of events, so different from previous frame-level video semantics representation methods and concept-based video representation methods, we investigate the problem from the perspective of segment-level video representations. We then present three appealing properties of segment-level visual semantics. Based on the observation, we propose different algorithms that rely on a novel deep-visual-word-based video encoding method to discover latent discriminative patterns of events. Finally, our multi-mode event representation is obtained by concatenating the discovered patterns as inner modes. We adopt our event representation for representative event parts mining, which can highlight the visual topics of events and remarkably prune the raw videos. We validate our event representation method based on complex event detection task. Experimental results on two standard benchmarking datasets, MED11 and CCV Dataset, show that the proposed method can significantly outperform the state-of-the-art approaches.

机译：视频的表示是必不可少的，因为它传达了对视频内容的理解，并且能够有效地解决许多更高的级别任务。然而，提出复杂事件视频的合理表示是挑战，因为大多数视频信息都是嘈杂的或冗余。在本文中，我们提出了一种紧凑的事件表示方法，可以简明地描述事件的内部模式。我们认为，最佳事件表示方案应反映事件的长期和高级视觉语义（视觉主题），与之前的帧级视频语义表示方法和基于概念的视频表示方法不同，我们调查了这个问题从段级视频表示的角度来看。然后我们提出了三个段级视觉语义的吸引人的属性。基于观察，我们提出了不同的算法依赖于基于新的深视网型的视频编码方法来发现潜在的事件模式。最后，通过将发现的模式连接为内模式来获得我们的多模式事件表示。我们通过我们的活动代表代表事件零件挖掘，可以突出显示事件的视觉主题，并显着修剪原始视频。基于复杂事件检测任务，我们验证了我们的事件表示方法。两个标准基准数据集，MED11和CCV数据集的实验结果表明，该方法可以显着优于最先进的方法。

著录项

来源
《IEEE transactions on multimedia》 |2019年第6期|1425-1436|共12页
作者
Xie Wenlong; Yao Hongxun; Sun Xiaoshuai; Han Tingting; Zhao Sicheng; Chua Tat-Seng;
展开▼
作者单位

Harbin Inst Technol Dept Comp Sci & Technol Harbin 150001 Heilongjiang Peoples R China;

Harbin Inst Technol Dept Comp Sci & Technol Harbin 150001 Heilongjiang Peoples R China;

Harbin Inst Technol Dept Comp Sci & Technol Harbin 150001 Heilongjiang Peoples R China;

Harbin Inst Technol Dept Comp Sci & Technol Harbin 150001 Heilongjiang Peoples R China;

Univ Calif Berkeley Dept Elect Engn & Comp Sci Berkeley CA 94720 USA;

Natl Univ Singapore Sch Comp Singapore 117417 Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Segment-level; visual topics; event representation; latent patterns; event epitomes;

机译：段级;视觉主题;事件表示;潜在模式;事件表观;

相似文献

外文文献
中文文献
专利

1. Discovering Latent Discriminative Patterns for Multi-Mode Event Representation [J] . Xie Wenlong, Yao Hongxun, Sun Xiaoshuai, IEEE transactions on multimedia . 2019,第6期

机译：发现多模式事件表示的潜在判别模式
2. Discovering Interpretable Representations for Both Deep Generative and Discriminative Models [J] . Tameem Adel, Zoubin Ghahramani, Adrian Weller JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：发现深度生成模型和判别模型的可解释表示
3. Erratum to: Latent discriminative representation learning for speaker recognition [J] . Duolin Huang, Qirong Mao, Zhongchen Ma, Frontiers of Information Technology & Electronic Engineering . 2021,第6期

机译：误解：扬声器识别的潜在歧视性代表学习
4. Discovering Student Behavior Patterns from Event Logs: Preliminary Results on a Novel Probabilistic Latent Variable Model [C] . Chen Qiao, Xiao Hu IEEE International Conference on Advanced Learning Technologies . 2018

机译：从事件日志中发现学生的行为模式：新型概率潜在变量模型的初步结果
5. Behavioral pattern analysis: Towards a new representation of systems requirements based on actions and events. [D] . El-Ansary, Assem I. 2005

机译：行为模式分析：基于行为和事件，以新的方式表示系统需求。
6. Multilinear Discriminative Spatial Patterns for Movement-Related Cortical Potential Based on EEG Classification with Tensor Representation [O] . Qian Cai, Jianfeng Yan, Hongfang Han, 2021

机译：基于EEG分类的张量表示的移动相关皮质潜力的多线性辨别空间模式
7. Discovering Discriminative and Interpretable Patterns for Surgical Motion Analysis [O] . Germain Forestier, François Petitjean, Pavel Senin, 2017

机译：发现外科运动分析的判别和可解释模式

Discovering Latent Discriminative Patterns for Multi-Mode Event Representation

摘要

著录项

相似文献

相关主题

期刊订阅