From Text to Video: Exploiting Mid-Level Semantics for Large-Scale Video Classification

Abstract

Automatically classifying large-scale video data is an urgent yet challenging task. To bridge the semantic gap between low-level features and high-level video semantics, we propose a method to represent videos with their mid-level semantics. Inspired by the problem of text classification, we regard the visual objects in videos as the words in documents, and adapt the TF-IDF word weighting method to encode videos by visual objects. Some extensions to the proposed method are also made according to the characteristics of videos. We integrate the proposed semantic encoding method with the popular two-stream CNN model for video classification. Experiments are conducted on two large-scale video datasets, CCV and ActivityNet. The experimental results validate the effectiveness of our method.
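The objects-as-words analogy in the abstract can be illustrated with a minimal sketch. It uses the textbook TF-IDF weighting (term frequency times log inverse document frequency), not necessarily the exact formulation or the video-specific extensions of the paper, which the abstract does not spell out. The function name `tfidf_encode`, the toy object labels, and the assumption that an object detector has already produced a list of labels per video are all hypothetical.

```python
import math
from collections import Counter


def tfidf_encode(videos, vocabulary):
    """Encode each video (a list of detected object labels) as a TF-IDF vector.

    Assumption: plain TF-IDF with tf = count / total detections and
    idf = log(N / df). The paper's own weighting may differ.
    """
    n_videos = len(videos)

    # Document frequency: number of videos containing each object at least once.
    doc_freq = Counter()
    for labels in videos:
        doc_freq.update(set(labels))

    encoded = []
    for labels in videos:
        counts = Counter(labels)
        total = len(labels)
        vector = []
        for obj in vocabulary:
            tf = counts[obj] / total if total else 0.0
            idf = math.log(n_videos / doc_freq[obj]) if doc_freq[obj] else 0.0
            vector.append(tf * idf)
        encoded.append(vector)
    return encoded


# Hypothetical detector output: one list of object labels per video.
videos = [
    ["dog", "ball", "dog", "person"],
    ["person", "guitar", "person"],
    ["person", "ball", "goal"],
]
vocabulary = ["ball", "dog", "goal", "guitar", "person"]
vectors = tfidf_encode(videos, vocabulary)
```

In this toy data "person" appears in every video, so its weight is zero everywhere, while "dog" is specific to the first video and receives a high weight there. This mirrors how TF-IDF discounts words that occur in every document.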

Bibliographic details

Source
International Conference on Pattern Recognition, 2018, pp. 1695-1700 (6 pages)
Authors
Ji Zhang; Kuizhi Mei; Xiao Wang; Yu Zheng; Jianping Fan
Original format: PDF
Keywords
Semantics; Task analysis; Visualization; Streaming media; Detectors; Encoding; Bridges

