Content-based video parsing and indexing based on audio-visualinteraction

Tsekeridou S.; Pitas I.

首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Content-based video parsing and indexing based on audio-visualinteraction

【24h】

Content-based video parsing and indexing based on audio-visualinteraction

机译：基于视听交互的基于内容的视频解析和索引

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A content-based video parsing and indexing method is presented in this paper, which analyzes both information sources (auditory and visual) and accounts for their inter-relations and synergy to extract high-level semantic information. Both frame- and object-based access to the visual information is employed. The aim of the method is to extract semantically meaningful video scenes and assign semantic label(s) to them. Due to the temporal nature of video, time has to be accounted for. Thus, time-constrained video representations and indices are generated. The current approach searches for specific types of content information relevant to the presence or absence of speakers or persons. Audio-source parsing and indexing leads to the extraction of a speaker label mapping of the source over time. Video-source parsing and indexing results in the extraction of a talking-face shot mapping over time. Integration of the audio and visual mappings constrained by interaction rules leads to higher levels of video abstraction and even partial detection of its context

机译：本文提出了一种基于内容的视频解析和索引方法，该方法分析了信息源（听觉和视觉），并说明了它们之间的相互关系和协同作用，以提取高级语义信息。基于帧和基于对象的视觉信息访问都被采用。该方法的目的是提取语义上有意义的视频场景并为其分配语义标签。由于视频的时间特性，必须考虑时间。因此，产生了时间受限的视频表示和索引。当前方法搜索与说话者或人物的存在与否有关的特定类型的内容信息。音频源解析和索引会导致提取源的扬声器标签映射。视频源解析和索引会导致随着时间的推移提取说话人镜头的映射。受交互规则约束的音频和视觉映射的集成导致更高级别的视频抽象，甚至部分检测其上下文

著录项

来源
《IEEE Transactions on Circuits and Systems for Video Technology》 |2001年第4期|p.522-535|共14页
作者
Tsekeridou S.; Pitas I.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
audio-visual systems; content-based retrieval; database indexing; feature extraction; image representation; video databases; video signal processing; audio mapping; audio-source indexing; audio-source parsing; audio-visual interaction; content information; content-b;

机译：视听系统;基于内容的检索;数据库索引;特征提取;图像表示;视频数据库;视频信号处理;音频映射;音频源索引;音频源解析;视听交互;内容信息;content-b;

相似文献

外文文献
中文文献
专利

1. Content-based indexing and teaching focus mining for lecture videos [J] . Yu-Tzu Lin, Bai-Jang Yen, Chia-Hu Chang, Interactive technology and smart education . 2010,第3期

机译：基于内容的索引和教学视频的教学重点挖掘
2. Content-based indexing and teaching focus mining for lecture videos [J] . Yu-Tzu Lin, Bai-Jang YenChia-Hu Chang, Greg C. Lee Interactive Technology and Smart Education . 2010,第3期

机译：基于内容的索引和教学视频的教学重点挖掘
3. A novel ontology for 3D semantics: ontology-based 3D model indexing and content-based video retrieval applied to the medical domain [J] . Leslie F. Sikos International journal of metadata, semantics and ontologies . 2017,第1期

机译：一种新颖的3D语义本体：应用于医学领域的基于本体的3D模型索引和基于内容的视频检索
4. Probabilistic Approach to Content-Based Indexing and Categorization of Temporally Aggregated Shots in News Videos [C] . Kazimierz Choros Asian conference on intelligent information and database systems . 2016

机译：新闻视频中基于内容的索引和临时聚合镜头分类的概率方法
5. Content-based video analysis, indexing and representation using multimodal information. [D] . Li, Ying. 2003

机译：使用多模式信息进行基于内容的视频分析，索引和表示。
6. Content-based indexing of images and video. [O] . A Pentland 1997

机译：基于内容的图像和视频索引。
7. A Study on Shot Segmentation and Indexing of Language Education Videos by Content-based Visual Feature Analysis [O] . Heejun Han 2017

机译：基于内容的视觉特征分析研究语言教育视频的射击分割和索引
8. Indexing, Learning and Content-Based Retrieval for Special Purpose Image Databases [R] . Huiskes, M. J., Pauwels, E. J. 2004

机译：基于索引，学习和基于内容的专用图像数据库检索

Content-based video parsing and indexing based on audio-visualinteraction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅