Detecting person presence in TV shows with linguistic and structural features

Abstract

Person detection and recognition in videos is a hard problem due to the intrinsic ambiguities of the sound and image channels and their interaction. Whatever method is used to extract person hypotheses from the audio or the image channels, person recognition in videos relies on a multimodal decision process that merges the different hypotheses produced in order to decide, for each frame, who is present in the video at the audio level, at the image level or at the content level (person mention in speech or inserted text boxes). In this framework the focus of this paper is to produce a list of person presence hypotheses from the audio channel of a video document only, to be used in addition to person presence detected at the image level by a multimodal fusion process. In this study we focus on the audio channel only, using two kinds of features: linguistic features corresponding to the way a person is mentioned by a speaker; structural features corresponding to the context of occurrence of a name in a show. We show that both sets of features are complementary and that good results can be achieved on a TV show corpus annotated with person presence labels.
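To illustrate how the two feature families named in the abstract could be computed separately and then merged into one feature vector per name mention, here is a minimal sketch. The `Mention` record, the cue phrases, and the feature names are all assumptions made for illustration; the abstract does not specify the paper's actual features or classifier.

```python
from dataclasses import dataclass


@dataclass
class Mention:
    """A person name found in the speech transcript (hypothetical record)."""
    name: str           # the name as it appears in the transcript
    left_context: str   # words spoken just before the name
    turn_index: int     # index of the speaker turn containing the mention
    n_turns: int        # total number of speaker turns in the show


# Hypothetical cue phrases; the source does not list the cues it uses.
ADDRESS_CUES = ("thank you", "welcome", "hello")


def linguistic_features(m: Mention) -> dict:
    """Features describing the way the speaker mentions the person."""
    ctx = m.left_context.lower()
    return {"ling_address_cue": any(cue in ctx for cue in ADDRESS_CUES)}


def structural_features(m: Mention) -> dict:
    """Features describing where in the show the name occurs."""
    rel_position = m.turn_index / max(m.n_turns - 1, 1)
    return {
        "struct_rel_position": rel_position,
        "struct_first_turn": m.turn_index == 0,
    }


def extract_features(m: Mention) -> dict:
    """Merge both families into one feature dictionary for a classifier."""
    return {**linguistic_features(m), **structural_features(m)}
```

Keeping the two families in separate functions makes it straightforward to train on either set alone or on both together, which is one way to check whether they are complementary.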

Bibliographic details

Source: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | 2012 | pp. 5077-5080 | 4 pages
Conference location: Kyoto (JP)
Author: Bechet Frederic
Author affiliation: Aix Marseille Univ LIF/CNRS, France
Original format: PDF
Text language: English
Chinese Library Classification: Communication theory

Similar literature

1. Study on a method for detecting failure of digital television transmission based on extraction of feature information of embedded invisible marker signals and statistical test [J]. Osamu Sugimoto, Ryoichi Kawada, Masahiro Wada. 電子情報通信学会技術研究報告. オフィスシステム. 2002, No. 313
2. Study on a method for detecting failure of digital television transmission based on extraction of feature information of embedded invisible marker signals and statistical test [J]. Osamu Sugimoto, Ryoichi Kawada, Masahiro Wada. 電子情報通信学会技術研究報告. 画像工学. Image Engineering. 2002, No. 315
3. Study on a method for detecting failure of digital television transmission based on extraction of feature information of embedded invisible marker signals and statistical test [J]. Osamu Sugimoto, Ryoichi Kawada, Masahiro Wada. 電子情報通信学会技術研究報告. オフィスシステム. 2002, No. 313
4. Detecting person presence in TV shows with linguistic and structural features [C]. Bechet Frederic. IEEE International Conference on Acoustics, Speech and Signal Processing. 2011
5. A Comparison of the Situational and Linguistic Features of High-Profile Criminal Trials and TV Series Courtroom Trials [D]. Chen, Meishan. 2018
6. Structural features of free N-glycans occurring in plants and functional features of de-N-glycosylation enzymes ENGase and PNGase: the presence of unusual plant complex type N-glycans [O]. Megumi Maeda, Yoshinobu Kimura. 2014
7. DETECTING PERSON PRESENCE IN TV SHOWS WITH LINGUISTIC AND STRUCTURAL FEATURES [O]. Frederic Bechet, Benoit Favre, Geraldine Damnati. 2012
