Detecting person presence in TV shows with linguistic and structural features

机译：用语言和结构特征检测电视节目中的人数

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Person detection and recognition in videos is a hard problem due to the intrinsic ambiguities of the sound and image channels and their interaction. Whatever method is used to extract person hypotheses from the audio or the image channels, person recognition in videos relies on a multimodal decision process that merges the different hypotheses produced in order to decide, for each frame, who is present in the video at the audio level, at the image level or at the content level (person mention in speech or inserted text boxes). In this framework the focus of this paper is to produce a list of person presence hypotheses from the audio channel of a video document only, to be used in addition to person presence detected at the image level by a multimodal fusion process. In this study we focus on the audio channel only, using two kinds of features: linguistic features corresponding to the way a person is mentioned by a speaker; structural features corresponding to the context of occurrence of a name in a show. We show that both sets of features are complementary and that good results can be achieved on a TV show corpus annotated with person presence labels.

机译：由于声音和图像频道的内在模糊及其互动，视频中的人员检测和识别是一个难题。无论哪种方法用于从音频或图像通道中提取人假设，视频中的人员识别依赖于合并所产生的不同假设的多模式决策过程，以便为每个帧判断在音频处的视频中存在的每个帧。级别，在图像级别或内容级别（在语音或插入的文本框中提到）。在该框架中，本文的焦点是在仅通过多模式融合过程中在图像级别检测到的人存在之外，仅从视频文档的音频信道中生成一个人存在假设的列表。在这项研究中，我们仅关注音频通道，使用两种特征：语言特征对应于扬声器提到的人的方式;对应于显示中名称的上下文的结构特征。我们表明这两组功能都是互补的，并且可以在与人存在标签注释的电视节目语料库上实现良好的结果。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2011年||共4页
会议地点
作者
Bechet Frederic;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词

相似文献

外文文献
中文文献
专利

1. Study on a method for detecting failure of digital television transmission based on extraction of feature information of embedded invisible marker signals and statistical test [J] . Osamu Sugimoto, Ryoichi Kawada, Masahiro Wada, 電子情報通信学会技術研究報告. オフィスシステム . 2002,第313期

机译：基于嵌入式隐形标记信号特征信息提取和统计检验的数字电视传输故障检测方法研究
2. Study on a method for detecting failure of digital television transmission based on extraction of feature information of embedded invisible marker signals and statistical test [J] . Osamu Sugimoto, Ryoichi Kawada, Masahiro Wada, 電子情報通信学会技術研究報告. 画像工学. Image Engineering . 2002,第315期

机译：基于嵌入式隐形标记信号特征信息提取和统计检验的数字电视传输故障检测方法研究
3. Study on a method for detecting failure of digital television transmission based on extraction of feature information of embedded invisible marker signals and statistical test [J] . Osamu Sugimoto, Ryoichi Kawada, Masahiro Wada, 電子情報通信学会技術研究報告. オフィスシステム . 2002,第313期

机译：基于嵌入式无形标记信号的特征信息和统计测试检测数字电视传输失效方法的研究
4. Detecting person presence in TV shows with linguistic and structural features [C] . Bechet Frederic IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP . 2012

机译：检测具有语言和结构特征的电视节目中的人身
5. A Comparison of the Situational and Linguistic Features of High-Profile Criminal Trials and TV Series Courtroom Trials [D] . Chen, Meishan. 2018

机译：备受瞩目的刑事审判和电视连续剧法庭审判的情境和语言特征比较
6. Structural features of free N-glycans occurring in plants and functional features of de-N-glycosylation enzymes ENGase and PNGase: the presence of unusual plant complex type N-glycans [O] . Megumi Maeda, Yoshinobu Kimura 2014

机译：植物中游离N-聚糖的结构特征和de-N-糖基化酶ENGase和PNGase的功能特征：存在异常植物复合型N-聚糖
7. DETECTING PERSON PRESENCE IN TV SHOWS WITH LINGUISTIC AND STRUCTURAL FEATURES [O] . Frederic Bechet, Benoit Favre, Geraldine Damnati 2012

机译：具有语言和结构特征的电视节目中人的出现

Detecting person presence in TV shows with linguistic and structural features

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅