首页> 外文会议>Conference on multimedia storage and archiving systems >Audio-video feature correlation: faces and speech
【24h】

Audio-video feature correlation: faces and speech

机译:音视频特征关联:面部和语音

获取原文
获取外文期刊封面目录资料

摘要

Abstract: This paper presents a study of the correlation of features automatically extracted from the audio stream and the video stream of audiovisual documents. In particular, we were interested in finding out whether speech analysis tools could be combined with face detection methods, and to what extend they should be combined. A generic audio signal partitioning algorithm as first used to detect Silence/Noise/Music/Speech segments in a full length movie. A generic object detection method was applied to the keyframes extracted from the movie in order to detect the presence or absence of faces. The correlation between the presence of a face in the keyframes and of the corresponding voice in the audio stream was studied. A third stream, which is the script of the movie, is warped on the speech channel in order to automatically label faces appearing in the keyframes with the name of the corresponding character. We naturally found that extracted audio and video features were related in many cases, and that significant benefits can be obtained from the joint use of audio and video analysis methods. !13
机译:摘要:本文研究了从视听文档的音频流和视频流中自动提取的特征之间的相关性。尤其是,我们有兴趣了解语音分析工具是否可以与面部检测方法结合使用,以及应将其扩展到何种程度。一种通用的音频信号分配算法,首先用于检测全长电影中的静音/噪声/音乐/语音片段。通用对象检测方法应用于从电影中提取的关键帧,以检测面部是否存在。研究了关键帧中面部的存在与音频流中相应语音之间的相关性。第三流(即电影的脚本)在语音通道上变形,以便自动用相应字符的名称标记出现在关键帧中的面孔。我们自然发现,提取的音频和视频特征在很多情况下都是相关的,并且可以通过联合使用音频和视频分析方法来获得显着的收益。 !13

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号