首页> 外文会议>Advances in Multimedia Information Processing - PCM 2008 >Speaker Clustering Aided by Visual Dialogue Analysis
【24h】

Speaker Clustering Aided by Visual Dialogue Analysis

机译:视觉对话分析辅助说话人聚类

获取原文
获取原文并翻译 | 示例

摘要

Speaker clustering aims to automatically cluster speech segments for each speaker. By speaker clustering, we can discover main cast list from long videos and retrieve their relevant video clips for efficient browsing. In this paper, we propose a dialogue supervised speaker clustering method, which makes use of the visual dialogue analysis results to improve the performance of speaker clustering. Compared with the traditional approach based only on acoustic features, the dialogue supervised speaker clustering approach can get significant improvement on the clustering result for movie and TV series.
机译:说话者聚类旨在自动为每个说话者聚类语音片段。通过演讲者聚类,我们可以从长视频中发现主要演员表,并检索其相关视频剪辑以进行有效浏览。本文提出了一种对话监督的说话人聚类方法,该方法利用视觉对话分析结果来提高说话人聚类的性能。与仅基于声学特征的传统方法相比,对话监督的说话人聚类方法可以大大改善电影和电视剧的聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号