Conference: International Conference on Computer Aided Systems Theory

Who is Really Talking? A Visual-Based Speaker Diarization Strategy

Abstract

Speaker activity at the Canary Islands Parliament is recorded and later annotated manually. This task can be modelled as a diarization problem, that is, automatically annotating who is speaking and when. In this paper, we propose using the visual cue to solve the diarization task. This approach requires detecting individuals, determining which one is speaking, and extracting features for matching. To test the performance of our proposal, we evaluate four different strategies based on visual shot features.