Published in: 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel

Dominant speaker identification for multipoint videoconferencing



Abstract

A multipoint conference is an efficient and cost-effective substitute for a face-to-face meeting. It involves three or more participants in separate locations, each using a single microphone and camera. Routing and processing the audiovisual information places heavy demands on the network, which creates a need to reduce the amount of information flowing through the system. One solution is to identify the dominant speaker and partially discard information originating from non-active participants. We propose a novel method for dominant speaker identification that uses speech activity information from time intervals of different lengths. Compared with other speaker selection methods, experimental results demonstrate a reduction in the number of false speaker switches and improved robustness to transient audio interference.
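
The idea of combining speech activity over intervals of different lengths can be pictured with a small sketch. This is not the algorithm from the paper, whose details the abstract does not give. The class name `DominantSpeakerSelector`, the window lengths, the thresholds, and the switching rule are all assumptions chosen for illustration, and the sketch assumes a binary voice-activity flag per participant and frame is already available.

```python
from collections import deque


class DominantSpeakerSelector:
    """Toy multi-scale selector (hypothetical; not the paper's method)."""

    def __init__(self, n_participants, windows=(5, 25, 100),
                 thresholds=(0.8, 0.6, 0.3)):
        # Window lengths (in frames) and thresholds are illustrative values.
        self.windows = windows
        self.thresholds = thresholds
        # One history of binary voice-activity flags per participant.
        self.history = [deque(maxlen=max(windows))
                        for _ in range(n_participants)]
        self.dominant = None  # no dominant speaker until someone qualifies

    @staticmethod
    def _activity(hist, length):
        """Fraction of active frames among the last `length` frames."""
        recent = list(hist)[-length:]
        # Dividing by `length` (not len(recent)) under-counts early frames,
        # so a participant needs enough history before qualifying.
        return sum(recent) / length

    def _qualifies(self, hist):
        """True if activity meets the threshold on every time scale."""
        return all(self._activity(hist, w) >= t
                   for w, t in zip(self.windows, self.thresholds))

    def update(self, vad_flags):
        """Feed one frame of per-participant flags; return the dominant index."""
        for hist, flag in zip(self.history, vad_flags):
            hist.append(1 if flag else 0)

        candidates = [i for i, h in enumerate(self.history)
                      if self._qualifies(h)]
        # Switch only when some participant qualifies and the current
        # dominant speaker does not; otherwise keep the current choice.
        if candidates and self.dominant not in candidates:
            self.dominant = max(
                candidates,
                key=lambda i: sum(self._activity(self.history[i], w)
                                  for w in self.windows))
        return self.dominant


# Usage: participant 0 talks, participant 1 makes a short noise, then talks.
selector = DominantSpeakerSelector(n_participants=2)
for _ in range(60):
    selector.update([True, False])
for _ in range(3):
    selector.update([False, True])   # brief burst from participant 1
print(selector.dominant)
```

Requiring activity on the short, medium, and long windows at once is what makes the sketch ignore the three-frame burst: the burst never reaches the short-window threshold, and with no qualifying challenger the current dominant speaker is kept.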
