首页> 外文会议>International Conference on Information, Communications and Signal Processing >Audiovisual speaker localization in medium smart meeting room
【24h】

Audiovisual speaker localization in medium smart meeting room

机译:中等智能会议室的视听演讲人定位

获取原文

摘要

The issue of automatic selection of the current active speaker among more than thirty participants located in the medium-sized meeting room is considered. Techniques of video tracking and sound source localization are implemented for recording AVI files of speaker remarks in the developed smart meeting room. Video processing of streams from five cameras serves for registration of participants in fixed chair positions, tracking main speaker based on histogram comparison and AdaBoosted cascade classifier for face detection. Multichannel sound source localization based on GCC-PHAT method is used for estimation of the speaker position by four microphone arrays. In the 18dB SNR case the sound source localization rate was about 97% and fine RMSE was lower 0.23 m.
机译:考虑了位于中等大型会议室的30多个参与者中当前活跃扬声器自动选择的问题。 视频跟踪和声源定位的技术用于在开发的智能会议室中录制扬声器备注的AVI文件。 从五个摄像机的流式流式处理用于固定椅子位置的参与者的登记,基于直方图比较和Adaboosted级联分类器进行脸部检测的主扬声器。 基于GCC-PHAT方法的多通道声源定位用于通过四个麦克风阵列估计扬声器位置。 在18dB的SNR情况下,声源定位速率约为97%,细微的RMSE为0.23米。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号