Robust automatic video-conferencing with multiple cameras and microphones

Abstract

An automatic video-conferencing system is proposed which employs acoustic source localization, video face tracking and pose estimation, and multi-channel speech enhancement. The video portion of the system tracks talkers by utilizing source motion, contour geometry, color data and simple facial features. Decisions involving which camera to use are based on an estimate of the head's gazing angle. This head pose estimation is achieved using a very general head model which employs hairline features and a learned network classification procedure. Finally, a wavelet microphone array technique is used to create an enhanced speech waveform to accompany the recorded video signal. The system presented in this paper is robust to both visual clutter (e.g. ovals in the scene of interest which are not faces) and audible noise (e.g. reverberations and background noise).
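The abstract states that camera choice is driven by an estimate of the head's gazing angle but does not give the decision rule. As a minimal sketch only, assuming the simplest plausible rule (pick the camera whose bearing from the talker is closest to the estimated gaze direction), the selection step might look like the following. The function names, the degree-based angle convention, and the nearest-angle rule are all illustrative assumptions, not the authors' method.

```python
def angular_difference(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees (0 to 180)."""
    diff = (a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)


def select_camera(gaze_deg, camera_bearings_deg):
    """Return the index of the camera the talker is most nearly facing.

    gaze_deg: estimated head gaze direction, in degrees (hypothetical convention).
    camera_bearings_deg: bearing from the talker to each camera, same convention.
    Assumption: the best view is the camera with the smallest angular offset
    from the gaze direction; the paper may use a different criterion.
    """
    if not camera_bearings_deg:
        raise ValueError("at least one camera is required")
    return min(
        range(len(camera_bearings_deg)),
        key=lambda i: angular_difference(gaze_deg, camera_bearings_deg[i]),
    )
```

For example, with cameras at bearings 0, 90, 180 and 270 degrees, a gaze estimate of 350 degrees would select the first camera, because wrap-around is handled by `angular_difference`. A practical system would likely also smooth the gaze estimate over time to avoid rapid camera switching, which this sketch does not attempt.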
