首页> 外文期刊>Journal of signal processing systems for signal, image, and video technology >An Adaptive Non Reference Anchor Array Framework for Audio Retrieval in Teleconferencing Environment
【24h】

An Adaptive Non Reference Anchor Array Framework for Audio Retrieval in Teleconferencing Environment

机译:电话会议环境中用于音频检索的自适应非参考锚阵列框架

获取原文
获取原文并翻译 | 示例

摘要

In this paper, an adaptive framework for audio retrieval in live teleconferencing environments with multiple participants is proposed. The framework uses a non reference anchor array (NRA) to capture the interfering speech sources, in addition to the primary array that captures the speech source of interest (SOI). A linearly constrained-minimum variance (LC-MV) beamformer is used herein such that the signal coming from the look direction is preserved while interferences coming from the non look direction are nulled. Additionally, the reverberant component of the speech acquired by this framework is removed by a novel method that uses the linear prediction (LP) residual cepstrum. This method does not require the computation of the acoustic impulse response (AIR) of the teleconferencing room and hence is computationally efficient. The NRA framework is therefore able to remove correlated noise coming from the direction of the SOI and also dereverberating the noise free signal. The performance of the proposed framework is evaluated by conducting experiments on clean speech acquisition from distant microphone arrays. Experiments on distant speech recognition are also conducted using the TIMIT and MONC databases. Experimental results obtained from the proposed framework indicate a reasonable improvement over correlation, subspace and standard minimum variance beam-forming methods. The application of the framework in audio retrieval in a live teleconferencing environment with multiple participants is also discussed.
机译:本文提出了一种具有多个参与者的实时电话会议环境中的音频检索自适应框架。除了捕获感兴趣的语音源(SOI)的主要阵列之外,该框架还使用非参考锚点阵列(NRA)捕获干扰的语音源。这里使用线性约束最小方差(LC-MV)波束形成器,使得来自视向的信号被保留,而来自非视向的干扰被消除。此外,通过使用线性预测(LP)残留倒谱的新颖方法,可以消除此框架获取的语音的混响分量。此方法不需要计算电话会议厅的声脉冲响应(AIR),因此计算效率很高。因此,NRA框架可以消除来自SOI方向的相关噪声,并且可以消除无噪声信号的干扰。通过对来自远距离麦克风阵列的干净语音进行实验,评估了所提出框架的性能。还使用TIMIT和MONC数据库进行了远程语音识别的实验。从提出的框架获得的实验结果表明,在相关性,子空间和标准最小方差波束形成方法方面有合理的改进。还讨论了该框架在具有多个参与者的实时电话会议环境中的音频检索中的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号