首页> 外文会议>International Conference on Signal Image Technologies Internet Based Systems >Time Delay Estimation of Reverberant Meeting Speech: On the Use of Multichannel Linear Prediction
【24h】

Time Delay Estimation of Reverberant Meeting Speech: On the Use of Multichannel Linear Prediction

机译:混响会议语音的时间延迟估计:关于多通道线性预测的使用

获取原文
获取外文期刊封面目录资料

摘要

Effective and efficient access to multiparty meeting recordings requires techniques for meeting analysis and indexing. Since meeting participants are generally stationary, speaker location information may be used to identify meeting events e.g., detect speaker changes. Time-delay estimation (TDE) utilizing cross-correlation of multichannel speech recordings is a common approach for deriving speech source location information. Recent research improved TDE by calculating TDE from linear prediction (LP) residual signals obtained from LP analysis on each individual speech channel. This paper investigates the use of LP residuals for speech TDE, where the residuals are obtained from jointly modeling the multiple speech channels. Experiments conducted with a simulated reverberant room and real room recordings show that jointly modeled LP better predicts the LP coefficients, compared to LP applied to individual channels. Both the individually and jointly modeled LP exhibit similar TDE performance, and outperform TDE on the speech alone, especially with the real recordings.
机译:对多方面会议录制的有效和有效访问需要进行分析和索引的技术。由于会议参与者通常是静止的,因此扬声器位置信息可用于识别会议事件,例如,检测扬声器更改。利用多通道语音记录的互相关的时间延迟估计(TDE)是用于导出语音源位置信息的常用方法。最近的研究通过从LP分析从LP分析获得的线性预测(LP)残留信号来改进TDE。本文研究了LP残差用于语音TDE,其中剩余物是从联合建模多个语音通道获得的。用模拟混响室和真正的房间录音进行的实验表明,与应用于各个通道的LP相比,联合建模的LP更好地预测LP系数。单独和共同建模的LP都表现出类似的TDE性能,并仅在语音上占TDE,尤其是真正的录音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号