Time Delay Estimation of Reverberant Meeting Speech: On the Use of Multichannel Linear Prediction



Abstract

Effective and efficient access to multiparty meeting recordings requires techniques for meeting analysis and indexing. Since meeting participants are generally stationary, speaker location information may be used to identify meeting events, e.g., to detect speaker changes. Time-delay estimation (TDE) based on cross-correlation of multichannel speech recordings is a common approach for deriving speech source location information. Recent research improved TDE by computing it from linear prediction (LP) residual signals obtained from LP analysis of each individual speech channel. This paper investigates the use of LP residuals for speech TDE, where the residuals are obtained by jointly modeling the multiple speech channels. Experiments conducted with a simulated reverberant room and with real room recordings show that jointly modeled LP better predicts the LP coefficients than LP applied to individual channels. The individually and jointly modeled LP exhibit similar TDE performance, and both outperform TDE on the speech signal alone, especially with the real recordings.
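The baseline pipeline mentioned in the abstract (LP analysis of each individual channel, followed by cross-correlation of the residuals) can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses autocorrelation-method LP with a hypothetical order of 10, plain (unweighted) cross-correlation, and the function names `lp_residual` and `estimate_delay` are invented for this example. The jointly modeled multichannel LP studied in the paper is not reproduced here.

```python
import numpy as np


def lp_residual(x, order=10):
    """LP residual of one channel (autocorrelation method; order is an assumption)."""
    x = np.asarray(x, dtype=float)
    # Autocorrelation at lags 0..order; zero lag sits at index len(x) - 1.
    full = np.correlate(x, x, mode="full")
    mid = len(x) - 1
    r = full[mid:mid + order + 1]
    # Normal equations R a = r[1:], with R the Toeplitz autocorrelation matrix.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)  # small ridge for numerical stability
    a = np.linalg.solve(R, r[1:])
    # Prediction x_hat[n] = sum_k a[k] * x[n - 1 - k]; the residual is the error.
    pred = np.convolve(x, np.concatenate(([0.0], a)))[:len(x)]
    return x - pred


def estimate_delay(ref, sig, max_lag):
    """Delay of sig relative to ref in samples (positive when sig lags ref)."""
    ref = np.asarray(ref, dtype=float)
    sig = np.asarray(sig, dtype=float)
    # c[k] = sum_n sig[n + k] * ref[n]; zero lag sits at index len(ref) - 1.
    full = np.correlate(sig, ref, mode="full")
    zero = len(ref) - 1
    window = full[zero - max_lag:zero + max_lag + 1]
    return int(np.argmax(window)) - max_lag


if __name__ == "__main__":
    # Synthetic two-channel example: an AR(2) "speech-like" signal and a delayed copy.
    rng = np.random.default_rng(0)
    e = rng.standard_normal(4000)
    s = np.zeros_like(e)
    for n in range(len(e)):
        s[n] = e[n]
        if n >= 1:
            s[n] += 1.3 * s[n - 1]
        if n >= 2:
            s[n] -= 0.6 * s[n - 2]
    delayed = np.concatenate((np.zeros(7), s[:-7]))
    print(estimate_delay(lp_residual(s), lp_residual(delayed), max_lag=50))
```

The residual is used in place of the raw waveform because it is closer to a white excitation signal, which tends to give a sharper cross-correlation peak. With two microphones a known distance apart, the estimated delay in samples divided by the sampling rate gives the time difference of arrival from which a source direction could be derived.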


Bibliographic record

Source: International Conference on Signal Image Technologies Internet Based Systems, 2008, 7 pages
Authors: E. Cheng; I. S. Burnett; C. Ritz
Original format: PDF
Chinese Library Classification: TP391.41-53

Similar literature


1. A class of multichannel sparse linear prediction algorithms for time delay estimation of speech sources [J]. Signal Processing. 2020, issue Apra
2. Processing of reverberant speech for time-delay estimation [J]. Yegnanarayana B., Prasanna S.R.M., Duraiswami R. IEEE Transactions on Speech and Audio Processing. 2005, issue 6
3. Japanese speech intelligibility estimation and prediction using objective intelligibility indices under noisy and reverberant conditions [J]. Kobayashi Yosuke, Kondo Kazuhiro. Applied Acoustics. 2019, issue DECa
4. Time Delay Estimation of Reverberant Meeting Speech: On the Use of Multichannel Linear Prediction [C]. E. Cheng, I. S. Burnett, C. Ritz. International Conference on Signal Image Technologies Internet Based Systems. 2008
5. Reverberant speech enhancement using linear prediction residual signal [D]. Joshi, Bhavin Bharat. 2005
6. Intelligibility and Clarity of Reverberant Speech: Effects of Wide Dynamic Range Compression Release Time and Working Memory [O]. Paul N. Reinhart, Pamela E. Souza
7. Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction [O]. Cheng, Eva, Burnett, I., Ritz, Christian. 2007
8. Estimate-Maximize Algorithms for Multichannel Time Delay and Signal Estimation [R]. Segal, M., Weinstein, E., Musicus, B. R. 1991
