首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Determining Mixing Parameters From Multispeaker Data Using Speech-Specific Information
【24h】

Determining Mixing Parameters From Multispeaker Data Using Speech-Specific Information

机译:使用特定于语音的信息从多扬声器数据确定混音参数

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose an approach for processing multispeaker speech signals collected simultaneously using a pair of spatially separated microphones in a real room environment. Spatial separation of microphones results in a fixed time-delay of arrival of speech signals from a given speaker at the pair of microphones. These time-delays are estimated by exploiting the impulse-like characteristic of excitation during speech production. The differences in the time-delays for different speakers are used to determine the number of speakers from the mixed multispeaker speech signals. There is difference in the signal levels due to differences in the distances between the speaker and each of the microphones. The differences in the signal levels dictate the values of the mixing parameters. Knowledge of speech production, especially the excitation source characteristics, is used to derive an approximate weight function for locating the regions specific to a given speaker. The scatter plots of the weighted and delay-compensated mixed speech signals are used to estimate the mixing parameters. The proposed method is applied on the data collected in actual laboratory environment for an underdetermined case, where the number of speakers is more than the number of microphones. Enhancement of speech due to a speaker is also examined using the information of the time-delays and the mixing parameters, and is evaluated using objective measures proposed in the literature.
机译:在本文中,我们提出了一种在实际房间环境中使用一对空间分离的麦克风同时处理多扬声器语音信号的方法。麦克风的空间分离导致语音信号从给定扬声器到达一对麦克风的时间延迟固定。这些时间延迟是通过利用语音产生过程中的类似脉冲的激励特性来估计的。使用不同扬声器的时间延迟差异来确定来自混合多扬声器语音信号的扬声器数量。由于扬声器和每个麦克风之间的距离不同,信号电平也会有所不同。信号电平的差异决定了混合参数的值。语音产生的知识,尤其是激励源的特性,可用于得出近似权重函数,以定位特定于给定说话者的区域。加权和延迟补偿的混合语音信号的散点图用于估计混合参数。在扬声器数量大于麦克风数量不确定的情况下,将所提出的方法应用于在实际实验室环境中收集的数据。还使用时间延迟和混合参数的信息来检查由于说话者引起的语音增强,并使用文献中提出的客观措施对其进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号