首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Towards Interpretive Models for 2-D Processing of Speech
【24h】

Towards Interpretive Models for 2-D Processing of Speech

机译:走向语音二维处理的解释模型

获取原文
获取原文并翻译 | 示例

摘要

This paper considers 2-D Fourier analysis of local time-frequency regions of wideband spectrograms, a representation we refer to as the wideband Grating Compression Transform (WGCT). We develop frequency-dependent, speech-production-based models of speech signals for the WGCT, building on previous work in modeling narrowband-based GCT representations (NGCT). Comparisons show important distinctions, including dual behavior, between the wideband and narrowband models, and distinct ways in which vocal tract/formant content is distributed redundantly throughout the NGCT and WGCT spaces. Our results motivate a novel taxonomy of speech-signal behavior as an interpretative framework (i.e., in relation to speech-production characteristics) for 2-D processing of speech using the GCT, as well as for other 2-D approaches and time-frequency distributions such as the auditory spectrogram. We demonstrate and evaluate the ability of the model to represent real speech content through demodulation techniques for analysis/synthesis of wideband spectrograms. Finally, we develop a co-channel speaker separation method, using prior and estimated pitch information, based on the WGCT, as well as through fusion with the NGCT. These GCT-based separation systems are compared against and further fused with a reference sinusoidal separation system.
机译:本文考虑了宽带频谱图局部时频区域的二维傅立叶分析,我们将其表示为宽带光栅压缩变换(WGCT)。我们基于对基于窄带的GCT表示(NGCT)建模的先前工作,为WGCT开发了基于频率,基于语音产生的语音信号模型。比较显示宽带模型和窄带模型之间的重要区别,包括双重行为,以及声道/共振峰内容在NGCT和WGCT空间中冗余分布的独特方式。我们的研究结果激发了一种新颖的语音信号行为分类法,作为使用GCT进行语音2-D处理以及其他2-D方法和时频的解释框架(即,与语音产生特性有关)分布,如听觉频谱图。我们通过解调技术分析/合成宽带频谱图,演示并评估了该模型代表真实语音内容的能力。最后,我们基于WGCT以及与NGCT的融合,使用先验和估计的音高信息,开发了同声道扬声器分离方法。将这些基于GCT的分离系统与参考正弦分离系统进行比较并进一步融合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号