Journal: IEEE Transactions on Audio, Speech, and Language Processing

Joint Time–Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement

Abstract

We develop the joint time-frequency segmentation algorithm, in which the wavelet packet coefficients of the analyzed speech signal are represented as tiles of a time-frequency representation adapted to the characteristics of the signal itself. The algorithm also decomposes the speech signal into transient and non-transient components: any block of wavelet packet coefficients whose tiling height is greater than or equal to its tiling width belongs to the transient component, and any other block belongs to the non-transient component. The transient component is selectively amplified and recombined with the original speech, and the energy of the resulting modified speech is adjusted to equal that of the original. The intelligibility of the original and modified speech was evaluated by 16 human listeners. Word recognition rates show that the modified speech significantly improves intelligibility in background noise, with absolute gains ranging from 10% at 0 dB to 27% at -30 dB.
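
The two rules stated in the abstract (the height-versus-width test for tiles, and the energy-matched recombination) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Tile` class, the function names, the `gain` parameter, and the use of plain lists of samples are all assumptions made for illustration, and the wavelet packet analysis that would produce the tiles and the transient signal is left out entirely.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class Tile:
    """Hypothetical time-frequency tile; units are illustrative only."""
    height: float  # extent of the tile along the frequency axis
    width: float   # extent of the tile along the time axis


def is_transient(tile: Tile) -> bool:
    # Rule from the abstract: height >= width means transient,
    # otherwise the tile belongs to the non-transient component.
    return tile.height >= tile.width


def energy(signal: list[float]) -> float:
    # Sum of squared samples.
    return sum(x * x for x in signal)


def enhance(original: list[float], transient: list[float], gain: float) -> list[float]:
    """Amplify the transient part, add it to the original, and rescale
    so the result has the same energy as the original.

    `gain` is an assumed parameter; the abstract does not give a value.
    """
    mixed = [o + gain * t for o, t in zip(original, transient)]
    mixed_energy = energy(mixed)
    if mixed_energy == 0.0:
        return mixed  # nothing to rescale
    scale = sqrt(energy(original) / mixed_energy)
    return [scale * x for x in mixed]
```

A call such as `enhance(speech, transient_part, gain=2.0)` would return a signal of the same length whose energy matches that of `speech`, assuming `transient_part` has already been extracted by some wavelet packet decomposition.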