【24h】

Speech Enhancement by Spectral Component Selection

机译:通过频谱分量选择增强语音

获取原文

摘要

Most algorithms for speech enhancement in spectral domain focus on the acquisition of an estimator of the clean speech parameter, such as spectrum or amplitude. Enhanced speech quality with these methods mainly depends upon the accuracy of the estimator. When the signal-to-noise ratio (SNR) becomes lower, e.g., less than 3DB, the enhanced speech often shows unsatisfied quality . Here, we propose a new method for speech enhancement in spectral domain. It bases on our view that speech characteristic is perceived mainly by part of the spectral components which have higher local (or instantaneous) SNRs. These components have a special importance on speech SNR enhancement and speech understand. Identifying these components in the noisy spectrum with a decision system and constructing enhanced speech just by the same nosy amplitude and phase, the result is similar to those popular algorithms. All needed to do is a local SNR decision for each spectral component. The threshold for the decision is derived and two popular methods for local SNR (LSNR) estimation are suggested. It shows better performance under lower SNR signals. Though the number of these kind of spectral components varies with different signal SNR, any algorithms which can pick up more than 90percent of it is sufficient to match any existing methods. Primary considerations and results are shown.
机译:用于频谱域中语音增强的大多数算法都集中在获取干净语音参数(例如频谱或幅度)的估计量。这些方法增强的语音质量主要取决于估计器的准确性。当信噪比(SNR)变得较低(例如小于3DB)时,增强的语音通常显示出不满意的质量。在这里,我们提出了一种新的频谱域语音增强方法。它基于我们的观点,语音特征主要是由具有较高局部(或瞬时)SNR的部分频谱分量感知的。这些组件对语音SNR增强和语音理解特别重要。使用决策系统识别噪声频谱中的这些成分,并通过相同的噪声幅度和相位构造增强的语音,其结果类似于那些流行的算法。所有要做的就是为每个频谱分量确定一个本地SNR。得出决策的阈值,并提出了两种流行的本地SNR(LSNR)估计方法。它在较低的SNR信号下显示出更好的性能。尽管这些频谱分量的数量随信号SNR的不同而变化,但是任何能够吸收90%以上的算法都足以匹配任何现有方法。显示了主要注意事项和结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号