首页> 外文会议>Proceedings of the Second International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) >Performance analysis of SS based speech enhancement algorithms for ASR with Non-stationary Noisy Database-NOIZEUS
【24h】

Performance analysis of SS based speech enhancement algorithms for ASR with Non-stationary Noisy Database-NOIZEUS

机译:非平稳噪声数据库的基于SS的ASR语音增强算法性能分析-NOIZEUS

获取原文
获取原文并翻译 | 示例

摘要

In Human computer interface the correct translation by computer and exact perception of human depends on the quality of speech input to the machine. Hence speech enhancement technique is very essential in any HCI technique. In this paper we present an algorithm for speech enhancement on the non-stationary noisy database (NOIZEUES). The speech enhancement technique, Spectral Subtraction technique falls under the category of methods based on short time spectral amplitude (or power) estimate. This algorithm is found to be very simple and computationally efficient. Various modifications are made in the spectral subtraction technique and therefore spectral over subtraction, improved spectral over subtraction, iterative spectral over subtraction and multiband spectral subtraction techniques are derived. In this paper basic performance of spectral subtraction, spectral over subtraction and improved spectral over subtraction technique and SS with cross spectral terms with non-stationary noisy database is discussed. It is observed that algorithm works effectively to reduce additive noise present in the noisy speech signal. Limitations of spectral subtraction such as dependence on VAD accuracy and musical noise are described in the proposed paper. All listed algorithms are executed on available database experimentation results of speech and enhanced speech are provided in result section of this paper. Also numerical results are displayed graphically.
机译:在人机界面中,计算机的正确翻译和对人的准确感知取决于输入到机器的语音质量。因此,语音增强技术在任何HCI技术中都是非常重要的。在本文中,我们提出了一种用于非平稳噪声数据库(NOIZEUES)上语音增强的算法。语音增强技术,频谱减法技术属于基于短时频谱幅度(或功率)估计的方法类别。发现该算法非常简单并且计算效率高。在频谱减法技术中进行了各种修改,因此得出了频谱减法,改进频谱减法,迭代频谱减法和多频带频谱减法技术。本文讨论了光谱减法,光谱加减法和改进的光谱加减法技术以及具有非平稳噪声数据库的带有互谱项的SS的基本性能。观察到该算法有效地减少了在有声语音信号中存在的加性噪声​​。频谱减法的局限性,例如对VAD精度和音乐噪声的依赖性,在本文中有所描述。所有列出的算法都在语音的可用数据库实验结果上执行,本文的结果部分提供了增强语音。数值结果也以图形方式显示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号