首页> 美国卫生研究院文献>The Journal of the Acoustical Society of America >An ideal quantized mask to increase intelligibility and quality of speech in noise
【2h】

An ideal quantized mask to increase intelligibility and quality of speech in noise

机译:理想的量化掩码可提高噪声中语音的清晰度和质量

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Time-frequency (T-F) masks represent powerful tools to increase the intelligibility of speech in background noise. Translational relevance is provided by their accurate estimation based only on the signal-plus-noise mixture, using deep learning or other machine-learning techniques. In the current study, a technique is designed to capture the benefits of existing techniques. In the ideal quantized mask (IQM), speech and noise are partitioned into T-F units, and each unit receives one of N attenuations according to its signal-to-noise ratio. It was found that as few as four to eight attenuation steps (IQM4, IQM8) improved intelligibility over the ideal binary mask (IBM, having two attenuation steps), and equaled the intelligibility resulting from the ideal ratio mask (IRM, having a theoretically infinite number of steps). Sound-quality ratings and rankings of noisy speech processed by the IQM4 and IQM8 were also superior to that processed by the IBM and equaled or exceeded that processed by the IRM. It is concluded that the intelligibility and sound-quality advantages of infinite attenuation resolution can be captured by an IQM having only a very small number of steps. Further, the classification-based nature of the IQM might provide algorithmic advantages over the regression-based IRM during machine estimation.
机译:时频(T-F)遮罩代表功能强大的工具,可提高背景噪音中语音的清晰度。通过使用深度学习或其他机器学习技术,仅基于信号加噪声混合的精确估计,即可提供翻译相关性。在当前的研究中,一种技术旨在捕获现有技术的好处。在理想的量化掩码(IQM)中,语音和噪声被划分为T-F单元,每个单元根据其信噪比接收N个衰减之一。已经发现,与理想的二进制掩码(IBM,具有两个衰减步骤)相比,少至四到八个衰减步骤(IQM4,IQM8)提高了清晰度,并且与理想比率掩码(IRM,在理论上具有无限大)产生的清晰度相等。步骤数)。 IQM4和IQM8处理的声音质量等级和嘈杂语音的等级也优于IBM处理的声音等级和等级,等于或超过IRM处理的声音等级。可以得出结论,无限衰减分辨率的清晰度和声音质量优势可以通过仅具有非常少量步骤的IQM来捕获。此外,在机器估计过程中,IQM的基于分类的性质可能会提供优于基于回归的IRM的算法优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号