Digital speech processing in the context of a human auditory model.

Abstract

Performing digital speech processing in the context of a digital hearing model can improve the subjective quality of speech processing algorithms. Subjective quality is a measure of how "good" the processed speech sounds to the listener. It can be measured by paired comparison tests, in which subjects are asked to choose which of two stimuli sounds better. This dissertation proposes that if digital speech processing is performed on speech that has been preprocessed using a digital hearing model, the resulting speech, after the preprocessing of the digital hearing model is undone, will sound "better" as measured by subjective quality evaluations, whether or not standard objective distortion measures indicate otherwise.

This dissertation proposes a digital hearing model for application in digital speech processing. The hearing model approximates the perception of intensity. Two digital processing algorithms were used to validate the claims of the dissertation: spectral subtraction and subband vector quantization. The results of the subjective quality evaluations support the hypothesis. There was a 90% preference for coding in the perceptual domain at magnitude compression of 58:1 through 176:1, and a preference above 70% for noise suppression of speech corrupted by additive Gaussian noise at signal-to-noise ratios of 18 dB and lower.
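
The abstract describes a three-step pipeline: map the speech into a perceptual domain, run the processing algorithm there, then undo the mapping. The sketch below illustrates that order of operations for spectral subtraction. The abstract does not give the dissertation's hearing model, so a power-law compression of spectral magnitude stands in for it here; the exponent 0.3, the non-overlapping 256-sample frames, and all function names are assumptions made for illustration, not the author's method.

```python
import numpy as np

# Assumed compression exponent; the dissertation's actual model is not given.
EXPONENT = 0.3


def to_perceptual(mag, exponent=EXPONENT):
    """Stand-in hearing model: power-law compression of magnitude."""
    return np.power(mag, exponent)


def from_perceptual(perc, exponent=EXPONENT):
    """Undo the stand-in hearing model."""
    return np.power(perc, 1.0 / exponent)


def frame_spectra(signal, frame_len):
    """Split into non-overlapping frames and take the real FFT of each."""
    n_frames = len(signal) // frame_len
    frames = np.reshape(signal[: n_frames * frame_len], (n_frames, frame_len))
    return np.fft.rfft(frames, axis=1)


def subtract_in_perceptual_domain(noisy, noise_only, frame_len=256):
    """Spectral subtraction applied to compressed magnitudes.

    noise_only is a noise-only recording used to estimate the noise spectrum.
    """
    noisy_spec = frame_spectra(noisy, frame_len)
    noise_spec = frame_spectra(noise_only, frame_len)

    # Step 1: move magnitudes into the (assumed) perceptual domain.
    noisy_perc = to_perceptual(np.abs(noisy_spec))
    noise_perc = to_perceptual(np.abs(noise_spec)).mean(axis=0)

    # Step 2: subtract the average noise estimate, flooring at zero.
    cleaned_perc = np.maximum(noisy_perc - noise_perc, 0.0)

    # Step 3: undo the mapping and reuse the noisy phase for resynthesis.
    cleaned_mag = from_perceptual(cleaned_perc)
    cleaned_spec = cleaned_mag * np.exp(1j * np.angle(noisy_spec))
    return np.fft.irfft(cleaned_spec, n=frame_len, axis=1).ravel()
```

Subtracting in a compressed domain removes more energy from strong components than ordinary magnitude-domain subtraction would, so this sketch should not be expected to score well on objective distortion measures. It only shows where the forward and inverse mappings sit relative to the processing step.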

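Bibliographic record

  • Author

    Christiansen, Mark Wesley

  • Author affiliation

    Brigham Young University

  • Degree-granting institution: Brigham Young University
  • Subject: Electrical engineering; Audiology
  • Degree: Ph.D.
  • Year: 1990
  • Pages: 113 p.
  • Total pages: 113
  • Original format: PDF
  • Language of text: English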