Digital speech processing in the context of a human auditory model.

机译：在人类听觉模型中进行数字语音处理。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Digital speech processing in the context of a digital hearing model can improve the subjective quality of the speech processing algorithms. This subjective quality is a measure of how "good" the processed speech sounds to the listener. Subjective quality can be measured by paired comparison tests where subjects are asked to choose between two stimuli the one that sounds the best. This dissertation proposes that if digital speech processing is performed on speech that has been preprocessed using a digital hearing model, the resulting speech, after undoing the preprocessing of the digital hearing model, will sound "better" as measured by subjective quality evaluations, whether or not standard objective distortion measures indicate otherwise.;This dissertation proposes a digital hearing model for application in digital speech processing. This hearing model approximates the perception of intensity. Two digital processing algorithms were used to validate the claims of this dissertation. The first was spectral subtraction and the second was subband vector quantization. The results obtained from the subjective quality evaluations demonstrated evidence in support of the hypothesis of this dissertation. There was a 90% preference for coding in the perceptual domain for magnitude compression of 58:1 through 176:1 and a preference above 70% for noise suppression of speech corrupted by additive Gaussian noise of 18 dB and lower signal-to-noise ratios.

机译：在数字听力模型的上下文中，数字语音处理可以提高语音处理算法的主观质量。这种主观质量是对处理后的语音对听众的“良好”程度的一种度量。可以通过配对比较测试来衡量主观质量，在这种比较测试中，要求受试者在两种刺激中选择一个听起来最好的刺激。本文提出，如果对已经使用数字听力模型进行预处理的语音进行数字语音处理，则在取消数字听力模型的预处理后，所得语音在通过主观质量评估来衡量时听起来“更好”，无论是没有标准的客观失真测量方法表明相反的情况。本文提出了一种用于数字语音处理的数字听力模型。该听力模型近似强度的感知。两种数字处理算法被用来验证本文的权利要求。第一个是频谱相减，第二个是子带矢量量化。从主观质量评估中获得的结果证明了支持本文假设的证据。对于在58％至176：1的幅度压缩中在感知域中进行编码，有90％的偏好，对于在18dB的附加高斯噪声和较低信噪比的情况下破坏的语音的噪声抑制，偏好高于70％。

著录项

作者
Christiansen, Mark Wesley.;
展开▼
作者单位

Brigham Young University.;

展开▼
授予单位 Brigham Young University.;
学科 Electrical engineering.;Audiology.
学位 Ph.D.
年度 1990
页码 113 p.
总页数 113
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Context modulates processing of speech sounds in the right auditory cortex of human subjects. [J] . Kujala A, Alho K, Valle S, Neuroscience Letters: An International Multidisciplinary Journal Devoted to the Rapid Publication of Basic Research in the Brain Sciences . 2002 ,第2期

机译：上下文可调节人类受试者右听皮层中语音的处理。
2. Auditory-like filterbank: An optimal speech processor for efficient human speech communication [J] . PRASANTA KUMAR GHOSH, LOUIS M GOLDSTEIN, SHRIKANTH S NARAYANAN Sadhana . 2011 ,第5期

机译：类似于听觉的滤波器库：用于高效人类语音通信的最佳语音处理器
3. Auditory-like filterbank: An optimal speech processor for efficient human speech communication [J] . PRASANTA KUMAR GHOSH, LOUIS M. GOLDSTEIN, SHRIKANTH S. NARAYANAN Sadhana: Academy Proceedings in Engineering Science . 2011 ,第5期

机译：类似于听觉的滤波器库：用于高效人类语音通信的最佳语音处理器
4. Accurate speech segmentation by mimicking human auditory processing [C] . King Sarah, Hasegawa-Johnson Mark IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：通过模仿人类听觉处理来进行准确的语音分割
5. Dynamic and Adaptive Processing of Speech in the Human Auditory Cortex [D] . ?Khalighinejad, Bahar 2020

机译：在人类听觉皮层的讲话和动态自适应处理
6. Auditory cortical micro-networks show differential connectivity during voice and speech processing in humans [O] . Florence Steiner, Marine Bobin, Sascha Frühholz 2021

机译：听觉皮质微网络在人类中的语音和语音处理期间显示差分连接
7. Auditory-like filterbank: An optimal speech processor for efficient human speech communication [O] . Prasanta Kumar Ghosh, Louis M Goldstein, Shrikanth S Narayanan 2015

机译：听觉式滤波器组：用于高效人类语音通信的最佳语音处理器

Digital speech processing in the context of a human auditory model.

摘要

著录项

相似文献

相关主题

期刊订阅