首页> 美国卫生研究院文献>The Journal of the Acoustical Society of America >A psychoacoustic method to find the perceptual cues of stop consonants in natural speech
【2h】

A psychoacoustic method to find the perceptual cues of stop consonants in natural speech

机译:一种在自然语言中找到停止辅音的感知线索的心理声学方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Synthetic speech has been widely used in the study of speech cues. A serious disadvantage of this method is that it requires prior knowledge about the cues to be identified in order to synthesize the speech. Incomplete or inaccurate hypotheses about the cues often lead to speech sounds of low quality. In this research a psychoacoustic method, named three-dimensional deep search (3DDS), is developed to explore the perceptual cues of stop consonants from naturally produced speech. For a given sound, it measures the contribution of each subcomponent to perception by time truncating, highpass∕lowpass filtering, or masking the speech with white noise. The AI-gram, a visualization tool that simulates the auditory peripheral processing, is used to predict the audible components of the speech sound. The results are generally in agreement with the classical studies that stops are characterized by a short duration burst followed by a F2 transition, suggesting the effectiveness of the 3DDS method. However, it is also shown that ∕ba∕ and ∕pa∕ may have a wide band click as the dominant cue. F2 transition is not necessary for the perception of ∕ta∕ and ∕ka∕. Moreover, many stop consonants contain conflicting cues that are characteristic of competing sounds. The robustness of a consonant sound to noise is determined by the intensity of the dominant cue.
机译:合成语音已广泛用于语音提示的研究中。该方法的严重缺点在于,它需要先确定有关线索的知识,才能合成语音。关于提示的不完整或不正确的假设通常会导致语音质量低下。在这项研究中,一种被称为三维深度搜索(3DDS)的心理声学方法被开发出来,以探索自然产生的语音中停止辅音的感知线索。对于给定的声音,它通过时间截断,高通-低通滤波或用白噪声掩盖语音来测量每个子组件对感知的贡献。 AI-gram是一种模拟听觉外围处理的可视化工具,用于预测语音的可听成分。该结果通常与经典研究一致,该研究的特点是停止时间短,然后出现F2过渡,这表明3DDS方法的有效性。但是,也表明∕ ba ∕和∕ pa ∕可能具有宽带点击作为主要提示。 F2过渡对于∕ ta ∕和kaka的感知不是必需的。此外,许多停止辅音包含相互竞争的线索,这是竞争声音的特征。辅音对噪声的鲁棒性取决于主导提示的强度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号