首页> 外文学位 >An MRI-based articulatory and acoustic study of American English liquid sounds /r/ and /l/.
【24h】

An MRI-based articulatory and acoustic study of American English liquid sounds /r/ and /l/.

机译:基于MRI的美式英语声音/ r /和/ l /的发音和声学研究。

获取原文
获取原文并翻译 | 示例

摘要

In American English, the liquid sounds /r/ and /l/ are the most articulatorily variable and complex sounds. They can be produced by several distinct types of tongue configurations and are the most troublesome sounds for children and nonnative English-speakers to learn. Better understanding of this many-to-one mapping between articulation and acoustics would be beneficial to other areas such as speech pathology, speaker verification, speech recognition and speech synthesis.;In this dissertation, two articulatory configurations for each liquid sound were studied (a "retroflex" /r/ vs. a "bunched" /r/, and a light /l/ vs. a dark /l/). Different from previous work on liquids, finite element analysis has been performed to obtain the acoustic responses of the three-dimensional (3-D) vocal tract models, which are based on volumetric magnetic resonance (MR) imaging. Area function models were derived based on the wave propagation property inside the vocal tract.;The retroflex /r/ and the bunched /r/ show similar patterns of F1-F3 but very different spacing between F4 and F5. The results from the formant acoustic sensitivity functions and simple-tube vocal tract models suggested that this F4/F5 difference can be explained largely by differences in whether the long cavity behind the palatal constriction acts as a half- or a quarter-wavelength resonator. For both the retroflex /r/ and the bunched /r/, F4 and F5 (along with F3 for the particular speakers studied in this research) come from the long back cavity. However, these formants are half wavelength resonances for the retroflex /r/, but quarter wavelength resonances for the bunched /r/.;While both the dark /l/ and the light /l/ have a linguo-alveolar contact and two lateral channels, they differ in the length of the linguo-alveolar contact and in the presence of the linguopalatal contacts caused by raising the sides of the tongue. Both have similar patterns in F1-F3, but differ in the number and locations of zeros in spectrum. For the dark /l/, only one zero occurs below 6 kHz and it is produced by the cross mode posterior to the linguo-alveolar contact. For the light /l/, three zeros below 6 kHz are produced by the asymmetrical channels, the supralingual cavity and the cross mode posterior to the linguo-alveolar contact. The results from two simple vocal tract models show that the lateral channels have to be asymmetrical with an effective length between 3-6 cm to get a zero in the region of F3-F5.;Based on the Buckeye database, the acoustic variability and discriminative power of liquids were studied with the mel-frequency band energy coefficients as acoustic parameter. Analysis of variance shows that the inter-speaker variability of /r/ is larger than any other phonemes except /sh/, /s/ and /zh/. On average, /r/ and /l/ have larger inter-speaker variability than any other broad phonetic class. The F-ratio averages of liquids are larger than glides, fricatives, affricates and stops, but smaller than nasals. The speaker identification experiments show that the ranking of the average discriminative power for liquids and other broad phonetic classes is: /r/ > Glides > /l/ > Affricates > Fricatives > Stops > Nasals > Vowels.
机译:在美式英语中,/ r /和/ l /液体声音是发音最易变且最复杂的声音。它们可以由几种不同类型的舌头形态产生,是儿童和非英语母语人士学习时最麻烦的声音。更好地理解发音与声学之间的多对一映射关系将有助于其他领域,例如语音病理学,说话者验证,语音识别和语音合成。;本文研究了每种液体声音的两种发音配置(a “ retroflex” / r /相对于“成束的” / r /,以及浅色/ l /相对于深色/ l /)。与先前在液体上的工作不同,已经进行了有限元分析来获得三维(3-D)声道模型的声学响应,该模型基于体积磁共振(MR)成像。根据声波在声道内的传播特性,推导了面积函数模型。Retroflex / r /和成束的/ r /表现出相似的F1-F3模式,但F4和F5之间的间距差别很大。共振峰声学灵敏度函数和简单管声道模型的结果表明,这种F4 / F5差异可以通过differences部狭窄的长腔充当半波长或四分之一波长谐振器的差异来很大程度上解释。对于Retroflex / r /和成束的/ r /,F4和F5(以及本研究中所研究的特定扬声器的F3)都来自长后腔。但是,这些共振峰是反折反射/ r /的一半波长共振,而对于成束的/ r /却是四分之一波长共振;虽然暗的/ l /和亮的/ l /都具有舌-肺泡接触和两个横向通道,它们的舌-牙槽间接触长度和舌高引起的舌pal接触存在差异。两者在F1-F3中具有相似的模式,但是频谱中零的数量和位置不同。对于暗的/ l /,在6 kHz以下仅出现一个零,这是由舌-肺泡接触后的交叉模产生的。对于光l / l,不对称通道,舌上腔和舌-肺泡接触后的交叉模产生低于6 kHz的三个零。两个简单声道模型的结果表明,横向通道必须是不对称的,有效长度在3-6厘米之间,才能在F3-F5区域获得零。基于Buckeye数据库,声学变异性和判别力以梅尔频带能量系数为声学参数研究了液体的功率。方差分析表明,/ r /的扬声器间差异大于/ sh /,/ s /和/ zh /以外的其他任何音素。平均而言,/ r /和/ l /的说话人间变异性大于任何其他广泛的语音分类。液体的F比平均值大于滑行,摩擦,跳动和停止,但小于鼻孔。说话人识别实验表明,液体和其他广泛语音类别的平均判别力排名为:/ r />滑音> / l />属语>摩擦语>音标>鼻音>元音。

著录项

  • 作者

    Zhou, Xinhui.;

  • 作者单位

    University of Maryland, College Park.;

  • 授予单位 University of Maryland, College Park.;
  • 学科 Language Linguistics.;Engineering Electronics and Electrical.;Engineering Mechanical.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 171 p.
  • 总页数 171
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号