
The Voice Source in Speech Production: Data, Analysis and Models.



Abstract

Analysis of the voice source with respect to voice quality is essential to understanding the human speech production system, and can lead to better speech modeling across a wide range of applications. However, because of the position of the vocal folds, analysis of the source is often hampered by the lack of direct observations with which to calibrate algorithms.

In this dissertation, two approaches to voice source and voice quality analysis were pursued. In the first approach, the source waveform was extracted by analyzing glottal area waveforms from high-speed imaging of the vocal folds. These direct observations led to the development of a new source model, which is more accurate than existing models. A codebook search technique was then proposed to estimate the source signal from the acoustic data. Results were promising for a number of model parameters, such as the open quotient and the speed of opening. However, error analysis showed that the algorithm required reasonable formant-frequency constraints, which may be difficult to obtain automatically in some cases.

In the second approach, voice source related measures were used in three voice quality applications: voice source analysis, automatic gender classification and prosody analysis. In voice source analysis, acoustic measures were examined in the context of the voice source model parameters obtained from model-fitting the glottal area waveforms. Results showed that correlations could be made between model parameters and the related acoustic measures, such as the asymmetry coefficient and harmonic-to-noise ratio measures. It was also shown that the model parameters and related acoustic measures were affected by the type of voice quality (pressed, normal and breathy). In gender classification, voice source related measures were found to be more helpful for younger (10-14 year old) speakers, for whom traditional pitch and formant frequency features were less useful. Analysis of prosody showed that, among other things, features correlated with pitch accents were not necessarily centered on the target syllable, and depended on the position of other prosodic events.

Bibliographic record

  • Author

    Shue, Yen-Liang

  • Author affiliation

    University of California, Los Angeles

  • Degree-granting institution: University of California, Los Angeles
  • Subject: Language Linguistics; Engineering Electronics and Electrical
  • Degree: Ph.D.
  • Year: 2010
  • Pages: 189 p.
  • Total pages: 189
  • Original format: PDF
  • Language of text: English
