The Voice Source in Speech Production: Data, Analysis and Models.

Abstract

Analysis of the voice source with respect to voice quality is essential to understanding the human speech production system, and can lead to better speech modeling across a wide range of applications. However, because of the position of the vocal folds, analysis of the source is often hampered by the lack of direct observations with which to calibrate algorithms.

In this dissertation, two approaches to voice source and voice quality analysis were pursued. In the first approach, the source waveform was extracted by analyzing the glottal area waveforms from high-speed imaging of the vocal folds. These direct observations led to the development of a new source model, which is more accurate than existing models. A codebook search technique was then proposed to estimate the source signal from the acoustic data. Results were promising for a number of model parameters, such as the open quotient and speed of opening. However, error analysis showed that the algorithm required reasonable formant-frequency constraints, which may be difficult to obtain automatically in some cases.

In the second approach, voice source related measures were used in three voice quality applications: voice source analysis, automatic gender classification and prosody analysis. In voice source analysis, acoustic measures were examined in the context of the voice source model parameters obtained from model-fitting the glottal area waveforms. Results showed that correlations could be established between model parameters and the related acoustic measures, such as the asymmetry coefficient and harmonic-to-noise ratio measures. It was also shown that the model parameters and related acoustic measures were affected by the type of voice quality (pressed, normal and breathy). In gender classification, voice source related measures were found to be more helpful for younger (10-14 year old) speakers, for whom traditional pitch and formant frequency features were less useful. Analysis of prosody showed that, among other things, features correlated with pitch accents were not necessarily centered at the target syllable, and depended on the position of other prosodic events.
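As an illustration of one parameter named in the abstract, the open quotient is commonly understood as the fraction of a glottal cycle during which the glottis is open. The sketch below is not the dissertation's method: it assumes a single cycle of a glottal area waveform is already available as a list of samples, and it treats the glottis as "open" wherever the area exceeds a small fraction of the cycle's peak. The function names, the 5% threshold, and the half-sine test pulse are all hypothetical choices made for this example.

```python
import math


def open_quotient(area, threshold_ratio=0.05):
    """Estimate the open quotient of one glottal cycle.

    Assumption: the glottis counts as open wherever the area exceeds
    threshold_ratio times the peak area of the cycle.
    """
    peak = max(area)
    if peak <= 0:
        return 0.0  # glottis never opens in this cycle
    threshold = threshold_ratio * peak
    open_samples = sum(1 for a in area if a > threshold)
    return open_samples / len(area)


def synthetic_cycle(n_samples=200, open_fraction=0.6):
    """Hypothetical test pulse: half-sine while open, zero while closed."""
    n_open = int(round(n_samples * open_fraction))
    pulse = [math.sin(math.pi * i / n_open) for i in range(n_open)]
    return pulse + [0.0] * (n_samples - n_open)


cycle = synthetic_cycle()
oq = open_quotient(cycle)
print(f"estimated open quotient: {oq:.3f}")
```

The estimate comes out slightly below the nominal open fraction of 0.6, because the samples at the edges of the pulse fall under the threshold. Real glottal area waveforms from high-speed imaging would also need cycle segmentation and noise handling, which this sketch leaves out.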

Bibliographic record

Author: Shue, Yen-Liang
Author affiliation: University of California, Los Angeles
Degree-granting institution: University of California, Los Angeles
Subject: Language Linguistics; Engineering Electronics and Electrical
Degree: Ph.D.
Year: 2010
Pages: 189 p.
Original format: PDF
Language: English (eng)

