SIM-simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals

Matthias Frohlich; Dirk Michaelis; Hans Werner Strube

首页> 外文期刊>The Journal of the Acoustical Society of America >SIM-simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals

【24h】

SIM-simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals

机译：SIM-同时对声语音信号的声门流模型进行逆滤波和匹配

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A new method "simultaneous inverse filtering and model matching" (SIM) is proposed that allows one to calculate voice source measures without any user interaction. it is based on the discrete all-pole modeling (DAP) technique for inverse filtering (IF), which is modified to include a model of the glottal flow as integral part [LF model, Fant et al., STL-QPSR (Stockholm) 4/1985, 1-13 (1986)]. As the correct LF parameters are initially unknown, they are estimated in an iterative procedure using multi-dimensional optimization techniques that are initialized according to the results of an exhaustive search. The error criteria applied reflect how well the IF is performed after the spectral contribution of the glottal flow has been removed. The resulting optimal LF parameter constellation serves as the basis to calculate 11 voice source measures. The performance was evaluated using synthesized signals and recordings of natural utterances. For the synthesized signals, the accuracy to reproduce the original parameters was high (correlations exceeding 0.88) for measures when the starting point of the glottal cycle did not enter explicitly. Errors were smaller compared to conventional estimation methods where the measures were estimated from the IF signal. The analysis of natural utterances indicates that problems still exist with regard to robustness, but that under advantageous conditions the open quotient, the speed quotient, the closing quotient, the parabolic spectral parameter, and the negative peak amplitude of the glottal flow derivative can indeed be determined automatically by the SIM method.

机译：提出了一种新的方法“同时逆滤波和模型匹配”（SIM），该方法无需用户交互即可计算语音源度量。它基于用于逆滤波（IF）的离散全极点建模（DAP）技术，该技术经过修改以包含声门流模型作为整体部分[LF模型，Fant等人，STL-QPSR（Stockholm） 4 / 1985，1-13（1986）]。由于正确的LF参数最初是未知的，因此将使用多维优化技术在迭代过程中对它们进行估算，这些优化技术将根据详尽搜索的结果进行初始化。所应用的误差标准反映了去除声门流频谱贡献后中频的执行情况。得到的最佳LF参数星座图是计算11种语音源度量的基础。使用合成信号和自然发声记录来评估性能。对于合成信号，当声门循环的起点未明确输入时，测量的原始信号再现精度较高（相关性超过0.88）。与传统的估计方法相比，误差较小，传统的估计方法是根据IF信号估计测量值的。对自然话语的分析表明，在鲁棒性方面仍然存在问题，但是在有利条件下，声门流导数的开数，速度数，闭数，抛物线谱参数和负峰值幅度确实可以是由SIM方法自动确定。

著录项

来源
《The Journal of the Acoustical Society of America》 |2001年第1期|共10页
作者
Matthias Frohlich; Dirk Michaelis; Hans Werner Strube;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类声学;
关键词

相似文献

外文文献
中文文献
专利

1. SIM-simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals [J] . Matthias Frohlich, Dirk Michaelis, Hans Werner Strube The Journal of the Acoustical Society of America . 2001,第1期

机译：SIM-同时对声语音信号的声门流模型进行逆滤波和匹配
2. HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering [J] . Raitio T.Suni A.Yamagishi J.Pulakka H.Nurminen J.Vainio M.Alku P. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第1期

机译：基于声门逆滤波的基于HMM的语音合成
3. Determination of glottal closure instants from clean and telephone quality speech signals using single frequency filtering [J] . Sudarsana Reddy Kadiri, B. Yegnanarayana Computer speech and language . 2020,第Nova期

机译：使用单频滤波测定清洁和电话质量语音信号的光门闭合时刻
4. OBJECTIVE QUALITY MEASURES FOR GLOTTAL INVERSE FILTERING OF SPEECH PRESSURE SIGNALS [C] . Tom Backstrom, Matti Airas, Laura Lehto, IEEE International Conference on Acoustics, Speech, and Signal Processing . 2005

机译：物理质量措施，用于语音压力信号的发光逆滤波
5. Estimation of glottal source features from the spectral envelope of the acoustic speech signal. [D] . Torres, Juan Felix. 2010

机译：从声音语音信号的频谱包络估计声门源特征。
6. Particle Swarm Optimization Based Feature Enhancement and Feature Selection for Improved Emotion Recognition in Speech and Glottal Signals [O] . Hariharan Muthusamy, Kemal Polat, Sazali Yaacob -1

机译：基于粒子群优化的特征增强和特征选择用于语音和声门信号中的情感识别
7. Acoustic coupling during incomplete glottal closure and its effect on the inverse filtering of oral airflow [O] . Matías Zañartu, Julio C. Ho, Daryush D. Mehta, 2013

机译：在不完全光泽的闭合期间的声学耦合及其对口腔气流逆滤波的影响
8. Performance Evaluation of Glottal Inverse Filtering Algorithms Using a Physiologically Based Articulatory Speech Synthesizer. [R] . Quatieri, T. F., Mehta, D. D., Chien, Y., 2017

机译：基于生理学发音语音合成器的声门反滤波算法性能评估。

SIM-simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals

摘要

著录项

相似文献

相关主题

期刊订阅