Multi-speaker articulatory trajectory formation based on speaker-independent articulatory HMMs

Sadao Hiroya; Takemi Mochida

首页> 外文期刊>Speech Communication >Multi-speaker articulatory trajectory formation based on speaker-independent articulatory HMMs

【24h】

Multi-speaker articulatory trajectory formation based on speaker-independent articulatory HMMs

机译：基于独立于说话者的关节HMM的多说话者关节轨迹形成

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Inter-speaker variability in the speech spectrum domain has been modeled using speaker-adaptive training (SAT), in which speaker-independent phoneme-specific hidden Markov models (HMMs) were used along with a speaker-adaptive matrix. In this paper, multi-speaker articulatory trajectory formation based on this method is presented. Both speaker-independent and speaker-specific features are statistically separated from a multi-speaker articulatory database, which consists of the mid-sagittal motion data of the lips, incisor, and tongue measured with an electro-magnetic articulographic (EMA) system. We evaluated the proposed method in terms of the RMS error between the measured and estimated articulatory parameters. When multi-speaker models of articulatory parameters with two speaker-adaptive matrices for each speaker were used, the average RMS error of articulatory parameters was 1.29 mm and showed no statistically significant difference from that for speaker-dependent models (1.22 mm). For comparison, multi-speaker models of the conventional speech spectrum were also constructed using a multi-speaker spectrum database, which consists of speech data simultaneously recorded during the articulatory measurements. The average spectral distance between the vocal-tract and estimated spectrum from two-matrix models was 4.19 dB and showed a statistically significant difference from that for speaker-dependent models (3.97 dB). These results indicate that modeling of inter-speaker variability in the articulatory parameter domain with a small number of matrices for each speaker almost perfectly approximates the speaker dependency of articulation and is better than that in the speech spectrum domain.

机译：已经使用说话者自适应训练（SAT）对语音频谱域中的说话者之间的可变性进行了建模，其中使用了与说话者无关的音素特有的隐马尔可夫模型（HMM）和说话者自适应矩阵。本文提出了一种基于这种方法的多扬声器发音轨迹的形成方法。独立于说话者的特征和特定于说话者的特征都从多说话者发音数据库中进行了统计分离，该数据库由使用电磁关节造影（EMA）系统测量的嘴唇，门齿和舌头的中矢状运动数据组成。我们根据测得的和估计的关节参数之间的RMS误差对提出的方法进行了评估。当使用每个说话者具有两个说话者自适应矩阵的发音参数的多说话者模型时，发音参数的平均RMS误差为1.29 mm，与说话者依赖模型（1.22 mm）相比，没有统计学上的显着差异。为了进行比较，还使用多扬声器频谱数据库构建了常规语音频谱的多扬声器模型，该数据库由在发音测量过程中同时记录的语音数据组成。两个矩阵模型的声道与估计频谱之间的平均频谱距离为4.19 dB，与说话者相关模型的平均频谱距离（3.97 dB）相比，具有统计上的显着差异。这些结果表明，在发音参数域中的说话者间可变性的建模，每个说话者的矩阵数量很少，几乎完美地近似了发音的说话者依赖性，并且比在语音频谱域中更好。

著录项

来源
《Speech Communication》 |2006年第12期|p.1677-1690|共14页
作者
Sadao Hiroya; Takemi Mochida;
展开▼
作者单位

NTT Communication Science Laboratories, NTT Corporation, 3-1 Morinosato-Wakamiya, Atsugi-shi, Kanagawa 243-0198, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类语言、文字;
关键词
inter-speaker variability; articulatory trajectory formation; articulatory HMMs; speaker-adaptive training;

机译：说话者之间的变异性;发音轨迹的形成;发音HMM;说话者自适应训练;

相似文献

外文文献
中文文献
专利

1. Acoustic-Articulatory Modeling With the Trajectory HMM [J] . Zhang L., Renals S. IEEE signal processing letters . 2008,第1期

机译：轨迹HMM的声学-关节建模
2. Acoustic-Articulatory Modeling With the Trajectory HMM [J] . Le Zhang, Renals S. IEEE signal processing letters . 2008,第期

机译：轨迹HMM的声学-关节建模
3. Estimation of Articulatory Trajectories Based on Gaussian Mixture Model (GMM) With Audio-Visual Information Fusion and Dynamic Kalman Smoothing [J] . Ozbek I.Y., Hasegawa-Johnson M., Demirekler M. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第5期

机译：基于高斯混合模型（GMM）的视听信息融合和动态卡尔曼平滑的关节运动轨迹估计
4. MULTI-SPEAKER ARTICULATORY RECONSTRUCTION BASED ON AN EIGEN ARTICULATORY HMM [C] . Sadao HIROYA, Takemi MOCHIDA, IEEE IEEE International Conference on Acoustics, Speech, and Signal Processing . 2005

机译：基于特征性术语的多扬声器关节重建
5. Modeling articulatory dynamics using HMM techniques for automatic speech recognition. [D] . Erler, Kevin J. 1994

机译：使用HMM技术对发音动力学进行建模以实现自动语音识别。
6. On smoothing articulatory trajectories obtained from Gaussian mixture model basedacoustic-to-articulatory inversion [O] . Prasanta K. Ghosh, a), Shrikanth S. Narayanan -1

机译：基于高斯混合模型获得的平滑运动轨迹声音到发音的反转
7. Acoustic-Articulatory Modelling with the Trajectory HMM [O] . Zhang Le, Renals Steve 2010

机译：轨迹HMM的声学-关节建模

Multi-speaker articulatory trajectory formation based on speaker-independent articulatory HMMs

摘要

著录项

相似文献

相关主题

期刊订阅