Phonetic alignment: speech synthesis-based vs. Viterbi-based

F. Malfrere; O. Deroo; T. Dutoit; C. Ris


Abstract

In this paper we compare two methods for automatic phonetic labeling of a continuous speech database, as is usually required when designing a speech recognition or speech synthesis system. The first method is based on temporal alignment of the speech with a synthetic speech pattern; the second uses either continuous-density hidden Markov models (HMMs) or a hybrid HMM/ANN (artificial neural network) system in forced-alignment mode. Both systems were evaluated on read utterances that were not part of the training set of the HMM systems, and compared with manual segmentation. This study outlines the advantages and drawbacks of both methods. The synthesis-based system has the great advantage that no training stage (and hence no large labeled database) is needed, while HMM systems easily handle multiple phonetic transcriptions (a phonetic lattice). We derive a method for the automatic creation of large phonetically labeled speech databases, which uses the synthesis-based segmentation tool to bootstrap the training of either an HMM or a hybrid HMM/ANN system. Such segmentation tools are key to the development of improved multilingual speech synthesis and recognition systems.
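
The synthesis-based method can be illustrated with a minimal sketch. The abstract does not name the alignment algorithm, so dynamic time warping (DTW) is assumed here as one common way to align a natural utterance with a synthetic one whose phone boundaries are known. The function names `dtw_path` and `transfer_boundaries`, the Euclidean frame distance, and the toy one-dimensional features are all hypothetical and are not taken from the paper.

```python
from math import inf


def dtw_path(ref, test):
    """Return a minimum-cost monotonic alignment path between two sequences
    of feature vectors, as a list of (ref_frame, test_frame) index pairs."""
    n, m = len(ref), len(test)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")

    def dist(a, b):
        # Euclidean distance between two frames (an assumed local cost).
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    # cost[i][j]: best accumulated cost aligning ref[:i] with test[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = dist(ref[i - 1], test[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1])

    # Backtrack from the end, always moving to the cheapest predecessor.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
        path.append((i - 1, j - 1))
    path.reverse()
    return path


def transfer_boundaries(path, synth_boundaries):
    """Map each phone-boundary frame of the synthetic signal to the first
    natural-speech frame that the path aligns with it."""
    first_match = {}
    for s, t in path:
        first_match.setdefault(s, t)
    return [first_match[b] for b in synth_boundaries]


# Toy example: three "phones" with feature values 0, 1 and 2.
synth = [[0.0], [0.0], [1.0], [1.0], [2.0], [2.0]]        # boundaries known
natural = [[0.0], [0.0], [0.0], [1.0], [1.0], [1.0], [1.0], [2.0], [2.0]]
synth_boundaries = [2, 4]  # first frame of the second and third phone

print(transfer_boundaries(dtw_path(synth, natural), synth_boundaries))  # → [3, 7]
```

In this sketch the phone boundaries come for free from the synthesizer, which is why no labeled training data is needed. A real system would use spectral feature vectors rather than scalar toy values, and would likely constrain the warping path.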

Bibliographic details

Source: Speech Communication, 2003, No. 4, pp. 503-515 (13 pages)
Authors: F. Malfrere; O. Deroo; T. Dutoit; C. Ris
Affiliation: Faculte Polytechnique de Mons-TCTS, 31, Bld Dolez, B-7000 Mons, Belgium
Original format: PDF
Language: English
Chinese Library Classification: Language and writing
Keywords: speech segmentation; hidden Markov models; hybrid HMM/ANN systems; speech synthesis; large speech corpora

