On the Utility of Syllable-Based Acoustic Models for Pronunciation Variation Modelling

Annika Hämäläinen; Lou Boves; Johan de Veth; Louis ten Bosch

Abstract

Recent research on the TIMIT corpus suggests that longer-length acoustic models are more appropriate for pronunciation variation modelling than the context-dependent phones that conventional automatic speech recognisers use. However, the impressive speech recognition results obtained with longer-length models on TIMIT remain to be reproduced on other corpora. To understand the conditions in which longer-length acoustic models result in considerable improvements in recognition performance, we carry out recognition experiments on both TIMIT and the Spoken Dutch Corpus and analyse the differences between the two sets of results. We establish that the details of the procedure used for initialising the longer-length models have a substantial effect on the speech recognition results. When initialised appropriately, longer-length acoustic models that borrow their topology from a sequence of triphones cannot capture the pronunciation variation phenomena that hinder recognition performance the most.
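One way to picture a longer-length model that "borrows its topology from a sequence of triphones" is to concatenate the states of the triphones that make up a syllable and copy their parameters as the starting point. The sketch below is only an illustration under stated assumptions, not the authors' procedure, which the abstract does not describe. The three emitting states per triphone, the single diagonal Gaussian per state, and all names (`State`, `Hmm`, `init_syllable_model`, `toy_triphone`, the triphone labels) are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class State:
    """One emitting HMM state, reduced to a single diagonal Gaussian (assumption)."""
    mean: List[float]
    var: List[float]


@dataclass
class Hmm:
    """Left-to-right HMM: a name plus an ordered list of emitting states."""
    name: str
    states: List[State]


def init_syllable_model(name: str, triphone_names: List[str],
                        triphones: Dict[str, Hmm]) -> Hmm:
    """Build a syllable model by concatenating copies of triphone states."""
    states = []
    for tri in triphone_names:
        for s in triphones[tri].states:
            # Copy the parameters so that later re-estimation of the syllable
            # model does not alter the triphone it was initialised from.
            states.append(State(list(s.mean), list(s.var)))
    return Hmm(name, states)


def toy_triphone(name: str, value: float) -> Hmm:
    # Hypothetical 3-state triphone with 2-dimensional features.
    return Hmm(name, [State([value, value], [1.0, 1.0]) for _ in range(3)])


# Toy inventory; the labels are illustrative, not taken from the paper.
triphones = {
    "sil-k+ae": toy_triphone("sil-k+ae", 0.0),
    "k-ae+t": toy_triphone("k-ae+t", 1.0),
    "ae-t+sil": toy_triphone("ae-t+sil", 2.0),
}
syllable = init_syllable_model("k_ae_t", list(triphones), triphones)
print(len(syllable.states))
```

In this sketch the syllable model starts with exactly the states of its constituent triphones, so any gain over triphones would have to come from subsequent training of the syllable-specific copies.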

Bibliographic details

Source: EURASIP Journal on Audio, Speech, and Music Processing, 2007, no. 1, 11 pages
Authors: Annika Hämäläinen; Lou Boves; Johan de Veth; Louis ten Bosch
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

Similar literature

1. Acoustic Pronunciation Variations Modeling for Standard Malay Speech Recognition (journal article). Noraini Seman, Kamaruzaman Jusoff. Computer and Information Science, 2008, no. 4.
2. Syllable-based acoustical modeling for Japanese spontaneous recognition (journal article). Jun Ogata, Yasuo Ariki. 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication, 2002, no. 527.
3. Syllable-based acoustical modeling for Japanese spontaneous recognition (journal article). Jun Ogata, Yasuo Ariki. 電子情報通信学会技術研究報告. 音声. Speech, 2002, no. 529.
4. Modeling Syllable-Based Pronunciation Variation for Accented Mandarin Speech Recognition (conference paper). Zhang Shilei, Shi Qin, Qin Yong. 2010 20th International Conference on Pattern Recognition, 2010.
5. Pronunciation Variation Modeling for Automatic Speech Recognition (dissertation). Zheng, Jing. 2014.
6. Commentary: Utility-free heuristic models of two-option choice can mimic predictions of utility-stage models under many conditions (other). Camillo Padoa-Schioppa. 2015.
7. On the Utility of Syllable-Based Acoustic Models for Pronunciation Variation Modelling (other). 2007.