HMM-based large vocabulary speech recognition systems usually have a very large number of statistical parameters. For more robust estimation, the number of parameters is reduced by sharing them across models. The parameter sharing is decided by regression trees, which are built using phonetic classes designed either by a human expert or by data-driven methods. In situations where neither of these is reliable, it may be useful to have techniques for non-decision-tree-based state tying that perform comparably to those based on traditional methods. In this paper we propose two methods for non-decision-tree-based parameter learning in HMM-based systems. In the first method (context-dependent state tying), we restructure acoustic models to explicitly capture the transitions between phones in continuous speech. In the second method (transition-based subword units), we redefine the basic sound units used to model speech so that transitions between sounds are modeled explicitly. Experiments show that context-dependent state tying is a viable option for large vocabulary systems. They also show that using transition-based subword units can improve performance on spontaneous speech.
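For illustration, the decision-tree state tying that this paper seeks an alternative to can be sketched as follows. This is a toy example under assumed conventions: the question set, the `tie_state` function, and the label format are invented for the sketch, not taken from the paper; real systems grow the tree greedily by likelihood gain over training statistics.

```python
# Toy sketch of phonetic-question-based state tying (illustrative only).
# A phonetic question asks whether the left or right context phone of a
# triphone belongs to a linguist-designed class; triphone states that
# answer all questions identically are tied to one shared state.

QUESTIONS = [
    ("left", {"m", "n", "ng"}),   # is the left context a nasal?
    ("right", {"p", "t", "k"}),   # is the right context an unvoiced stop?
]

def tie_state(triphone):
    """Map a triphone (left, center, right) to a tied-state label by
    answering each phonetic question in turn."""
    left, center, right = triphone
    answers = []
    for side, phone_class in QUESTIONS:
        ctx = left if side == "left" else right
        answers.append("y" if ctx in phone_class else "n")
    return center + "-" + "".join(answers)

# Triphones with identical answers share one tied state, and hence
# share the statistical parameters estimated for that state.
print(tie_state(("m", "ae", "t")))  # -> ae-yy
print(tie_state(("n", "ae", "k")))  # -> ae-yy (tied to the same state)
print(tie_state(("s", "ae", "b")))  # -> ae-nn (a different tied state)
```

The paper's two proposed methods avoid relying on such a question set, which matters when neither expert-designed nor data-driven phonetic classes are reliable.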