Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

机译：判读式语音建模：一种高利润，功能丰富的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the problem of learning the mapping between words and their possible pronunciations in terms of sub-word units. Most previous approaches have involved generative modeling of the distribution of pronunciations, usually trained to maximize likelihood. We propose a discriminative, feature-rich approach using large-margin learning. This approach allows us to optimize an objective closely related to a discriminative task, to incorporate a large number of complex features, and still do inference efficiently. We test the approach on the task of lexical access; that is, the prediction of a word given a phonetic transcription. In experiments on a subset of the Switchboard conversational speech corpus, our models thus far improve classification error rates from a previously published result of 29.1% to about 15%. We find that large-margin approaches outperform conditional random field learning, and that the Passive-Aggressive algorithm for large-margin learning is faster to converge than the Pegasos algorithm.

机译：我们解决了根据子词单位学习单词及其可能发音之间的映射的问题。先前的大多数方法都涉及对发音分布进行生成建模，通常会对其进行训练以最大程度地提高其可能性。我们提出了一种使用大幅度学习的有区别的，功能丰富的方法。这种方法使我们能够优化与判别任务密切相关的目标，以纳入大量复杂功能，并且仍然可以高效地进行推理。我们测试了词汇访问任务的方法;也就是说，根据语音转录对单词的预测。在“总机”会话语音语料集的子集上进行的实验中，到目前为止，我们的模型将分类错误率从之前公布的29.1％提高到了大约15％。我们发现大利润率方法优于条件随机场学习，并且大利润率学习的被动攻击算法比Pegasos算法收敛更快。

著录项

来源
《Annual meeting of the Association for Computational Linguistics;ACL 2012》|2012年|p.194-203|共10页
会议地点
作者
Hao Tang; Joseph Keshet; Karen Livescu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Discriminative Pronunciation Modeling Using the MPE Criterion [J] . Meixu SONG, Jielin PAN, Qingwei ZHAO, IEICE transactions on information and systems . 2015,第3期

机译：使用MPE标准进行判读语音建模
2. Pronunciation Proficiency Evaluation based on Discriminatively Refined Acoustic Models [J] . Ke Yan, Shu Gong International Journal of Information Technology and Computer Science . 2011,第2期

机译：基于判别改进的声学模型的语音水平评估
3. Within-word pronunciation variation modeling for Arabic ASRs:a direct data-driven approach [J] . Dia AbuZeina, Wasfi Al-Khatib, Moustafa Elshafei, International journal of speech technology . 2012,第2期

机译：阿拉伯语ASR的词内发音变化建模：直接数据驱动方法
4. Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach [C] . Hao Tang, Joseph Keshet, Karen Livescu Annual meeting of the Association for Computational Linguistics . 2012

机译：辨别性的发音建模：大边缘，丰富的方法
5. Improved online learning and modeling for feature-rich discriminative machine translation. [D] . Eidelman, Vladimir Alexander. 2013

机译：改进的在线学习和建模功能，可用于功能丰富的区分性机器翻译。
6. RFMix: A Discriminative Modeling Approach for Rapid and Robust Local-Ancestry Inference [O] . Brian K. Maples, Simon Gravel, Eimear E. Kenny, 2013

机译：RFMix：快速稳健的局部祖先推理的判别建模方法
7. A Discriminative Approach to Pronunciation Variation Modeling in Speech Recognition [O] . Adde Line 2013

机译：语音识别中语音变化建模的判别方法

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

摘要

著录项

相似文献

相关主题

期刊订阅