首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

【24h】

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

机译：辨别性的发音建模：大边缘，丰富的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the problem of learning the mapping between words and their possible pronunciations in terms of sub-word units. Most previous approaches have involved generative modeling of the distribution of pronunciations, usually trained to maximize likelihood. We propose a discriminative, feature-rich approach using large-margin learning. This approach allows us to optimize an objective closely related to a discriminative task, to incorporate a large number of complex features, and still do inference efficiently. We test the approach on the task of lexical access; that is, the prediction of a word given a phonetic transcription. In experiments on a subset of the Switchboard conversational speech corpus, our models thus far improve classification error rates from a previously published result of 29.1% to about 15%. We find that large-margin approaches outperform conditional random field learning, and that the Passive-Aggressive algorithm for large-margin learning is faster to converge than the Pegasos algorithm.

机译：我们解决了在子字单元方面学习单词与他们可能的发音之间的映射的问题。大多数以前的方法有发音的分布涉及生成模型，平时训练的最大化可能性。我们提出了一种使用大保证金学习的歧视性，具有丰富的方法。这种方法使我们能够优化与歧视任务密切相关的目标，以包含大量复杂的功能，并且仍然有效地推断。我们测试词汇访问任务的方法;也就是说，给出了语音转录的单词的预测。在对话板对话语音语料库的子集上的实验中，我们的模型远远提高了先前发布结果的分类误差率为29.1％至约15％。我们发现大边缘接近胜过条件随机场学习，而且大边缘学习的被动攻击算法比PegasoS算法更快地收敛。

著录项

来源
《Annual meeting of the Association for Computational Linguistics 》|2012年||共10页
会议地点
作者
Hao Tang; Joseph Keshet; Karen Livescu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程 ;
关键词

相似文献

外文文献
中文文献
专利

1. Discriminative Pronunciation Modeling Using the MPE Criterion [J] . Meixu SONG, Jielin PAN, Qingwei ZHAO, IEICE transactions on information and systems . 2015 ,第3期

机译：使用MPE标准进行判读语音建模
2. Pronunciation Proficiency Evaluation based on Discriminatively Refined Acoustic Models [J] . Ke Yan, Shu Gong International Journal of Information Technology and Computer Science . 2011 ,第2期

机译：基于判别改进的声学模型的语音水平评估
3. Within-word pronunciation variation modeling for Arabic ASRs:a direct data-driven approach [J] . Dia AbuZeina, Wasfi Al-Khatib, Moustafa Elshafei, International journal of speech technology . 2012 ,第2期

机译：阿拉伯语ASR的词内发音变化建模：直接数据驱动方法
4. Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach [C] . Hao Tang, Joseph Keshet, Karen Livescu Annual meeting of the Association for Computational Linguistics;ACL 2012 . 2012

机译：判读式语音建模：一种高利润，功能丰富的方法
5. Improved online learning and modeling for feature-rich discriminative machine translation. [D] . Eidelman, Vladimir Alexander. 2013

机译：改进的在线学习和建模功能，可用于功能丰富的区分性机器翻译。
6. RFMix: A Discriminative Modeling Approach for Rapid and Robust Local-Ancestry Inference [O] . Brian K. Maples, Simon Gravel, Eimear E. Kenny, 2013

机译：RFMix：快速稳健的局部祖先推理的判别建模方法
7. A Discriminative Approach to Pronunciation Variation Modeling in Speech Recognition [O] . Adde Line 2013

机译：语音识别中语音变化建模的判别方法

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

摘要

著录项

相似文献

相关主题

期刊订阅