首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach
【24h】

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

机译:辨别性的发音建模:大边缘,丰富的方法

获取原文

摘要

We address the problem of learning the mapping between words and their possible pronunciations in terms of sub-word units. Most previous approaches have involved generative modeling of the distribution of pronunciations, usually trained to maximize likelihood. We propose a discriminative, feature-rich approach using large-margin learning. This approach allows us to optimize an objective closely related to a discriminative task, to incorporate a large number of complex features, and still do inference efficiently. We test the approach on the task of lexical access; that is, the prediction of a word given a phonetic transcription. In experiments on a subset of the Switchboard conversational speech corpus, our models thus far improve classification error rates from a previously published result of 29.1% to about 15%. We find that large-margin approaches outperform conditional random field learning, and that the Passive-Aggressive algorithm for large-margin learning is faster to converge than the Pegasos algorithm.
机译:我们解决了在子字单元方面学习单词与他们可能的发音之间的映射的问题。大多数以前的方法有发音的分布涉及生成模型,平时训练的最大化可能性。我们提出了一种使用大保证金学习的歧视性,具有丰富的方法。这种方法使我们能够优化与歧视任务密切相关的目标,以包含大量复杂的功能,并且仍然有效地推断。我们测试词汇访问任务的方法;也就是说,给出了语音转录的单词的预测。在对话板对话语音语料库的子集上的实验中,我们的模型远远提高了先前发布结果的分类误差率为29.1%至约15%。我们发现大边缘接近胜过条件随机场学习,而且大边缘学习的被动攻击算法比PegasoS算法更快地收敛。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号