首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2012 >Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach
【24h】

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach

机译:判读式语音建模:一种高利润,功能丰富的方法

获取原文

摘要

We address the problem of learning the mapping between words and their possible pronunciations in terms of sub-word units. Most previous approaches have involved generative modeling of the distribution of pronunciations, usually trained to maximize likelihood. We propose a discriminative, feature-rich approach using large-margin learning. This approach allows us to optimize an objective closely related to a discriminative task, to incorporate a large number of complex features, and still do inference efficiently. We test the approach on the task of lexical access; that is, the prediction of a word given a phonetic transcription. In experiments on a subset of the Switchboard conversational speech corpus, our models thus far improve classification error rates from a previously published result of 29.1% to about 15%. We find that large-margin approaches outperform conditional random field learning, and that the Passive-Aggressive algorithm for large-margin learning is faster to converge than the Pegasos algorithm.
机译:我们解决了根据子词单位学习单词及其可能发音之间的映射的问题。先前的大多数方法都涉及对发音分布进行生成建模,通常会对其进行训练以最大程度地提高其可能性。我们提出了一种使用大幅度学习的有区别的,功能丰富的方法。这种方法使我们能够优化与判别任务密切相关的目标,以纳入大量复杂功能,并且仍然可以高效地进行推理。我们测试了词汇访问任务的方法;也就是说,根据语音转录对单词的预测。在“总机”会话语音语料集的子集上进行的实验中,到目前为止,我们的模型将分类错误率从之前公布的29.1%提高到了大约15%。我们发现大利润率方法优于条件随机场学习,并且大利润率学习的被动攻击算法比Pegasos算法收敛更快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号