...
首页> 外文期刊>IEICE transactions on information and systems >Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model
【24h】

Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model

机译:健壮的音素到音素转换模型的权重向量的结构化自适应正则化

获取原文
           

摘要

Grapheme-to-phoneme (g2p) conversion, used to estimate the pronunciations of out-of-vocabulary (OOV) words, is a highly important part of recognition systems, as well as text-to-speech systems. The current state-of-the-art approach in g2p conversion is structured learning based on the Margin Infused Relaxed Algorithm (MIRA), which is an online discriminative training method for multiclass classification. However, it is known that the aggressive weight update method of MIRA is prone to overfitting, even if the current example is an outlier or noisy. Adaptive Regularization of Weight Vectors (AROW) has been proposed to resolve this problem for binary classification. In addition, AROW's update rule is simpler and more efficient than that of MIRA, allowing for more efficient training. Although AROW has these advantages, it has not been applied to g2p conversion yet. In this paper, we first apply AROW on g2p conversion task which is structured learning problem. In an evaluation that employed a dataset generated from the collective knowledge on the Web, our proposed approach achieves a 6.8% error reduction rate compared to MIRA in terms of phoneme error rate. Also the learning time of our proposed approach was shorter than that of MIRA in almost datasets.
机译:音素到音素(g2p)转换,用于估计语音(OOV)单词的发音,是识别系统以及文本到语音系统的重要组成部分。目前,g2p转换中的最新方法是基于边缘融合松弛算法(MIRA)的结构化学习,该算法是用于多类分类的在线判别式训练方法。但是,众所周知,即使当前示例异常或嘈杂,MIRA的主动权重更新方法也容易过拟合。已经提出了权重向量的自适应正则化(AROW)来解决此问题以进行二进制分类。此外,AROW的更新规则比MIRA的更新规则更简单,更有效,从而可以进行更有效的培训。尽管AROW具有这些优点,但尚未应用于g2p转换。在本文中,我们首先将AROW应用于结构化学习问题的g2p转换任务。在评估中,使用了从网络上的集体知识生成的数据集,相对于MIRA,我们提出的方法在音素错误率方面实现了6.8%的错误减少率。而且,在几乎所有数据集中,我们提出的方法的学习时间都比MIRA的学习时间短。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号