Learning Speech Variability in Discriminative Acoustic Model Adaptation



Abstract

We present a new discriminative method of acoustic model adaptation that deals with task-dependent speech variability. We focus on differences in expressions or speaking styles between tasks, and the objective of the method is to improve the recognition accuracy of indistinctly pronounced phrases that depend on a speaking style. The adaptation appends subword models for frequently observed variants of subwords in the task. To find the task-dependent variants, low-confidence words are statistically selected, using their word lattices, from the words that occur with high frequency in the task's adaptation data. HMM parameters of the subword models dependent on these words are discriminatively trained using linear transforms with a minimum phoneme error (MPE) criterion. For the MPE training, a subword accuracy measure that discriminates between the variants and the originals is also investigated. In speech recognition experiments, the proposed adaptation with the subword variants reduced the word error rate by 12.0% relative in a Japanese conversational broadcast task.
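The word-selection step described in the abstract can be illustrated with a minimal sketch. It assumes each recognized word in the adaptation data already carries a confidence score (in practice this would typically be a posterior derived from the word lattice). The function name `select_variant_candidates`, the thresholds `min_count` and `max_mean_conf`, and the toy data are all hypothetical; the abstract does not specify the statistical criterion actually used, so a simple frequency-plus-mean-confidence rule stands in for it here.

```python
from collections import defaultdict
from statistics import mean


def select_variant_candidates(observations, min_count=3, max_mean_conf=0.6):
    """Pick frequent words whose average confidence is low.

    observations: iterable of (word, confidence) pairs, one per recognized
    word token. The thresholds are illustrative assumptions, not values
    taken from the paper.
    """
    by_word = defaultdict(list)
    for word, conf in observations:
        by_word[word].append(conf)

    selected = []
    for word, confs in by_word.items():
        avg = mean(confs)
        # Keep only words that are both frequent and poorly recognized.
        if len(confs) >= min_count and avg < max_mean_conf:
            selected.append((word, len(confs), avg))

    # Lowest average confidence first.
    selected.sort(key=lambda item: item[2])
    return selected


# Toy data: (word, confidence) pairs, invented for illustration only.
observations = [
    ("desu", 0.35), ("desu", 0.50), ("desu", 0.40), ("desu", 0.55),
    ("nhk", 0.95), ("nhk", 0.90), ("nhk", 0.92),
    ("rare", 0.20),
]
candidates = select_variant_candidates(observations)
for word, count, avg in candidates:
    print(f"{word}: count={count}, mean confidence={avg:.2f}")
```

In this toy run only "desu" qualifies: "nhk" is frequent but recognized confidently, and "rare" has low confidence but appears too seldom. Words selected this way would be the ones that receive their own subword variant models in the adaptation described above.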

Bibliographic record

  • Source
    IEICE Transactions on Information and Systems | 2010, No. 9 | pp. 2370-2378 | 9 pages
  • Author affiliations

    NHK (Japan Broadcasting Corporation) Science & Technology Research Laboratories, Tokyo, 157-8510 Japan;

    NHK (Japan Broadcasting Corporation) Science & Technology Research Laboratories, Tokyo, 157-8510 Japan;

    NHK (Japan Broadcasting Corporation) Science & Technology Research Laboratories, Tokyo, 157-8510 Japan;

    NHK (Japan Broadcasting Corporation) Science & Technology Research Laboratories, Tokyo, 157-8510 Japan;

    NHK (Japan Broadcasting Corporation) Science & Technology Research Laboratories, Tokyo, 157-8510 Japan;

  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
  • Original format: PDF
  • Language: English
  • Chinese Library Classification
  • Keywords

    speech recognition; speech variability; discriminative training; acoustic model;

  • Date added to database: 2022-08-18 00:26:59
