首页> 外文会议>2012 8th International Symposium on Chinese Spoken Language Processing. >Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers
【24h】

Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers

机译:语音法口语识别:在并行电话识别器中使用多种适应的声学模型

获取原文
获取原文并翻译 | 示例

摘要

In phonotactic spoken language recognition systems, acoustic model adaptation prior to phone lattice decoding has been adopted to deal with the mismatch between training and test conditions. Moreover, combining diversified phonotactic features is commonly used. These motivate us to have an in-depth investigation of combining diversified phonotactic features from diversely adapted acoustic models. Our experiment shows that our approach achieves an equal error rate (EER) of 1.94% in the 30-second closed-set trials of the 2007 NIST Language Recognition Evaluation (LRE). It represents a 14.9% relative improvement in EER over a sophisticated system, in which parallel phone recognizers, speaker adaptive training (SAT) in acoustic models and CMLLR adaptation are used. Moreover, it is shown that our approach provides consistent and substantial improvements in three different phonotactic systems, in each of which a single phone recognizer is used.
机译:在音位练习口语识别系统中,已采用电话晶格解码之前的声学模型自适应来处理训练条件与测试条件之间的不匹配。而且,通常使用组合多种变音术特征。这些促使我们进行深入研究,以结合来自各种适应性声学模型的多种音律学特征。我们的实验表明,我们的方法在2007年NIST语言识别评估(LRE)的30秒封闭试验中实现了1.94%的均等错误率(EER)。与使用复杂的电话识别器,声学模型中的说话人自适应训练(SAT)和CMLLR自适应的复杂系统相比,它的EER相对提高了14.9%。此外,结果表明,我们的方法在三个不同的音律系统中提供了一致且实质性的改进,每个系统都使用一个电话识别器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号