首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Towards long-range prosodic attribute modeling for language recognition
【24h】

Towards long-range prosodic attribute modeling for language recognition

机译:走向用于语言识别的远程韵律属性建模

获取原文

摘要

As a high-level feature, prosody may be an effective feature when it is modeled over longer ranges than the typical range of a syllable. This paper is about language recognition with the high-level prosodic attributes. It studies two important issues of long-range modeling, namely the data scarcity handling method, and the model which properly describes prosodic boundary events. Illustrated by NIST language recognition evaluation (LRE) 2009, long-range modeling is shown to bring a 7.2% relative improvement to a prosodic language detector. Score fusion between the long-range prosodic system and a phonotactic system gives an EER of 3.07%. Exploiting boundary N-grams is the main contributing factor to global EER reduction, while different long-range prosodic modeling factors benefit the detection of different languages. Analysis reveals the evidence of language-specific long-range prosodic attributes, which sheds light on robust long-range modeling methods for language recognition.
机译:作为高级功能,韵律在比音节的典型范围更长的范围内建模时可能是有效的功能。本文是关于具有高级韵律属性的语言识别的。它研究了远程建模的两个重要问题,即数据稀缺性处理方法和正确描述韵律边界事件的模型。由NIST语言识别评估(LRE)2009说明,远程建模显示给韵律语言检测器带来7.2%的相对改进。远程韵律系统和音律系统之间的分数融合得出的EER为3.07%。利用边界N-gram是全局EER减少的主要促成因素,而不同的远程韵律建模因素则有利于检测不同的语言。分析揭示了特定于语言的远程韵律属性的证据,这为用于语言识别的强大的远程建模方法提供了启示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号