首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Long-distance rhythmic dependencies and their application to automatic language identification
【24h】

Long-distance rhythmic dependencies and their application to automatic language identification

机译:长途节奏依赖性及其在自动语言识别中的应用

获取原文

摘要

The perception of rhythmic differences among languages relies on varieties in periodicity within prominence groups. But the consensus in phonetic research on rhythm is that existing measures don't capture true rhythm by that definition - instead, they merely measure short-term timing. This work proposes a new rhythm measure, the Generalized Variability Index (GVI), that examines durational contexts over arbitrarily long linguistic distances. To evaluate this new measure, we conducted a set of experiments in automatic language identification using large amounts of data from 11 languages in the Globalphone and TIMIT corpora. When added to baseline rhythm measures, these new GVI features offer absolute improvement in 11-way language classification accuracy by as much as 12%. Moreover, the addition of wider and wider durational context in the GVI continues to contribute information useful for automatic language ID, abating in usefulness only at a distance of about 10 syllables.
机译:语言之间节奏差异的感知取决于突出组中周期性的变化。但是,关于节奏的语音研究中的共识是,现有的量度不能通过该定义捕捉真实的节奏-相反,它们仅测量短期时机。这项工作提出了一种新的节奏测度,即广义变异指数(GVI),它可以检查任意较长语言距离上的持续情境。为了评估这项新措施,我们在Globalphone和TIMIT语料库中使用来自11种语言的大量数据进行了自动语言识别的一组实验。将这些新的GVI功能添加到基准节奏测量中后,可以将11种语言的分类准确性绝对提高12%。此外,在GVI中添加越来越大的持续时间上下文继续为自动语言ID提供有用的信息,仅在大约10个音节的距离处有用性降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号