首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >A Study of Term Weighting in Phonotactic Approach to Spoken Language Recognition
【24h】

A Study of Term Weighting in Phonotactic Approach to Spoken Language Recognition

机译:语音策略中的语音称谓术语加权研究

获取原文

摘要

In the spoken language recognition approach of modeling phonetic lattice with the Support Vector Machine (SVM), term weighting on the supervector of N-gram probabilities is critical to the recognition performance because the weighting prevents the SVM kernel from being dominated by a few large probabilities. We investigate several term weighting functions that are used in text retrieval, which can incorporate the long-term semantic modeling in the short-term N-gram modeling. The functions are evaluated on „ the NIST 2007 Language Recognition Evaluation (LRE) task. Results suggest that the term weighting with redundancy of term frequency (rd) can effectively eliminate the redundancy of unit frequency co-occurrence across languages, and the combination of rd and logtf demonstrates the effectiveness of combining the local and global weighting functions.
机译:在使用支持向量机(SVM)对语音格进行建模的口语识别方法中,N-gram概率的超向量上的项加权对识别性能至关重要,因为加权避免了SVM内核被少数大概率所支配。我们研究了在文本检索中使用的几种术语加权函数,这些函数可以将长期语义建模合并到短期N-gram建模中。在“ NIST 2007语言识别评估(LRE)”任务上评估功能。结果表明,具有术语频率(rd)冗余的术语加权可以有效消除跨语言的单位频率共现的冗余,并且rd和logtf的组合证明了组合局部和全局加权函数的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号