首页> 外文会议>2012 IEEE Workshop on Spoken Language Technology. >Frame-based phonotactic Language Identification
【24h】

Frame-based phonotactic Language Identification

机译:基于帧的音律语言识别

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a frame-based phonotactic Language Identification (LID) system, which was used for the LID evaluation of the Robust Automatic Transcription of Speech (RATS) program by the Defense Advanced Research Projects Agency (DARPA). The proposed approach utilizes features derived from frame-level phone log-likelihoods from a phone recognizer. It is an attempt to capture not only phone sequence information but also short-term timing information for phone N-gram events, which is lacking in conventional phonotactic LID systems that simply count phone N-gram events. Based on this new method, we achieved 26% relative improvement in terms of Cavg for the RATS LID evaluation data compared to phone N-gram counts modeling. We also observed that it had a significant impact on score combination with our best acoustic system based on Mel-Frequency Cepstral Coefficients (MFCCs).
机译:本文介绍了一种基于帧的音符语言识别(LID)系统,该系统用于国防高级研究计划局(DARPA)对语音的稳健自动转录(RATS)程序的LID评估。所提出的方法利用了来自电话识别器的帧级电话对数可能性的特征。试图不仅捕获电话序列信息而且捕获针对电话N-gram事件的短期定时信息,这是传统的仅对电话N-gram事件进行计数的音变LID系统所缺少的。基于这种新方法,与电话N-gram计数模型相比,RATS LID评估数据的C avg 相对提高了26%。我们还观察到,它与基于梅尔频率倒谱系数(MFCC)的最佳声学系统对得分组合产生了重大影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号