首页> 外文会议>International Conference on Audio, Language and Image Processing >Auditory Features with Vocal Track Length Normalization for Language Identification
【24h】

Auditory Features with Vocal Track Length Normalization for Language Identification

机译:听觉功能具有语言识别的声道长度归一化

获取原文

摘要

This paper reports on a novel feature, auditory cepstrum coefficient (ACC) with vocal tract length normalization (VTLN), for language identification (LID). The ACC feature is based on the auditory characteristics of human ear and the VTLN technology compensates the speaker variability. The detailed implementation of ACC feature with VTLN in frequency domain is given. Experimental results show that the proposed auditory feature outperforms its widely used Mel-frequency cepstrum coefficient (MFCC) counterpart and is more effective when combined with VTLN.
机译:本文报告了具有语言识别(盖子)的语言识别(VTLN)的新颖特征,听觉综糖系数(ACC)。 ACC功能基于人耳的听觉特性,VTLN技术补偿了扬声器变异性。给出了频域中VTLN的ACC功能的详细实现。实验结果表明,所提出的听觉特征优于其广泛使用的熔融频率谱系数(MFCC)对应物,在与VTLN结合时更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号