首页> 外文会议>International Symposium on Intelligent Multimedia, Video and Speech Processing >Comparison of linear prediction cepstrum coefficients and Mel-frequency cepstrum coefficients for language identification
【24h】

Comparison of linear prediction cepstrum coefficients and Mel-frequency cepstrum coefficients for language identification

机译:语言识别线性预测谱系齐系数和熔融频率综合系数的比较

获取原文

摘要

The speech parameterisation methods: Linear Prediction Cepstrum Coeficients and MelFrequency Cepstrum Coefficients were compared with regard to language identification accuracy in a Gaussian Mixture Model based language identfication system. Ten different languages were used to test against a set of ten second test files. The 12th order Linear Prediction Cepstrum Coefficients with delta and accelerate coefficients resulted the best accuracy of 60.0 percent This has shown that information obtained from linear prediction analysis has increased the ability of discriminating different languages. It also shows that language identification performance may be increased by encompass temporal information by including delta and acceleration features. Besides the performance of our test system has proved the feasibility of modeling language by a single Gaussian Mixture Model instead of using complex system such as phonetic recogniser followed by language modelling or large vocabulary continuous speech recognition system.
机译:与基于高斯混合模型的语言识别系统中的语言识别准确性相比,语音参数化方法:对语言识别精度进行了比较了线性预测综注组合和软件谱系齐。十种不同的语言用于针对一组十个测试文件进行测试。具有增量和加速系数的第12阶线性预测综糖系数导致最佳精度为60.0%,这表明从线性预测分析获得的信息增加了识别不同语言的能力。它还示出了通过包括Delta和加速度的包括时间信息,可以增加语言识别性能。除了我们测试系统的性能之外,还证明了单个高斯混合模型的建模语言的可行性,而不是使用诸如语音识别器的复杂系统,然后使用语言建模或大型词汇连续语音识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号