首页> 外文会议>International Conference on Signal Processing and Communications >Sub-segmental, segmental and supra-segmental analysis of linear prediction residual signal for language identification
【24h】

Sub-segmental, segmental and supra-segmental analysis of linear prediction residual signal for language identification

机译:用于语言识别的线性预测残差信号的分段,分段和超分段分析

获取原文

摘要

In this work, excitation source information is explored for language identification (LID) task. The excitation signal is represented by linear prediction (LP) residual. Different aspects of the excitation source information can be captured by processing LP residual signal at sub-segmental, segmental and supra-segmental levels. Gaussian mixture modelling (GMM) technique is used to build the language models. Present LID study has been carried out on IITKGP-MLILSC speech database. Individually, the segmental level information provides good LID accuracy followed by sub-segmental and supra-segmental level information. Combined evidences from all three levels represent the complete excitation source information. Finally, a comparative study has been carried out between the vocal tract and excitation source features, which portrays the distinct nature of these two features. Combination of both the features, yield an improvement of 10.01% in LID accuracy than only excitation source information. This observation indicates the significance of excitation source information for LID task.
机译:在这项工作中,探索激发源信息以进行语言识别(LID)任务。激励信号由线性预测(LP)残差表示。可以通过在子分段,分段和超分段级别处理LP残留信号来捕获激发源信息的不同方面。高斯混合建模(GMM)技术用于构建语言模型。目前的LID研究已在IITKGP-MLILSC语音数据库上进行。单独地,分段级别信息提供了良好的LID准确性,其后是子分段级别信息和超分段级别信息。来自所有三个级别的综合证据代表了完整的激励源信息。最后,在声道和激励源特征之间进行了比较研究,描绘了这两个特征的独特性质。两种功能的组合比仅激励源信息的LID精度提高了10.01%。该观察结果表明激励源信息对于LID任务的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号