首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >ON THE IMPORTANCE OF ANALYTIC PHASE OF SPEECH SIGNALS IN SPOKEN LANGUAGE RECOGNITION
【24h】

ON THE IMPORTANCE OF ANALYTIC PHASE OF SPEECH SIGNALS IN SPOKEN LANGUAGE RECOGNITION

机译:论语音信号分析阶段的重要性语言识别中的重要性

获取原文

摘要

In this paper, we study the role of long-time analytic phase of speech signals in spoken language recognition (SLR) and employ a set of features termed as instantaneous frequency cepstral coefficients (IFCC). We extract IFCC from long-time analytic phase, in an effort to capture long range acoustic features from speech signals. These features are used in combination with the traditional shifted delta cepstral coefficients (SDCC) for SLR. As the SDCC are extracted from spectral magnitude and IFCC are from analytic phase, they characterize long-time information of speech in different ways. The experiments conducted with NIST LRE 2017 task reveals the complementary effects of IFCC features to SDCC and deep bottleneck (DBN) features. The fusion of IFCC with SDCC/DBN features delivered relative improvements of 23.23% and 16.78% in average equal error rate over the SDCC and DBN features, respectively, indicating the benefits of information from analytic phase in SLR.
机译:在本文中,我们研究了语音信号的长时间分析相位在口语识别(SLR)中的作用,并使用一组称为瞬时频率谱系数(IFCC)的特征。我们从长时间分析阶段提取IFCC,以捕获来自语音信号的长距离声学特征。这些特征与SLR的传统移位Delta谱系齐系数(SDCC)组合使用。随着SDCC从光谱幅度提取,IFCC来自分析阶段,它们以不同方式表征了语音的长期信息。使用NIST LRE 2017任务进行的实验揭示了IFCC功能对SDCC和深瓶颈(DBN)特征的互补影响。具有SDCC / DBN特征的IFCC的融合分别在SDCC和DBN特征上分别在平均相等的误差速率下提供了23.23%和16.78%的相对改善,表明信息来自SLR中的分析相位的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号