首页> 外文会议> >Using linear models of speech trajectory in the reconstructed phase space to extract useful features for speech recognition system
【24h】

Using linear models of speech trajectory in the reconstructed phase space to extract useful features for speech recognition system

机译:使用重构相空间中语音轨迹的线性模型来提取语音识别系统的有用特征

获取原文
获取原文并翻译 | 示例

摘要

In this paper, a new speech feature extraction method is proposed to improve the performance of speech recognition systems. This method is based upon the modeling of speech trajectory, taking advantage of multivariate autoregressive (MVAR) method, and application of some linear transformation methods which are needed for the dimension reduction purposes such as linear discriminant analysis (LDA), heteroscedastic LDA (HLDA), and locality preserving projection (LPP). Since the reconstructed phase space (RPS) is a proper domain to represent true dynamics of chaotic signal, it is utilized to produce the trajectory of speech signal in a high dimension space. In addition, the mentioned linear transform techniques are used to decorrelate and reduce the dimension of final RPS-MVAR feature vectors. Our experimental results show that overall system with the proposed features achieved 9.5% absolute improvement of phoneme accuracy compared to the baseline features in the clean condition.
机译:为了提高语音识别系统的性能,本文提出了一种新的语音特征提取方法。该方法基于语音轨迹建模,利用了多元自回归(MVAR)方法,并应用了一些用于降维目的的线性变换方法,例如线性判别分析(LDA),异方差LDA(HLDA) ,以及位置保留投影(LPP)。由于重构的相空间(RPS)是表示混沌信号真实动态的合适域,因此可用于在高维空间中生成语音信号的轨迹。另外,所提及的线性变换技术用于去相关并减小最终RPS-MVAR特征向量的维数。我们的实验结果表明,与纯净条件下的基线特征相比,具有拟议特征的整个系统的音素准确度绝对提高了9.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号