首页> 外文会议>International Speech Communication Association >A Comparison of Subspace Feature-Domain Methods for Language Recognition
【24h】

A Comparison of Subspace Feature-Domain Methods for Language Recognition

机译:子空间特征域方法对语言识别的比较

获取原文

摘要

Compensation of cepstral features for mismatch due to dissimilar train and test conditions has been critical for good performance in many speech applications. Mismatch is typically due to variability from changes in speaker, channel, gender, and environment. Common methods for compensation include RASTA, mean and variance normalization, VTLN, and feature warping. Recently, a new class of subspace methods for model compensation have become popular in language and speaker recognition-nuisance attribute projection (NAP) and factor analysis. A feature space version of latent factor analysis has been proposed. In this work, a feature space version of NAP is presented. This new approach, NAP, is contrasted with feature domain latent factor analysis (fLFA). Both of these methods are applied to a NIST language recognition task. Results show the viability of the new NAP method. Also, results indicate when the different methods perform best.
机译:由于不同的火车和测试条件,对抗搏击功能的补偿对于许多语音应用中的良好性能至关重要。不匹配通常是由于扬声器,渠道,性别和环境变化的可变性。用于补偿的常用方法包括Rasta,均值和方差标准化,VTLN和功能翘曲。最近,新类别的模型补偿方法已经流行了语言和扬声器识别 - 滋扰属性投影(NAP)和因子分析。提出了一个特征空间版本的潜在因子分析。在这项工作中,呈现了一个功能空间版本。这种新方法NAP与特征域潜在分析(FLFA)形成对比。这两种方法都适用于NIST语言识别任务。结果显示了新的午睡方法的可行性。此外,结果表明不同方法何时表现最佳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号