首页> 外文会议>European Conference on Speech Communication and Technology v.2; 20010903-20010907; Aalborg; DK >MAXIMUM-LIKELIHOOD AFFINE CEPSTRAL FILTERING (MLACF) TECHNIQUE FOR SPEAKER NORMALIZATION
【24h】

MAXIMUM-LIKELIHOOD AFFINE CEPSTRAL FILTERING (MLACF) TECHNIQUE FOR SPEAKER NORMALIZATION

机译:扬声器标准化的最大似然仿制药倒谱滤波(MLACF)技术

获取原文
获取原文并翻译 | 示例

摘要

We present a novel technique of minimizing the acoustic variability of speakers by transforming the features extracted from the speaker's data to better fit the recognition model. The concept of maximum-likelihood affine cepstral filtering (MLACF) will be introduced for feature transformation, along with solutions for the transformation parameters that maximize the likelihood of the test data with respect to a given recognition model. It is shown that for log-concave distributions, the solution of the MLACF problem can be obtained using convex programming. HMM-based digit recognition on the TIDIGITS database is presented to demonstrate the flexibility of the transformation in compensating for large acoustic mismatches between the speakers in the training and test database. In addition, it will be shown that the technique requires estimation of far fewer transformation parameters compared to existing techniques, thus allowing fast, real-time compensation.
机译:我们提出了一种新颖的技术,通过转换从说话者数据中提取的特征以更好地拟合识别模型,将说话者的声音变异性降至最低。将针对特征变换引入最大似然仿射倒谱滤波(MLACF)的概念,以及针对给定识别模型最大程度地提高测试数据可能性的变换参数解决方案。结果表明,对于对数凹面分布,可以使用凸规划获得MLACF问题的解。展示了TIDIGITS数据库上基于HMM的数字识别,以演示转换的灵活性,以补偿培训和测试数据库中扬声器之间的巨大声学失配。另外,将表明,与现有技术相比,该技术需要估计的变换参数要少得多,从而可以进行快速,实时的补偿。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号