首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Artificial stereo data generation for speech feature mapping
【24h】

Artificial stereo data generation for speech feature mapping

机译:用于语音特征映射的人工立体声数据生成

获取原文

摘要

Feature mapping technique is widely used to eliminate the mismatch between the training and test conditions of speech recognition. In the feature mapping, a target (mismatched) feature vector sequence is mapped closer to the corresponding reference (matched) feature vector stream. The training of the mapping system is usually carried out based on a set of stereo data which consists of simultaneous recordings obtained in both the reference and target conditions. In this paper, we propose a novel approach to blind parameter estimation which does not require the reference feature vectors. The proposed approach is motivated by the hidden Markov model (HMM)-based speech synthesis algorithm.
机译:特征映射技术被广泛用于消除语音识别的训练和测试条件之间的不匹配。在特征映射中,将目标(不匹配)特征向量序列映射为更靠近相应的参考(匹配)特征向量流。映射系统的训练通常基于一组立体声数据进行,该立体声数据由在参考条件和目标条件下同时获得的记录组成。在本文中,我们提出了一种新颖的盲参数估计方法,该方法不需要参考特征向量。所提出的方法是基于基于隐马尔可夫模型(HMM)的语音合成算法的启发。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号