首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >A neural fuzzy training approach for continuous speech recognition improvement
【24h】

A neural fuzzy training approach for continuous speech recognition improvement

机译:连续语音识别改进的神经模糊训练方法

获取原文

摘要

A novel training method for phoneme identification neural networks, called a neural fuzzy training method, is proposed. The difference between the proposed method and the conventional method is that the target values of each training sample are given as fuzzy phoneme class information instead of discrete phoneme class information. In the conventional training method, the target values are defined as 0s or 1s. However, in the proposed method, the target values are defined as likelihoods to phoneme classes in between 0 and 1. This likelihood is computed by a likelihood transformation function according to the distance between the input sample and its nearest sample belonging to each phoneme class in the training set. The effectiveness of the proposed method is shown by an 18-consonant identification experiment and a continuous speech recognition experiment using the ATR isolated word and phrase database. Improvements can be observed in every experiment, particularly on the continuous speech recognition results.
机译:提出了一种用于音素识别神经网络的新型训练方法,称为神经模糊训练方法。所提出的方法和传统方法之间的差异是每个训练样本的目标值作为模糊音素类信息而不是离散音素类信息。在传统的训练方法中,目标值定义为0s或1s。然而,在所提出的方法中,目标值被定义为位于0和1之间的音素类别的似然性。根据输入样本与其属于每个音素类的最近样本之间的距离来计算这种可能性的似然转换功能训练集。所提出的方法的有效性由18辅音识别实验和使用ATR隔离字和短语数据库的连续语音识别实验示出。在每个实验中可以观察到改进,特别是在连续语音识别结果上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号