首页> 外文会议> >A neural fuzzy training approach for continuous speech recognition improvement
【24h】

A neural fuzzy training approach for continuous speech recognition improvement

机译:用于连续语音识别改进的神经模糊训练方法

获取原文

摘要

A novel training method for phoneme identification neural networks, called a neural fuzzy training method, is proposed. The difference between the proposed method and the conventional method is that the target values of each training sample are given as fuzzy phoneme class information instead of discrete phoneme class information. In the conventional training method, the target values are defined as 0s or 1s. However, in the proposed method, the target values are defined as likelihoods to phoneme classes in between 0 and 1. This likelihood is computed by a likelihood transformation function according to the distance between the input sample and its nearest sample belonging to each phoneme class in the training set. The effectiveness of the proposed method is shown by an 18-consonant identification experiment and a continuous speech recognition experiment using the ATR isolated word and phrase database. Improvements can be observed in every experiment, particularly on the continuous speech recognition results.
机译:提出了一种新的音素识别神经网络训练方法,称为神经模糊训练方法。所提出的方法与常规方法之间的区别在于,每个训练样本的目标值是作为模糊音素类别信息而不是离散音素类别信息给出的。在常规的训练方法中,目标值被定义为0或1。但是,在提出的方法中,目标值被定义为介于0和1之间的音素类别的似然度。该似然度是根据输入样本与其最接近的样本中属于每个音素类别的距离之间的距离,由似然度转换函数计算得出的训练集。通过使用ATR隔离的单词和短语数据库进行18辅音识别实验和连续语音识别实验,证明了该方法的有效性。在每个实验中都可以观察到改进,特别是在连续语音识别结果上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号