A neural fuzzy training approach for continuous speech recognition improvement


Abstract

A novel training method for phoneme identification neural networks, called the neural fuzzy training method, is proposed. It differs from the conventional method in that the target values of each training sample are given as fuzzy phoneme class information rather than discrete phoneme class information. In the conventional training method, each target value is either 0 or 1. In the proposed method, the target values are likelihoods of the phoneme classes, lying between 0 and 1. Each likelihood is computed by a likelihood transformation function from the distance between the input sample and its nearest sample belonging to the corresponding phoneme class in the training set. The effectiveness of the proposed method is shown by an 18-consonant identification experiment and a continuous speech recognition experiment using the ATR isolated word and phrase database. Improvements are observed in every experiment, particularly in the continuous speech recognition results.
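
The target construction described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the abstract does not give the likelihood transformation function or the distance measure, so the exponential decay `exp(-alpha * d)`, the Euclidean distance, the parameter `alpha`, and the function names and toy feature vectors are all assumptions made here for illustration.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def likelihood(distance, alpha=1.0):
    """Hypothetical likelihood transformation: maps a distance >= 0 into (0, 1].

    The abstract does not specify this function; exponential decay is an assumption.
    """
    return math.exp(-alpha * distance)

def fuzzy_targets(sample, training_set, classes, alpha=1.0):
    """Return one target value per class for `sample`.

    training_set is a list of (feature_vector, class_label) pairs.
    For each class, the distance to the nearest training sample of that
    class is converted to a likelihood. A class with no samples gets 0.0.
    """
    targets = []
    for c in classes:
        nearest = min(
            (euclidean(sample, x) for x, label in training_set if label == c),
            default=math.inf,
        )
        targets.append(likelihood(nearest, alpha))
    return targets

# Toy data: three two-dimensional "feature vectors", one per consonant class.
training_set = [((0.0, 0.0), "p"), ((1.0, 0.0), "t"), ((0.0, 2.0), "k")]
classes = ["p", "t", "k"]

# Fuzzy targets for the training sample labeled "p".
targets = fuzzy_targets((0.0, 0.0), training_set, classes)

# Conventional discrete targets for the same sample, for comparison.
one_hot = [1.0 if c == "p" else 0.0 for c in classes]
```

Because the sample is itself a member of the training set, its distance to its own class is 0 and its own-class target is 1, while the other classes receive values between 0 and 1 that shrink as their nearest sample lies farther away. Under these assumptions the conventional 0/1 targets are the limiting case of a very large `alpha`.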


Bibliographic record

Source
IEEE International Conference on Acoustics, Speech, and Signal Processing | 1992 | 4 pages
Authors
Komori Y.; Institute of Electric and Electronic Engineer;
Original format
PDF
Chinese Library Classification
Automation technology, computer technology

Similar literature

1. Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies [J]. Cui Xiaodong, Zhang Wei, Finkler Ulrich. IEEE Signal Processing Magazine. 2020, No. 3

2. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J]. Yan-Hui Tu, Jun Du, Chin-Hui Lee. Journal of signal processing systems for signal, image, and video technology. 2018, No. 7

3. A comparative study of fuzzy evolutionary techniques for footprint recognition and performance improvement using wavelet-based fuzzy neural network [J]. V. Devadoss Ambeth Kumar, M. Ramakrishnan. International Journal of Computer Applications in Technology. 2013, No. 2

4. A neural fuzzy training approach for continuous speech recognition improvement [C]. Komori, Y. 1992

5. Wavelet transform approach for adaptive filtering with application to fuzzy neural network based speech recognition [D]. Jung, Byung-Chul. 2001

6. Auditory training of speech recognition with interrupted and continuous noise maskers by children with hearing impairment [O]. Jessica R. Sullivan, Linda M. Thibodeau, Peter F. Assmann

7. Size matters: An empirical study of neural network training for large vocabulary continuous speech recognition [O]. Ellis Daniel P. W., Morgan Nelson. 1999

8. Use of Computer Speech Understanding in Training: A Preliminary Investigation of a Limited Continuous Speech Recognition Capability [R]. Porter, J. E., Grady, M. W., Hicklin, M. B. 1977
